Skip to main content
BMC Nursing logoLink to BMC Nursing
. 2020 Apr 13;19:23. doi: 10.1186/s12912-020-00419-9

Hospital survey on patient safety culture (HSOPSC): a multi-method approach for target-language instrument translation, adaptation, and validation to improve the equivalence of meaning for cross-cultural research

Patrick A Palmieri 1,2,3,4,, Juan M Leyva-Moral 5,6,7, Doriam E Camacho-Rodriguez 7,8, Nina Granel-Gimenez 5, Eric W Ford 9, Kathleen M Mathieson 2, Joan S Leafman 2
PMCID: PMC7153229  PMID: 32308560

Abstract

Background

The Hospital Survey on Patient Safety Culture (HSOPSC) is widely utilized in multiple languages across the world. Despite culture and language variations, research studies from Latin America use the Spanish language HSOPSC validated for Spain and the United States. Yet, these studies fail to report the translation method, cultural adaptation process, and the equivalence assessment strategy. As such, the psychometric properties of the HSOPSC are not well demonstrated for cross-cultural research in Latin America, including Peru. The purpose of this study was to develop a target-language HSOPSC for cross-cultural research in Peru that asks the same questions, in the same manner, with the same intended meaning, as the source instrument.

Methods

This study used a mixed-methods approach adapted from the translation guideline recommended by Agency for Healthcare Research and Quality. The 3-phase, 7-step process incorporated translation techniques, pilot testing, cognitive interviews, clinical participant review, and subject matter expert evaluation.

Results

The instrument was translated and evaluated in 3 rounds of cognitive interview (CI). There were 37 problem items identified in round 1 (14 clarity, 12 cultural, 11 mixed); and resolved to 4 problems by round 3. The pilot-testing language clarity inter-rater reliability was S-CVI/Avg = 0.97 and S-CVI/UA = 0.86; and S-CVI/Avg = 0.96 and S-CVI/UA = 0.83 for cultural relevance. Subject matter expert agreement in matching items to the correct dimensions was substantially equivalent (Kappa = 0.72). Only 1 of 12 dimensions had a low Kappa (0.39), borderline fair to moderate. The remaining dimensions performed well (7 = almost perfect, 2 = substantial, and 2 = moderate).

Conclusions

The HSOPSC instrument developed for Peru was markedly different from the other Spanish-language versions. The resulting items were equivalent in meaning to the source, despite the new language and different cultural context. The analysis identified negatively worded items were problematic for target-language translation. With the limited literature about negatively worded items in the context of cross-cultural research, further research is necessary to evaluate this finding and the recommendation to include negatively worded items in instruments. This study demonstrates cross-cultural research with translated instruments should adhere to established guidelines, with cognitive interviews, based on evidence-based strategies.

Keywords: Hospital survey on patient safety culture, Patient safety, Organizational culture, Instrument translation, Instrument adaptation, Instrument validation, Cross cultural research, Agency for Healthcare Research and Quality, Spanish, Peru

Background

As many as 16% of hospitalized patients experience a medial error during their health encounter [1]. Hospital culture and patient safety are interrelated as organizational failures and system driven errors contribute to the unintended events that produce poor quality outcomes [2]. Prominent international agencies, such as the World Health Organization (WHO) and the United States Agency for Healthcare Research and Quality (AHRQ), recommend hospitals address this problem by improving their organizational cultures [3]. As such, health services researchers are interested in the intersection of organizational culture with patient safety [4, 5]. Safety culture can be defined as the overarching but emergent healthcare property where professional attitudes and work climates result in the degree of system reliability and resilience to adverse outcomes [6].

Clinical quality and patient safety outcomes are linked to the organizational culture dimensions that can be measured with safety culture instruments for hospitals [7]. Safety culture measures are correlated with employee performance (e.g. safety behavior), process and system errors, and accident rates across industries [8] and cultures [9] with similar results from the health sector [10]. For example, researchers using the Hospital Survey on Patient Safety Culture (HSOPSC) have reported strong correlations between safety culture, adverse event frequency, and patient outcomes [11, 12]. For these reasons, the measurement of safety culture has become the prerequisite for continuous quality improvement efforts to provide leaders with the essential feedback that stimulates organizational improvement [1315].

In developing countries basic hospital safety indicators are largely incomplete or unavailable [1618]. The sparse available data indicates adverse event prevalence in South American countries, including Peru [1921], is much higher than developed countries [22, 23]. Patient safety is an emerging focus in the Peruvian health sector, as demonstrated by the lack of basic safety programs, processes, and practices [24]. The HSOPSC is a validated tool to measure the effectiveness of work environment and organizational process associated with preventing the types of errors linked to consequential adverse events [25]. When administered yearly, the HSOPSC can provide leaders with a proxy measurement for the effectiveness of their quality improvement efforts focused on achieving patient safety [13].

Early instruments to measure safety culture were developed in the late 1980’s and into early 1990’s [14] but did not gain popularity due to poorly performing psychometric properties [15]. In a seminal review of contemporary instruments [26], multiple safety culture instruments were identified but varied considerably in the general characteristics, dimensions covered, and psychometric properties. Furthermore, the items, and associated dimensions, were not derived from a theoretically constructed framework [13]. In response to the need for a standard instrument to measure safety culture, the AHRQ commissioned the development of a psychometrically valid and reliable instrument [2729], called the HSOPSC.

The HSOPSC, developed in English [28] for use in American hospitals, has demonstrated excellent psychometric properties [27]. The instrument has been disseminated to other countries, in multiple languages [8, 24, 3036]; however, published studies generally neglect to report the instrument translation method and validation data. When provided, the information is either limited or poor quality such as reporting a simple post hoc psychometric analysis. In this regard, a Spanish language version of the instrument is available for use with Spanish speaking health workers in the United States [37]. This version has also been used in Mexico [38, 39], Colombia [40], and most recently Peru [24]. Also, a different Spanish-language version of the instrument [41] was developed for use in Spain [42]. However, studies applying either of these Spanish language instruments for cross-culture research do not report the validation method and provide limited psychometric data about reliability. According to the basic recommended translation strategies published by the AHRQ [43], there is no validated Spanish language version of the HSOPSC reported in the literature for cross-cultural research in Peru, as well as in other countries in Latin America. As the Spanish language differs between countries and the meaning of words differs across cultures, a validated Spanish language version of the HSOPSC needs to be developed for cross-cultural research in Peru, that confirms construct applicability and survey item integrity [44, 45].

Researchers working with Spanish speaking populations [46, 47] have reported the traditional forward-reverse translation technique does not result in a valid target-language instrument [48, 49]. However, the item meanings, dimension integrity, and construct validity need to remain constant regardless of the language across cultures [50], with an attention to eliminating bias and maximizing equivalence [51]. In order to study patient safety culture in a global perspective, the HSOPC needs to be translated from the source language (original language) into the target language (translated language) without losing the meaning [52] and context of the items and associated scales and/or dimensions during the translation process [51]. As such, the purpose of this study was to test a mixed method approach for target-language instrument translation to produce a valid HSOPSC translation for cross-cultural research in Peru.

Developing a psychometrically sound target-language instrument for cross-cultural research from an English-language source, originally validated for use in the United States, is not straightforward [53]. In cross-cultural research, the assumption that all instruments will automatically be equivalent across groups does not hold [54]. However, instruments that are products of their local environments at the time they were developed are likely to be more reliable [55]. Yet, researchers consistently fail to describe how the HSOPSC was translated and validated [56] for cross-cultural comparability and compatibility [57]. For cross-cultural research, the technical and semantic equivalence and cultural relevance of each item needs to be evaluated prior to data collection [5860]. Without a sufficient pre-data collection evaluation to ensure the correct representation of the instrument [60], the resulting factor analyses post-data collection will be flawed and less rigorous [56].

Methods

Despite the increased adaptation of English language instruments for cross-cultural research, there is not consensus about the gold standard method for instrument translation and validation, including cultural adaptation [56, 61, 62]. However, proper cross-cultural research with instrument translation generally requires multiple qualitative and quantitative methods and techniques [55, 56, 61, 63], including feedback questionnaires, pilot testing, expert panels, and cognitive interviews [6365]. As such, using an iterative mixed method approach to translate instruments [46], with cognitive interviews [57, 66] ensures the resulting instrument will have equivalence [51]; one that asks the same questions, in the same manner, with the same intended meaning, as the source instrument.

With globalization in the context of cross-cultural research, evidence-based methods are required to produce equivalent target-language instrument translations from the sources [52, 67]. As such, this is a mixed-method approach adapted from the translation guidelines recommended by AHRQ [43, 68, 69] with reference to other best practices and strategies [63, 7072]. The approach adhered to the adapted version of the equivalence criteria for cross-cultural research with instruments [50, 55, 63, 73], see Table 1. The AHRQ provided written permission by email (CRM:00350304) to use the HSOPSC in English and Spanish for this research project. In addition, permission was granted to publish the resulting instrument from this study as well as the existing Spanish and English HSOPSC instruments.

Table 1.

Equivalence Criteria for Cross-Cultural Research with Instruments

Criteria Definition Process
Content Equivalence The content of each item of the instrument is relevant to the phenomena of each culture being studied.

— Research Team Experts.

— Clinical Practice Experts.

— Subject Matter Experts.

— Content validity index score.

— Annotated survey dimension document.

Semantic Equivalence The meaning of each item is the same in each culture after translation into the language and idiom (written or oral) of each culture.

— Translation guide from AHRQ.

— Qualified / experienced translators.

— Forward- and reverse-translation.

— Pilot test (cultural relevance & readability).

— Confirmation of translation: Cognitive interviews and expert reviews.

Technical Equivalence The method of assessment is comparable in each culture with respect to the data that it yields.

— Translation guide from AHRQ

— Experienced translators.

— Subject Matter Experts.

— Pilot test (evaluation scores).

— Cognitive interviews.

Criterion Equivalence The interpretation of the measurement of the variable remains the same when compared with the norm for each culture studied.

— Research Team Experts.

— Subject Matter Experts.

— Pilot test (evaluation scores).

— Cognitive interviews.

Conceptual Equivalence The instrument is measuring the same theoretical construct in each culture.

— Translation guide from AHRQ.

— Qualified / experienced translators.

— Forward- and reverse-translation.

— Item to dimension selection/match.

— Dual scoring process with content validity index and pilot test..

Adapted from Squires et al. [63], and Flaherty et al. [50]

Data collection

Per the recommendations for cross-cultural instrument research where the source country, culture, and language are different than the target [72], the data collection process required three phases, including: Instrument translation, Cultural-adaptation, content validation and equivalence [47, 72]. These phases included a forward and reverse translation, cognitive interviews, targeted participant review, structured pilot test, content validation, and expert evaluation of equivalence. The work completed in each of the phases are described next.

Participants

Phase 1, the translation process, was completed by three bilingual (English/Spanish) professionals, with at least an undergraduate degree in linguistics and extensive experience in translation and interpretation. Phase 2 included nine participants (2 nurses and 7 physicians), called clinical practice experts (CPEs), purposefully selected from licensed health professionals in Peru with current hospital work experience, fluency in Spanish, and advanced English skills as self-reported and observed by the principal researcher during a brief interview. Phase 3, included 7 participants (4 nurses, 2 physicians, 1 pharmacist), called subject matter experts (SMEs), purposefully selected as recommended by Grant and Davis [74], intermediate to advanced English was required. All participants verbally consented to participate in this study approved by the A.T. Still University Institutional Review Board (Protocol #01146).

Instrument HSOPSC

The HSOPSC is a 42-item instrument grouped into 12 composite dimensions and nine non-dimensional items, including two safety assessment items and seven demographic items [69]. Each dimension is represented by three to five items measured with a 5-point Likert scale to ascertain agreement (strongly disagree to strongly agree) or frequency (never to always). The instrument includes 59.52% positively worded items and 48% negatively worded items. The outcome measures are the two single-item responses inquiring about the number of events reported within the past 12 months and the overall patient safety score (excellent to failing). For the reported error questions, errors are defined as any type, without regard to harm.

Instrument translation – phase 1

Step 1: instrument review and translator selection

The source American English version of the HSOPC was reviewed by two bilingual healthcare experts with the primary investigator to establish the content equivalence, or the relevance and sensitivity [50, 75]. Although some challenging terms and phrases were noted in items, through discussion, the experts determined the instrument content was generally adaptable for translation and implementation in Peru. As such, the instrument was sent for translation without changes [74].

Step 2: forward translations with synthesis

The instrument was independently forward-translated from American English to Peruvian Spanish by two translators [76, 77]. Then, the two versions were synthesized and consolidated into a single instrument [75] by the two healthcare experts and the primary investigator, all knowledgeable about the instrument properties and scientific foundation. The translators were involved in this process specifically for grammar and language clarifications.

Step 3: reverse translation with reconciliation

The goal of the reverse-translation process was to establish a conceptual rather than a literal “word-by-word” meaning [78]. An independent bilingual translator reverse-translated the synthesized instrument from Peruvian Spanish to American English. Then, the reverse-translated instrument was examined, in comparison to the original instrument, by the two healthcare experts and the primary investigator. Although discrete differences were identified in the reverse-translation, these were primarily related to nearly equivalent verb selections. There were five instances where phrases required clarification to achieve the same meaning in Spanish as expressed in English. However, these clarifications were noted for discussion during the cognitive interview phase. Again, the translator provided clarifications specific to grammar and language.

Cultural-adaptation – phase 2

Although a pilot test with a bilingual version of the instrument has been recommended to determine the equivalence between the newly translated and the original instruments [79], there were not enough bilingual participants readily accessible for this study. Alternatively, this study used a pretest to review and refine the instrument through a series of three rounds. The cognitive interviews followed each pretest to assesses the four stages of the participant engagement in completing the instrument, including: Comprehension, retrieval, judgement, and responding [80]. The procedure is described below.

Step 4. Pre-test

Nine CPEs engaged in the pilot testing with three CPE per round across three sequential rounds. First, the CPEs completed the instrument without interruption. Then, the CPEs were asked for their general perspective about the instrument. Finally, all CPEs were asked to complete the new instrument and for a final assessment for item clarity and cultural equivalence. The assessment required each CPE to respond to each item and then rank the item on a five-point Likert scale for language clarity (5 – completely readable and understandable, 4 – mostly readable and understandable, 3 – readable and understandable, 2 – somewhat readable and somewhat understandable; and 1 – not readable and understandable) and cultural equivalence (5 – completely culturally relevant, 4 – mostly culturally relevant, 3 – culturally relevant, 2 – somewhat culturally relevant, 1 – not culturally relevant). This coding schematic was necessary to identify the aggregate level of item clarity and cultural relevance across participants. The assessment data was entered into an Excel database for the analysis. The item specific scores guided the primary focus of the first round of cognitive interviews; scores less than four were considered opportunities for improvement.

Step 5. Cognitive interviews

Once the data entry was completed, the primary investigator conducted a cognitive interview with each CPE [69, 81], through a structured but open-ended discussion, to understand how participants read, comprehended, and responded to each item [57]. A cognitive interview script guided the content probes for additional evaluation of items scoring three or less in the first round and items scoring four or less in the subsequent rounds. This probative process was constructed to identify translation deficiencies by asking CPEs to describe their understanding of the item in Spanish and then again after considering the original question in English. Furthermore, the other two Spanish HSOPSC versions, United States [37] and Spain [41], were incorporated into the process as resources to review, discuss, analyze, and refine problematic items. The CPEs were also asked to identify items with unfamiliar or inappropriate grammar and syntax. Cognitive interview data was thematically coded to identify problematic items [81]. Notes were collected and compiled throughout the process.

Step 6. Research team review with item revision

The data from the pilot testing for each round was individually and aggregately reviewed by the research team with the primary focus on improving items with ratings less than four. The notes from the cognitive interviews were also referenced when reviewing each item. Revisions for problems with items was completed to improve the item performance in the next round. The translators from the forward- and reverse-translation were consulted with specific questions about word selections and phrase clarification. All revisions were noted in the Excel spreadsheet to provide a record of the progressive evolution of each item across rounds. Each round was conducted sequentially and distinctly but all items were compared between and across rounds.

Content validation and equivalence evaluation – phase 3

Step 7a. Subject matter expert evaluation of content validity

The SMEs were provided the final instrument for review with a content validity questionnaire [8287]. Each item in the preliminary final instrument was rated on a 4 point-Likert scale (1-irrelevant, 2-little relevance; 3-relevant; and 4-extremely relevant). In addition, the item evaluation used during the pilot testing for language clarity (readability/understandability) and cultural relevance (context) was incorporated with dichotomous acceptable or unacceptable scoring. Finally, there was a space for open ended comments for each item and at the end of the questionnaire. The content validity index (CVI) and the item evaluation for language and context was calculated per item [83]. For analysis, the rankings were merged into two categories as items scored one and two were considered irrelevant and items scored three and four were considered relevant [82]. Then, the CVI was calculated individually for each item (proportion of experts rating the item as relevant). The items were then calculated as part of the twelve dimensions, as well as for the total instrument as the mean CVI [86]. A significant CVI for this process was set, a priori, at 0.70 or greater [82, 85, 86].

Step 7b. Subject matter expert evaluation of equivalence

The SMEs were provided with the final items from the instrument and asked to indicate which items are associated with which dimensions. Each expert was required to select only one of the twelve dimensions for each item, but they were also asked to indicate a secondary dimension, with a short rational, in the case they were unable to easily decide the dimension for the item. Through this process, the equivalence can be established. The Kappa value was calculated for the SME agreement in identifying each item with the correct dimension. The minimally acceptable Kappa value was set to 0.40 [8890] as values below this point are not considered robust [91, 92]. According to the scale of Landis & Koch [93], the strength of the Kappa coefficients were defined in groupings (0.01–0.20 slight; 0.21–0.40 fair; 0.41–0.60 moderate; 0.61–0.80 substantial; 0.81–0.99 almost perfect, and 1.00 perfect).

Results

Sample characteristics

The pre-tests with cognitive interviews were completed with nine Peruvian health care professionals (6 males and 3 females) across three rounds with three different participants per round. The mean participant age was 44.6 years (± 6.7) and the mean experience level was 16.7 (± 6.5). In the previous five years, all participants worked in private hospitals (known as ‘clínicas’ in Peru), six also worked in social security hospitals (Social Health Insurance of Peru), and five also worked in public hospitals (Ministry of Health of Peru). All participants reported working between two to four different jobs, either part-time, full-time, and/or contracted. All participants were advanced to fluent in English (6 advanced, and 3 fluent) and all participants were fluent in Peruvian Spanish (native language). Table 2 summarizes the demographic and professional characteristics.

Table 2.

Participant Characteristics for Cognitive Interviews (N = 9)

Mean Age (SD) 44.6 (6.7)
Male 6 (55.6)
Health Professions Specialty
 Internal medicine 3 (33.3)
 Gynecology medicine 2 (22.2)
 Anesthesiology 1 (11.1)
 Pediatric medicine 1 (11.1)
 Intensive care nurse 1 (11.1)
 Medical-surgical nurse 1 (11.1)
Mean Years of Experience (SD) 16.7 (6.5)
Hospital Setting a
 Clinica 9 (100)
 EsSalud 6 (66.7)
 MINSA 5 (55.7)
English Comprehension
 Advanced 6 (66.7)
 Fluent 3 (33.3)

Note: Values are expressed as N (%) unless otherwise noted

a Participants could indicate experience in more than one hospital setting

Pilot testing and cognitive interviews

The pilot testing of the instrument required on average 18 min (range 15 to 22 min), while the cognitive interviews required on average 50 min (range 40 min to 85 min). From the item evaluations, the participants identified multiple problem items: Round 1 there were 37 problem items (14 unique to language clarity, 12 specific to cultural, and 11 mixed); Round 2 there were 14 problematic items (6 specific to language clarity, 5 specific to cultural, and 3 mixed); and Round 3 there were four problematic items (2 specific to language clarity, 2 specific to cultural, and 0 mixed). The type, frequency, and the mix of issues improved progressively across rounds. In the aggregate analysis for each round, the language clarity issues were the most common with the lowest point percentage (Round 1 = 77.4%) followed by cultural relevance (Round 1 = 81.3%). The analysis evidenced although the initial translation was adequate from a linguistics perspective, there were opportunities to improve the clarity and relevance from the targeted population perspective [94, 95]. With focused item revisions between rounds, the cumulative item measurements for language clarity and cultural relevance improved by 10% in round 2 (79.4 to 89.2%) and improved 5% from round 2 to round 3 (89.2 to 94.5%). From the interviews, participants identified problems understanding negatively worded items. However, the participant feedback resulted in improvements to all negative items for the final version of the instrument. Table 3 summarizes the aggregate item analysis for all scores.

Table 3.

Participant Item Rating (Aggregate Raw Score by Measure Type by Round)

Testing Round Language Claritya Cultural Relevancea Totalb
Round 1 201.3 (77.4%) 211.3 (81.3%) 412.7 (79.4%)
Round 2 227.3 (87.4%) 236.7 (91.0%) 464.0 (89.2%)
Round 3 243.0 (93.5%) 248.3 (95.5%) 491.3 (94.5%)

a Score calculation 51 items, 1–5 points per item, % is X points / 255 total

b Score calculation 51 items, 1–5 points per item for each score, % is X points / 510 total

Overall, the cognitive interviews identified translation issues across rounds. For example, Round 1 had 33 problems (language clarity 42.5%, cultural 33.3%, and general design 24.2%) but this number decreased to only four problems in Round 3 (50.0, 25.0, and 25.0% respectively). Table 4 provides a summary of the item issues related to translation, context, and design. The remaining four issues identified in Round 3 improved from Round 1 as multiple participants initially rated these items as a one or two and, in the end, only a single participant rated one item as a three. The participant response for the sole issue in Round 3 was reportedly based on a negative impression about the item versus a lack of understanding or concern about cultural relevance. At the conclusion of the three-phase process, the expert panel believed each item was acceptable in terms of readability, comprehension, and cultural relevance. Table 5 presents examples of the language clarity issues identified by the participants. Regarding the cultural issues, the predominant problems were related to defining definitions of concepts or seemingly general knowledge across cultures, with two specific causes: 1) Organizational constructs that differed between the United States and Peru; and 2) References to health system constructs in the United States that are not present or not well known to professionals in Peru. Table 6 presents examples of the cultural relevance issues identified by the participants.

Table 4.

Problems Identified from Cognitive Interviews (Number and Percent)

Round a Language Clarity Cultural Relevance Design Total
Round 1 14 (42.5%) 11 (33.3%) 8 (24.2%) 33 (100%)
Round 2 7 (43.8%) 5 (31.2%) 4 (25.0%) 16 (100%)
Round 3 2 (50.0%) 1 (25.0%) 1 (25.0%) 4 (100%)

a Number reported per identified problem per item regardless of frequency per item

Table 5.

Examples of Language Clarity Issues Identified from Cognitive Interviews

Translation issue Item
content
Illustrative
example
Spanish words do not convey intended construct

Problem:

Crisis mode or “modo de crisis”

Description of issue: “Crisis mode” translates to “modo de crisis” en other Spanish speaking countries. The concept exists in Peru but translates differently.

English question: We work in “crisis mode” trying to do too much, too quickly.

Initial translation: Trabajamos en “modo de crisis”, tratando de hacer demasiado y muy rápidamente.

Final translation: Trabajamos bajo presión intentando realizar demasiadas cosas muy rápidamente.

Problem:

Perden or to lose for “falls through the cracks”

Description: “Falls between the cracks” translates into Spanish as “perden” or to lose things which is slightly different in meaning for participants.

English question: Things “fall between the cracks” when transferring patients from one unit to another.

Initial translation: Las cosas se pierden cuando los pacientes son transferidos de una unidad a otra.

Final translation: La información y/o objetos de los pacientes se pierde cuando éstos se transfieren de una unidad a otra.

Spanish words are unfamiliar or have different meanings in some cultures or different nationalities

Problem:

Importante información for specific information important to patient safety

Description of issue: Information is a generic word in Spanish and the participants wanted further clarification about what information (e.g. documents or forms) is important.

English question: Important patient care information is often lost during shift changes.

Initial translation: Con frecuencia se pierde importante información sobre el cuidado del paciente durante los cambios de turno.

Final translation: Se pierde a menudo información importante de cuidado de pacientes durante cambios de turno.

Problem:

Linking the meaning of mistakes to chance through the phrase “por casualidad”

Description of issue: The phrase for chance in Peru exists but linking this to mistakes that occur in the hospital requires careful word order arrangement.

English question: It is just by chance that more serious mistakes don’t happen around here.

Initial translation: Aquí no suceden errores más serios sólo por casualidad.

Final translation: Es sólo por casualidad que errores más serios no suceden aquí.

Table 6.

Examples of Cultural Relevance Issues Identified from Cognitive Interviews

Cultural
issue
Item
content
Illustrative
example
Definition of concepts or knowledge differ across cultures or nationalities Problem: Different healthcare constructs in the provision of care between the United States and Peru

Description of issue: Temporary staff and professionals are not frequently utilized in Peruvian healthcare facilities.

English Question: We use more agency/temporary staff than is best for patient care.

Initial translation: Usamos más personal temporal o de agencia de lo conveniente para el cuidado del paciente.

Final translation: Usamos más personal de reemplazo o temporal de lo que es mejor para el cuidado del paciente.

Problem: Reference to errors and recording in personnel files

Description of issue: The concept of blame and recording of errors in a personal file is not clear in Peru.

English question: Staff worry that mistakes they make are kept in their personnel file.

Initial translation: Al personal le preocupaba que los errores cometidos permanezcan en sus expedientes personales.

Final translation: Cuando se comete un error, el personal teme que eso quede en su expediente.

Content validity and equivalence

From the seven SME invited to participate in the item content review, only six completed the entire process. After reviewing the items (N = 42) for clarity, the S-CVI/Avg (scale-level content validity index, average) was 0.97 and the S-CVI/UA (scale-level content validity index, universal agreement) was 0.86 with a total item agreement of 36 of 42 items (4 items at 0.83, and 2 items at 0.67) and for cultural relevance, the S-CVI/Avg was 0.96 and the S-CVI/UA was 0.83 with a total item agreement of 35 items (5 items = 0.83, 1 item = 0.67, and 1 item = 0.50). For the equivalence of the items to the dimension for the entire instrument, the Kappa value was 0.72, or substantially equivalent. In terms of the individual dimensions, only one of the twelve dimensions (dimension 2) had a Kappa value of 0.39, which is at the borderline of moderate and fair. The remaining dimensions performed well (7 = almost perfect to perfect, 2 = substantial, and 2 = moderate).

When aggregating the HSOPSC items, then averaging the readability and cultural relevance scores for the three participants for Round 1, there were 15 items (10 for readability and 5 for cultural relevance) which scored < 3.00. For these items, 87% were negatively worded questions (readability 8 of the 10 items and cultural relevance all 5 items). In Round 2, of the 15 identified issues (those < 3.00 in Round 1), only one negatively worded item remained problematic (average score = 2.67). In Round 3, the only two items receiving an individual score less than four on either readability or cultural by participants were negatively worded. Finally, cognitive analysis revealed three additional items, recommended by participants, which should be added to the instrument. These questions are specific to the reporting systems for errors and adverse events as the participants realized these systems are either not present or not widely utilized. This represents a cultural difference in the construction of the instrument that needs to be considered for interpretation with the single measurement item. Table 7 presents the additional questions identified by the participants, but not included in the final instrument.

Table 7.

Additional Questions Recommended by Participants

Rational for recommendation Proposed question
Question specific to the presence of a adverse event reporting system in the facility. There is another question specific to the reporting of adverse events but not one about the presence of the system. En su hospital funciona el Sistema de Notificación Hospitalario de eventos adversos.
Question specific to the frequency the person has reported an adverse event. Cuando se reporta un evento ¿con qué frecuencia es analizado?
Question specific to the rating of the patient safety in the hospital or clinic. There is another question specific to rating the area or unit. Por favor, dele a su HOSPITAL de trabajo una calificación promedio de acuerdo al grado de seguridad del paciente.

In addition to the translational, cultural, and general design issues, some navigational issues were identified during the cognitive interviews. For example, two of the three participants in Round 3 indicated they were confused about when to turn the instrument pages to answer additional items. Similarly, participants found it difficult to determine when they completed the entire instrument. Another participant indicated the choices for professions were not listed in a sensible “ranked” order and yet another participant indicated the demarcation for professional experience and years working in a facility were not clear. Finally, one participant in Round 2 and two participants in Round 3 commented that breaking item sections between pages was distracting and seemed to disrupt the instrument flow.

Discussion

Similar to Levin et al. [47], the item analysis revealed two types of translational issues: 1) Words which when translated from English to Spanish did not convey similar constructs; and 2) Phrases or specific words which when translated from English to Spanish were not familiar to the participants as they had different meanings across cultures and/or national borders. Through the cognitive interview process; however, the participants refined these issues into four distinct domains, including: 1) translation (“reads wrong”); 2) culture relevance (“don’t understand”); 3) general instrument design (“looks strange”); and 4) navigational issues with completing instrument (“where do I go” or “what do I do” next). The fourth category has not been reported in the literature but describes some problems associated with the format and flow of the instrument. Finally, the participants realized one item asked about patient safety outcomes. As such, they all recommended additional items focused on the types and quantities of adverse errors observed in practice or personally committed.

The general design issues identified during the cognitive probing process, can be classified into four categories: 1) Formal structures and process; 2) Physician work environment; 3) Professional domains; and 4) Terminology for the public and private systems. Although the first three categories were improved with relatively minor item adjustments, the final category necessitated an instrument design with words to satisfy the distinct vocabulary of private hospitals (called clínicas) and public hospitals (called hospitales). As such, the final instrument included specific words in combination such as “hospital/clinica” and “area/unidad” to satisfy the differences in vocabulary additional items”. Table 8 presents examples of the four categories of general design issues.

Table 8.

Examples of General Design Issues Identified from Cognitive Interviews

General design problem Illustrative example
Health system differences between countries

Description of issue: In Peru, there is not a formal adverse event and reporting system at every facility. As such, the concept is unfamiliar to many participants.

Revision: Addition of a question specific to whether or not the participant is aware of the presence of a formal error reporting system at their respective facility.

Work environment for physicians is different

Description of issue: In Peru, most physicians work at more than one facility. Also, they are usually employed full-time by the public health system (or semi-public) and then work as contractors in the private health system.

Revision: Incorporated a question to capture the number of hours worked at all health facilities in addition to their primary work facility in Peru.

Professional domains are different or absent

Description of issue: There are many professions in the United States and in Peru which are different. For example, registered nurses are present in both places but respiratory therapists do not exist in Peru.

Revision: Reconstruct the questions specific to professional roles and positions in facilities in Peru

Multiple concepts are different between the public and private health systems

Description of issue: Within Peru, there are differences between the reference terms for areas, departments, and even facilities.

Revision: For example, the public sector term for an acute care facility is “hospital” while the private sector term is “clinica.”

Additional step: Incorporate changes by crafting two instruments: 1) Instrument specific for public facilities; and 2) Instrument for private facilities but only with changes to distinct descriptions and terminology but without changes to the item meaning.

Description of issue: Within Peru, there are differences in job titles and descriptions between the public and the private health facilities.

Revision: For public system, use the appropriate terms and for the private system use the specific terms.

Additional step: Incorporate changes throughout the instrument for public facilities and then throughout the instrument for private facilities.

Across the cognitive interviews, participants consistently identified similar issues with the identified problematic items. For example, in Round 2 about 82% of the items rated by the participants as a three or less, at least two of the three participants identified the same or similar issue. Also, there were multiple items where the identification agreement highlighted items that the traditional forward- and reverse-translation process would have missed. For example, a cultural issue specific to asking nurses and physicians about work hours in their primary facility was captured by cognitive interviews. In Peru, nurses and physicians routinely work enough hours at two or three facilities to be equivalent to two full-time jobs in the United States. Often, professionals are employed full-time by the public health system (or semi-public) and then work full-time in the private health system. As such, the four participants suggested we needed to incorporate an additional question to capture the total number of hours worked at all facilities in addition to their primary work facility.

Most items translated were etic [52], or concepts that were universally transferable. These items had very good clarity and substantial cultural relevance, without or with quite minor modification. Then, there were some emic concepts that needed to be addressed by the research team. These are the items reflecting culturally specific concepts in meaning, such as ideas and behaviors, in the source language [52], but prove to be inequivalent with only translation [96]. For example, an English worded item, “We work in ‘crisis mode’ trying to do too much, too quickly” translated easily but the language did not capture the meaning of the item. Although ‘crisis mode’ translates easily to ‘modo de crisis’ in Spanish speaking countries, the concept translates different in many countries such as Peru. As such, the initial translation included the phrase ‘modo de crisis’ (Trabajamos en “modo de crisis”, tratando de hacer demasiado y muy rápidamente) required significant modification that did not include the phrase (Trabajamos bajo presión intentando realizar demasiadas cosas muy rápidamente).

Another example of a seemingly discrete issue was discovered with the word ‘chance’. Although ‘chance’ exists in Spanish, when ‘chance’ is linked in a phrase with the word ‘mistake’ the participants had difficulty understanding the context of the item. In English, the item is “It is just by chance that more serious mistakes don’t happen around here” and the initial translation was “Aquí no suceden errores más serios sólo por casualidad”. By rearranging the word order and using an underline to emphasis the ‘sólo por casualidad’ the final translation was “Es sólo por casualidad que errores más serios no suceden aquí”. In this case, the item score improved significantly from an average of a 2.3 to a 4.7 in the next round and the cognitive interviews generated no additional concerns.

With the final item analysis for grammar, syntax, and other issues, we discovered negatively worded items performed poorly, required multiple conversations during cognitive interviews, and generated disagreement between experts in panel discussions. With using only positively worded questions there is higher risk for acquiescence bias [9799]; however, the literature in this regard is primarily from English language countries and cultures. Similar to our finding about negatively worded questions, Solis-Salazar [100] reported combining positive and negative items can seriously damages the internal consistency of dimensions or subscales in Spanish language instruments.

Finally, this study has three limitations. First, the cognitive interviews required health professionals with advance to almost native levels of English comprehension. As such, these professionals might have more knowledge about safety culture than those without the same level of comprehension. In addition, the English language requirement narrowed the number of varied professionals to participate in the cognitive interviews. Second, this study assumes the etic approach to cross-cultural research. In this regard, the theoretical assumptions and operational constructs specific to safety culture were assumed to be translatable to Peru. However, the study method was a strength in providing evidence the language used to describe the assumptions and construct was transferable. Third, and last, the translators from the forward and reverse translation process were utilized as language consultants for the expert deliberations. In this regard, with justification, the “purity” of the forward and reverse translation was not maintained. A side-by-side instrument comparative each item, by dimensions, is provided in the Supplemental Table 1 for the AHRQ English and Spanish versions, the Spanish Version for Spain, and the instrument produced from this study.

Conclusion

This is the first study to report the application of the AHRQ recommended target-language translation process, with the optional cognitive interviews, as an effective method for cross-cultural research. Incorporating cognitive interviews into a rigorous translation method resulted in an equivalent target-language HSOPSC for cross-cultural research. The feedback from the cognitive interviews contributed to substantially improved items as well as systematically validated the resulting Spanish version of the HSOPSC for the Peruvian context. By pilot testing each item with a numerical rating provided a comparison within and between rounds, the resulting evaluation was robust in assessing the clarity and cultural applicability, and provided a composite quality score. The sequential rounds revealed improved item ratings for readability and cultural relevance and the cognitive interviews resulted in successive reductions in item revisions. Furthermore, our analysis identified negatively worded questions are problematic for target-language translations from American English to Peruvian Spanish. The inclusion of a content validity measurement provided additional evidence about semantic equivalence. Finally, this is the first HSOPSC instrument translation, adaptation, and validation study reported in the literature for Latin America and appears to be the only such study for all studies undertaking the translation of the HSOPSC.

Supplementary information

12912_2020_419_MOESM1_ESM.docx (27.9KB, docx)

Additional file 1 Table S1. Item Comparison Across Instruments: Original English and Spanish Versions

Acknowledgements

We want to thank Dr. Adrian Anast, Director of the Online Writing Center at A.T. Still University for working with the research team to edit and revise this manuscript in preparation for publication. Also, we want to thank Professor Karen Dominguez-Cancino, Deputy Director of the EBHC South America: A Joanna Briggs Affiliated Group for her statistical advisement and technical assistance during the manuscript revisions.

Abbreviations

AHRQ

Agency for Healthcare Research and Quality

CPE

Clinical practice expert

CI

Cognitive interview

CVI

Content validity index

HSOPSC

Hospital Survey on Patient Safety Culture

I-CVI

Item-CVI

S-CVI

Scale-level content validity index

S-CVI/Avg

Scale-level content validity index, average

S-CVI/UA

Scale-level content validity index, universal agreement

SME

Subject matter experts

Authors’ contributions

All authors participated in the final approval of the manuscript submitted for journal review and are accountable for ensuring that questions related to the accuracy and/or integrity of any part of the manuscript are appropriately investigated and resolved. In the addition, the following authors were involved in the: Study conception (PAP); study design (EWF, JLM, JSL, KMM,); data collection (PAP); data analysis (PAP, JLM, JSL,); data interpretation (PAP, EWF, JLM, JSL, KMM, NGG,); drafting the manuscript (EWF, JLM, JSL, PAP,); developing the tables (DEC, JLM, NGG, PAP); substantial revision to the manuscript (DEC, JLM, NGG, PAP), critical revisions to the final manuscript (DEC, JLM, NGG, PAP).

Author’s information

PAP is a Fellow of the American Academy of Nursing and a nationally certified researcher (No. P0022630) at the Peruvian National Council of Science, Technology, and Innovation (CONCYTEC). He is also the Director, EBHC South America: A Joanna Briggs Institute Affiliated Group (Chile, Colombia, Peru); Vice Chancellor for Research, Universidad Norbert Wiener (Peru); Adjunct Professor, College of Graduate Health Studies, A.T. Still University (USA); Adjunct Professor, Doctoral Program, College of Health Sciences, Walden University (USA); and Chairman of the Board of Directors, Sigma Theta Tau International Foundation for Nursing (USA).

Funding

This research project was partially funded through a research dissemination grant from the Universidad Cooperativa de Colombia received by Dr. Doriam E. Camacho-Rodríguez.

Availability of data and materials

The datasets used and/or analyzed during the current study available from the corresponding author on reasonable request.

Ethics approval and consent to participate

The study protocol (#01146) was approved by the A.T. Still University Institutional Review Board (Mesa, Arizona). As approved by the Institutional Review Board, each participant verbally consented to participate in this study.

Consent for publication

This consent is not applicable.

Competing interests

The authors declare that they have no competing interests.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Patrick A. Palmieri, Email: Patrick.Palmieri@jbisa.org, Email: patrick.palmieri@uwiener.edu.pe, Email: ppalmieri@atsu.edu, Email: patrick.palmieri@mail.waldenu.edu

Juan M. Leyva-Moral, Email: JuanManuel.Leyva@uab.cat, Email: Juan.Leyva@jbisa.org

Doriam E. Camacho-Rodriguez, Email: doriam.camacho@ucc.edu.co

Nina Granel-Gimenez, Email: nina.granel@uab.cat.

Eric W. Ford, Email: ewford@uab.edu

Kathleen M. Mathieson, Email: kmathieson@atsu.edu

Joan S. Leafman, Email: jleafman@atsu.edu

Supplementary information

Supplementary information accompanies this paper at 10.1186/s12912-020-00419-9.

References

  • 1.Jha AK, Prasopa-Plaizier N, Larizgoitia I, Bates DW. Research priority setting working group of the WHOWAfPS: patient safety research: an overview of the global evidence. Qual Saf Health Care. 2010;19:42–47. doi: 10.1136/qshc.2008.029165. [DOI] [PubMed] [Google Scholar]
  • 2.Clarke SG. The relationship between safety climate and safety performance: a meta analytic review. J Occup Health Psychol. 2006;11:315–327. doi: 10.1037/1076-8998.11.4.315. [DOI] [PubMed] [Google Scholar]
  • 3.Kohn LT, Corrigan JM, Donaldson MS, editors. To err is human: building a safer health system. Washington, D.C.: National Academy Press; 2000. [PubMed] [Google Scholar]
  • 4.Gershon RR, Karkashian CD, Grosch JW, Murphy LR, Escamilla-Cejudo A, Flanagan PA, Bernacki E, Kasting C, Martin L. Hospital safety climate and its relationship with safe work practices and workplace exposure incidents. Am J Infect Control. 2000;28:211–221. doi: 10.1067/mic.2000.105288. [DOI] [PubMed] [Google Scholar]
  • 5.Scott T, Mannion R, Marshall M, Davies H. Does organisational culture influence health care performance? A review of the evidence. J Health Serv Res Policy. 2003;8:105–117. doi: 10.1258/135581903321466085. [DOI] [PubMed] [Google Scholar]
  • 6.Palmieri PA, Peterson LT. Attribution theory and healthcare culture: Translational management science contributes a framework to identify the etiology of punitive clinical environments. In: Savage GT, Fottler MD, editors. Biennial Review of Health Care Management: Meso Perspective. Bingley: Emerald Group Publishing; 2009. pp. 81–111. [Google Scholar]
  • 7.Clark G. Organisational culture and safety: an interdependent relationship. Aust Health Rev. 2002;25:181–189. doi: 10.1071/AH020181. [DOI] [PubMed] [Google Scholar]
  • 8.Hellings J, Schrooten W, Klazinga NS, Vleugels A. Improving patient safety culture. Int J Health Care Qual Assur. 2010;23:489–506. doi: 10.1108/09526861011050529. [DOI] [PubMed] [Google Scholar]
  • 9.Noort MC, Reader TW, Shorrock S, Kirwan B. The relationship between national culture and safety culture: implications for international safety culture assessments. J Occup Organ Psychol. 2016;89:515–538. doi: 10.1111/joop.12139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Singer S, Falwell A, Gaba DM, Meterko M, Rosen A, Hartmann CW, Baker L. Identifying organizational cultures that promote patient safety. Health Care Manag Rev. 2009;34:300–311. doi: 10.1097/HMR.0b013e3181afc10c. [DOI] [PubMed] [Google Scholar]
  • 11.El-Jardali F, Dimassi H, Jamal D, Jaafar M, Hemadeh N. Predictors and outcomes of patient safety culture in hospitals. BMC Health Serv Res. 2011;11:1–12. doi: 10.1186/1472-6963-11-45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Mardon RE, Khanna K, Sorra J, Dyer N, Famolaro T. Exploring relationships between hospital patient safety culture and adverse events. J Patient Saf. 2010;6:226–232. doi: 10.1097/PTS.0b013e3181fd1a00. [DOI] [PubMed] [Google Scholar]
  • 13.Palmieri PA, Peterson LT, Pesta BJ, Flit MA, Saettone DM. Safety culture as a contemporary healthcare construct: Theoretical review, research assessment, and translation to human resource management. In: Savage GT, Khatri N, Fottler MD, editors. Strategic Human Resource Management in Health Care. Bingley: Emerald Group Publishing Limited; 2010. pp. 97–133. [Google Scholar]
  • 14.Shortell SM, Denise M, Rouseau DM, Gillies RR, Devers KJ, Simons TL. Organizational assessment in intensive care units (ICUs): construct development, reliability, and validity of the ICU nurse-physician questionnaire. Med Care. 1991;29:709–723. doi: 10.1097/00005650-199108000-00004. [DOI] [PubMed] [Google Scholar]
  • 15.Hofoss D, Deilkas E. Roadmap for patient safety research: approaches and roadforks. Scandanavian J Public Health. 2008;36:812–817. doi: 10.1177/1403494808096168. [DOI] [PubMed] [Google Scholar]
  • 16.World Alliance for Patient Safety . Forward Programme: 2008–2009. 1. Geneva: World Health Organization; 2008. [Google Scholar]
  • 17.Drösler SE, Klazinga NS, Romano PS, Tancredi DJ, Gogorcena Aoiz MA, Hewitt MC, Scobie S, Soop M, Wen E, Quan H, et al. Application of patient safety indicators internationally: a pilot study among seven countries. Int J Qual Health Care. 2009;21:272–278. doi: 10.1093/intqhc/mzp018. [DOI] [PubMed] [Google Scholar]
  • 18.Kruk ME, Freedman LP. Assessing health system performance in developing countries: a review of the literature. Health Policy. 2008;85:263–276. doi: 10.1016/j.healthpol.2007.09.003. [DOI] [PubMed] [Google Scholar]
  • 19.Marcel JP, Alfa M, Baquero F, Etienne J, Goossens H, Harbarth S, Hryniewicz W, Jarvis W, Kaku M, Leclercq R, et al. Healthcare-associated infections: think globally, act locally. Clin Microbiol Infect. 2008;14:895–907. doi: 10.1111/j.1469-0691.2008.02074.x. [DOI] [PubMed] [Google Scholar]
  • 20.Hernandez K, Ramos E, Seas C, Henostroza G, Gotuzzo E. Incidence of and risk factors for surgical-site infections in a Peruvian hospital. Inf Contrl Hosp Epidemiol. 2005;26:473–477. doi: 10.1086/502570. [DOI] [PubMed] [Google Scholar]
  • 21.Velasco E, Thuler LC, Martins CA, Dias LM, Goncalves VM. Nosocomial infections in an oncology intensive care unit. Am J Infect Control. 1997;25:458–462. doi: 10.1016/S0196-6553(97)90067-5. [DOI] [PubMed] [Google Scholar]
  • 22.Basu S, Andrews J, Kishore S, Panjabi R, Stuckler D. Comparative performance of private and public healthcare systems in low- and middle-income countries: a systematic review. PLoS Med. 2012;9:e1001244. doi: 10.1371/journal.pmed.1001244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Ramirez-Wong FM, Atencio-Espinoza T, Rosenthal VD, Ramirez E, Torres-Zegarra SL, Díaz Tavera ZR, Sarmiento López F, Silva Astete N, Campos Guevara F, Bazan Mendoza C, et al. Surgical site infections rates in more than 13,000 surgical procedures in three cities in Peru: findings of the international nosocomial infection control consortium. Surg Infect. 2015;16:572–576. doi: 10.1089/sur.2014.201. [DOI] [PubMed] [Google Scholar]
  • 24.Arrieta A, Suárez G, Hakim G. Assessment of patient safety culture in private and public hospitals in Peru. Int J Qual Health Care. 2018;30:186–191. doi: 10.1093/intqhc/mzx165. [DOI] [PubMed] [Google Scholar]
  • 25.Payne SC, Bergman ME, Beus JM, Rodríguez JM, Henning JB. Safety climate: leading or lagging indicator of safety outcomes? J Loss Prev Process Ind. 2009;22:735–739. doi: 10.1016/j.jlp.2009.07.017. [DOI] [Google Scholar]
  • 26.Colla JB, Bracken AC, Kinney LM, Weeks WB. Measuring patient safety climate: a review of surveys. Qual Saf Health Care. 2005;14:364–366. doi: 10.1136/qshc.2005.014217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Blegen MA, Gearhart S, O'Brien R, Sehgal NL, Alldredge BK. AHRQ's hospital survey on patient safety culture: psychometric analyses. J Patient Saf. 2009;5:139–144. doi: 10.1097/PTS.0b013e3181b53f6e. [DOI] [PubMed] [Google Scholar]
  • 28.Nieva VF, Sorra J. Safety culture assessment: A tool for improving patient safety in healthcare organizations. Qual Saf Health Care. 2003;12:ii17–ii23. doi: 10.1136/qhc.12.suppl_2.ii17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Sorra J, Dyer N. Multilevel psychometric properties of the AHRQ hospital survey on patient safety culture. BMC Health Serv Res. 2010;10:199. doi: 10.1186/1472-6963-10-199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Chen IC, Li H-H. Measuring patient safety culture in Taiwan using the hospital survey on patient safety culture (HSOPSC) BMC Health Serv Res. 2010;10:152. doi: 10.1186/1472-6963-10-152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.El-Jardali F, Jaafar M, Dimassi H, Jamal D, Hamdan R. The current state of patient safety culture in Lebanese hospitals: a study at baseline. Int J Qual Health Care. 2010;22:386–395. doi: 10.1093/intqhc/mzq047. [DOI] [PubMed] [Google Scholar]
  • 32.Pfeiffer Y, Manser T. Development of the German version of the hospital survey on patient safety culture: dimensionality and psychometric properties. Saf Sci. 2010;48:1452–1462. doi: 10.1016/j.ssci.2010.07.002. [DOI] [Google Scholar]
  • 33.Sunol R, Vallejo P, Groene O, Escaramis G, Thompson A, Kutryba B, Garel P. Implementation of patient safety strategies in European hospitals. Qual Saf Health Care. 2009;18:i57–i61. doi: 10.1136/qshc.2008.029413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Güneş ÜY, Gürlek Ö, Sönmez M. A survey of the patient safety culture of hospital nurses in Turkey. Collegian. 2016;23:225–232. doi: 10.1016/j.colegn.2015.02.005. [DOI] [Google Scholar]
  • 35.Raeissi P, Reisi N, Nasiripour AA. Assessment of patient safety culture in Iranian academic hospitals: strengths and weaknesses. J Patient Saf. 2018;14:213–226. doi: 10.1097/PTS.0000000000000199. [DOI] [PubMed] [Google Scholar]
  • 36.Granel N, M-DJ M, Barth A, Papp K, Bernabeu-Tamayo MD. Patient safety culture in Hungarian hospitals. Int J Health Care Qual Assur. 2019;32:412–424. doi: 10.1108/IJHCQA-02-2018-0048. [DOI] [PubMed] [Google Scholar]
  • 37.Agency for Healthcare Research and Quality . Cuestionario sobre la de seguridad de los pacientes en los hospitales. Rockville: Agency for Healthcare Research and Quality; 2009. [Google Scholar]
  • 38.Fajardo-Dolci G, Rodriguez-Suarez J, Arboleya-Casanova H, Rojano-Fernandez C, Hernandez-Torres F, Santacruz-Varela J. Cultura sobre seguridad del paciente en profesionales de la salud. Cirugía y Cirujanos. 2010;78:522–527. [PubMed] [Google Scholar]
  • 39.Ramírez-Martínez ME, González Pedraza-Avilés A. Cultura de seguridad y eventos adversos en una clínica de primer nivel. Enfermería Universitaria. 2017;14:111–117. doi: 10.1016/j.reu.2017.02.006. [DOI] [Google Scholar]
  • 40.Gómez Ramírez O, Arenas Gutiérrez W, González Vega L, Garzón Salamanca J, Mateus Galeano E, Soto Gámez A. Cultura de seguridad del paciente por personal de enfermeria en Bogata, Colombia. Ciencia y enfermería. 2011;17:97–111. doi: 10.4067/S0717-95532011000300009. [DOI] [Google Scholar]
  • 41.Saturno Hernández PJ, Da Silva Gama ZA, de Oliveira Sousa SL, YA YAF, De Souza Oliveira AC, editors. Análisis de la cultura sobre seguridad del paciente en el ámbito hospitalario del Sistema Nacional de Salud español. Madrid: Ministerio de Sanidad y Consumo; 2007. [DOI] [PubMed] [Google Scholar]
  • 42.Pinheiro MdP, Junior OCdS: Evaluación de la cultura de seguridad del paciente en una organización hospitalaria de un hospital universitario. Enfermería Global 2016, 16:309–324.
  • 43.Agency for Healthcare Research and Quality. Translation guidelines for the surveys on patient safety culture. Rockville; 2010.
  • 44.Flin R, Burns C, Mearns K, Yule S, Robertson EM. Measuring safety climate in health care. Qual Saf Health Care. 2006;15:109–115. doi: 10.1136/qshc.2005.014761. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Hutchinson A, Cooper KL, Dean JE, McIntosh A, Patterson M, Stride CB, Laurence BE, Smith CM. Use of a safety climate questionnaire in UK health care: factor structure, reliability and usability. Qual Saf Health Care. 2006;15:347–353. doi: 10.1136/qshc.2005.016584. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Fujishiro K, Gong F, Baron S, Jacobson CJ, DeLaney S, Flynn M, Eggerth DE. Translating questionnaire items for a multi-lingual worker population: the iterative process of translation and cognitive interviews with English-, Spanish-, and Chinese-speaking workers. Am J Ind Med. 2010;53:194–203. 10.1002/ajim.20733. [DOI] [PubMed]
  • 47.Levin K, Willis GB, Forsyth BH, Norberg A, Kudela MS, Stark D, Thompson FE. Using cognitive interviews to evaluate the Spanish-language translation of a dietary questionnaire. Surv Res Methods. 2009;3:13–25. [Google Scholar]
  • 48.Lopez GI, Figueroa M, Connor SE, Maliski SL. Translation barriers in conducting qualitative research with Spanish speakers. Qual Health Res. 2008;18:1729–1737. doi: 10.1177/1049732308325857. [DOI] [PubMed] [Google Scholar]
  • 49.Usunier J-C. Language as a resource to assess cross-cultural equivalence in quantitative management research. J World Bus. 2011;46:314–319. doi: 10.1016/j.jwb.2010.07.002. [DOI] [Google Scholar]
  • 50.Flaherty JA, Gaviria FM, Pathak D, Mitchell T, Wintrob R, Richman JA, Birz S. Developing instruments for cross-cultural psychiatric research. J Nerv Ment Dis. 1988;176:257–263. [PubMed] [Google Scholar]
  • 51.van de Vijver FJR, Poortinga YH. Towards an integrated analysis of bias in cross-cultural assessment. Eur J Psychol Assess. 1997;13:29–37. doi: 10.1027/1015-5759.13.1.29. [DOI] [Google Scholar]
  • 52.Waltz CF, Strickland OL, Lenz ER, editors. Measurement in nursing and health research. 4. New York: Springer Publishing Company; 2010. [Google Scholar]
  • 53.Knafl K, Deatrick J, Gallo A, Holcombe G, Bakitas M, Dixon J, Grey M. The analysis and interpretation of cognitive interviews for instrument development. Res Nurs Health. 2007;30:224–234. doi: 10.1002/nur.20195. [DOI] [PubMed] [Google Scholar]
  • 54.Thrasher JF, Quah ACK, Dominick G, Borland R, Driezen P, Awang R, Omar M, Hosking W, Sirirassamee B, Boado M. Using cognitive interviewing and behavioral coding to determine measurement equivalence across linguistic and cultural groups: an example from the international tobacco control policy evaluation project. Field Methods. 2011;23:439–460. doi: 10.1177/1525822X11418176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Squires A. Methodological challenges in cross-language qualitative research: a research review. Int J Nurs Stud. 2009;46:277–287. doi: 10.1016/j.ijnurstu.2008.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Maneesriwongul W, Dixon JK. Instrument translation process: a methods review. J Adv Nurs. 2004;48:175–186. doi: 10.1111/j.1365-2648.2004.03185.x. [DOI] [PubMed] [Google Scholar]
  • 57.Willis GB. Cognitive interviewing: a tool for improving questionnaire design. Thousand Oaks: Sage; 2005. [Google Scholar]
  • 58.Erkut S. Developing multiple language versions of instruments for intercultural research. Child Dev Perspect. 2010;4:19–24. doi: 10.1111/j.1750-8606.2009.00111.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Mason TC. Cross-cultural instrument translation: assessment, translation, and statistical applications. Am Ann Deaf. 2005;150:67–72. doi: 10.1353/aad.2005.0020. [DOI] [PubMed] [Google Scholar]
  • 60.Temple B. Nice and tidy: translation and representation. Sociol Res Online. 2005;10:1–10. doi: 10.5153/sro.1058. [DOI] [Google Scholar]
  • 61.Acquadro C, Conway K, Hareendran A, Aaronson N. Literature review of methods to translate health-related quality of life questionnaires for use in multinational clinical trials. Value Health. 2008;11:509–521. doi: 10.1111/j.1524-4733.2007.00292.x. [DOI] [PubMed] [Google Scholar]
  • 62.Wild D, Grove A, Martin M, Eremenco S, McElroy S, Verjee-Lorenz A, Erikson P. Principles of good practice for the translation and cultural adaptation process for patient-reported outcomes (PRO) measures: report of the ISPOR task force for translation and cultural adaptation. Value Health. 2005;8:94–104. doi: 10.1111/j.1524-4733.2005.04054.x. [DOI] [PubMed] [Google Scholar]
  • 63.Squires A, Aiken LH, van den Heede K, Sermeus W, Bruyneel L, Lindqvist R, Schoonhoven L, Stromseng I, Busse R, Brzostek T, et al. A systematic survey instrument translation process for multi-country, comparative health workforce studies. Int J Nurs Stud. 2013;50:264–273. doi: 10.1016/j.ijnurstu.2012.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Beatty PC, Willis GB. Research synthesis: the practice of cognitive interviewing. Public Opin Q. 2007;71:287–311. doi: 10.1093/poq/nfm006. [DOI] [Google Scholar]
  • 65.Weech-Maldonado R, Morales LS, Spritzer K, Elliott M, Hays RD. Racial and ethnic differences in parents' assessments of pediatric care in Medicaid managed care. Health Serv Res. 2001;36:575–594. [PMC free article] [PubMed] [Google Scholar]
  • 66.Garcia AA. Cognitive interviews to test and refine questionnaires. Public Health Nurs. 2011;28:444–450. doi: 10.1111/j.1525-1446.2010.00938.x. [DOI] [PubMed] [Google Scholar]
  • 67.Harkness JA, Van de Vijver FJR, Mohler P. Cross-cultural survey methods. Hoboken: Wiley; 2003. [Google Scholar]
  • 68.Agency for Healthcare Research and Quality. Hospital Survey on Patient Safety Culture: Background and information for translators. Rockville: Agency for Healthcare Research and Quality; 2009.
  • 69.Agency for Healthcare Research and Quality . Pilot study: Validity and reliability of the hospital survey on patient safety. Rockville: Agency for Healthcare Research and Quality; 2004. [Google Scholar]
  • 70.Gjersing L, Caplehorn JR, Clausen T. Cross-cultural adaptation of research instruments: language, setting, time and statistical considerations. BMC Med Res Methodol. 2010;10:13. doi: 10.1186/1471-2288-10-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Sousa VD, Rojjanasrirat W. Translation, adaptation and validation of instruments or scales for use in cross-cultural health care research: a clear and user-friendly guideline. J Eval Clin Pract. 2011;17:268–274. doi: 10.1111/j.1365-2753.2010.01434.x. [DOI] [PubMed] [Google Scholar]
  • 72.Guillemin F, Bombardier C, Beaton D. Cross-cultural adaptation of health-related quality of life measures: literature review and proposed guidelines. J Clin Epidemiol. 1993;46:1417–1432. doi: 10.1016/0895-4356(93)90142-N. [DOI] [PubMed] [Google Scholar]
  • 73.Squires A. Language barriers and qualitative nursing research: methodological considerations. Int Nurs Rev. 2008;55:265–273. doi: 10.1111/j.1466-7657.2008.00652.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Grant JS, Davis LL. Selection and use of content experts for instrument development. Res Nurs Health. 1997;20:269–274. doi: 10.1002/(SICI)1098-240X(199706)20:3&#x0003c;269::AID-NUR9&#x0003e;3.0.CO;2-G. [DOI] [PubMed] [Google Scholar]
  • 75.Bracken BA, Barona A. State of the art procedures for translating, validating and using psychoeducational tests in cross-cultural assessment. Sch Psychol Int. 1991;12:119–132. doi: 10.1177/0143034391121010. [DOI] [Google Scholar]
  • 76.Brislin RW. Back-translation for cross-cultural research. J Cross-Cult Psychol. 1970;1:185–216. doi: 10.1177/135910457000100301. [DOI] [Google Scholar]
  • 77.Chapman DW, Carter JF. Translation procedures for the cross cultural use of measurement instruments. Educ Eval Policy Anal. 1979;1:71–76. doi: 10.3102/01623737001003071. [DOI] [Google Scholar]
  • 78.Jones PS, Lee JW, Phillips LR, Zhang XE, Jaceldo KB. An adaptation of Brislin’s translation model for cross-cultural research. Nurs Res. 2001;50:300–304. doi: 10.1097/00006199-200109000-00008. [DOI] [PubMed] [Google Scholar]
  • 79.Chang AM, Chau JPC, Holroyd E. Translation of questionnaires and issues of equivalence. J Adv Nurs. 1999;29:316–322. doi: 10.1046/j.1365-2648.1999.00891.x. [DOI] [PubMed] [Google Scholar]
  • 80.Tourangeau R. Cognitive sciences and survey methods. In: Jabine TB, Straf ML, Tanur JM, Tourangeau R, editors. Cognitive aspects of survey methodology: Building a bridge between disciplines. Washington, DC: National Academy Press; 1984. pp. 73–100. [Google Scholar]
  • 81.Miller K. Conducting cognitive interviews to understand question-response limitations. Am J Health Behav. 2003;27:S264–S272. doi: 10.5993/AJHB.27.1.s3.10. [DOI] [PubMed] [Google Scholar]
  • 82.Polit DF, Beck CT. The content validity index: are you sure you know what's being reported? Critique and recommendations. Res Nurs Health. 2006;29:489–497. doi: 10.1002/nur.20147. [DOI] [PubMed] [Google Scholar]
  • 83.Polit DF, Beck CT, Owen SV. Is the CVI an acceptable indicator of content validity? Appraisal and recommendations. Res Nurs Health. 2007;30:459–467. doi: 10.1002/nur.20199. [DOI] [PubMed] [Google Scholar]
  • 84.Rubio DM, Berg-Weger M, Tebb SS, Lee ES, Rauch S. Objectifying content validity: conducting a content validity study in social work research. Soc Work Res. 2003;27:94–104. doi: 10.1093/swr/27.2.94. [DOI] [Google Scholar]
  • 85.Lawshe CH. A quantitative approach to content validity. Pers Psychol. 1975;28:563–575. doi: 10.1111/j.1744-6570.1975.tb01393.x. [DOI] [Google Scholar]
  • 86.Lynn MR. Determination and quantification of content validity. Nurs Res. 1986;35:382–385. doi: 10.1097/00006199-198611000-00017. [DOI] [PubMed] [Google Scholar]
  • 87.Zamanzadeh V, Rassouli M, Abbaszadeh A, Alavi Majd H, Nikanfar A, Ghahramanian A. Details of content validity and objectifying it in instrument development. Nurs Pract Today. 2015;1:163–171. [Google Scholar]
  • 88.Okochi J, Takahashi T, Takamuku K, Matsuda S, Takagi Y. Reliability of a geriatric assessment instrument with illustrations. Geriatr Gerontol Int. 2005;5:37–47. doi: 10.1111/j.1447-0594.2005.00268.x. [DOI] [Google Scholar]
  • 89.Gorelick MH, Wagner D, McLellan SL. Development and validation of a self-administered questionnaire to measure water exposures in children. Ambul Pediatr. 2008;8:388–391. doi: 10.1016/j.ambp.2008.07.004. [DOI] [PubMed] [Google Scholar]
  • 90.Dufour S, Barkema HW, DesCôteaux L, DeVries TJ, Dohoo IR, Reyher K, Roy J-P, Scholl DT. Development and validation of a bilingual questionnaire for measuring udder health related management practices on dairy farms. Prev Vet Med. 2010;95:74–85. doi: 10.1016/j.prevetmed.2010.02.018. [DOI] [PubMed] [Google Scholar]
  • 91.Cicchetti DV, Sparrow SA. Developing criteria for establishing interrater reliability of specific items: applications to assessment of adaptive behavior. Am J Ment Defic. 1981;86:127–137. [PubMed] [Google Scholar]
  • 92.Fleiss JL. Measuring nominal scale agreement among many raters. Psychol Bull. 1971;76:378–382. doi: 10.1037/h0031619. [DOI] [Google Scholar]
  • 93.Landis JR, Koch GG. An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers. Biometrics. 1977;33:363–374. doi: 10.2307/2529786. [DOI] [PubMed] [Google Scholar]
  • 94.DeVillis RF. Scale development: theory and application. Thousand Oaks: Sage Publications; 2003.
  • 95.Napoles-Springer AM, Santoyo-Olsson J, O'Brien H, Stewart AL. Using cognitive interviews to develop surveys in diverse populations. Med Care. 2006;44:S21–S30. doi: 10.1097/01.mlr.0000245425.65905.1d. [DOI] [PubMed] [Google Scholar]
  • 96.Banville D, Desrosiers P, Genet-Volet Y. Translating questionnaires and inventories using a cross-cultural translation technique. J Teach Phys Educ. 2000;19:374–387. doi: 10.1123/jtpe.19.3.374. [DOI] [Google Scholar]
  • 97.Messick S. The psychology of acquiescence: an interpretation of research evidence. ETS Res Bull Ser. 1966;1966:1–44. [Google Scholar]
  • 98.Ray JJ. Reviving the problem of acquiescent response bias. J Soc Psychol. 1983;121:81–96. doi: 10.1080/00224545.1983.9924470. [DOI] [Google Scholar]
  • 99.Lavrakas PJ, editor. Encyclopedia of survey research methods. Thousand Oaks: SAGE Publications; 2008.
  • 100.Solís-Salazar M. The dilemma of combining positive and negative items in scales. Psicothema. 2015;27:192–199. doi: 10.7334/psicothema2014.266. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

12912_2020_419_MOESM1_ESM.docx (27.9KB, docx)

Additional file 1 Table S1. Item Comparison Across Instruments: Original English and Spanish Versions

Data Availability Statement

The datasets used and/or analyzed during the current study available from the corresponding author on reasonable request.


Articles from BMC Nursing are provided here courtesy of BMC

RESOURCES