Cross-cultural adaptation of the 4-Habits Coding Scheme into French to assess physician communication skills

Alexandre Bellier; Philippe Chaffanjon; Edward Krupat; Patrice Francois; José Labarère

doi:10.1371/journal.pone.0230672

. 2020 Apr 16;15(4):e0230672. doi: 10.1371/journal.pone.0230672

Cross-cultural adaptation of the 4-Habits Coding Scheme into French to assess physician communication skills

Alexandre Bellier ^1,^2,^3,^4,^*, Philippe Chaffanjon ², Edward Krupat ⁵, Patrice Francois ¹, José Labarère ^1,^2,^3,⁴

Editor: Gian Mauro Manzoni⁶

PMCID: PMC7161987 PMID: 32298278

Abstract

Background

The Four Habits Coding Scheme (4-HCS) is a standardized instrument designed to assess physicians’ communication skills from an external rater’s perspective, based on video-recorded consultations.

Objective

To perform the cross-cultural adaptation of the 4-HCS into French and to assess its psychometric properties.

Methods

The 4-HCS was cross-culturally adapted by conducting forward and backward translations with independent translators, following international guidelines. Four raters rated 200 video-recorded medical student consultations with standardized patients, using the French version of the 4-HCS. We examined the internal consistency, factor structure, construct validity, and reliability of the 4-HCS.

Results

The mean overall 4-HCS score was 76.44 (standard deviation, 12.34), with no floor or ceiling effects across subscales. The median rating duration of rating was 8 min (range, 4–19). Cronbach’s alpha was 0.94 for the overall 4-HCS, ranging from 0.72 to 0.88 across subscales. In confirmatory factor analysis, goodness-of-fit statistics did not corroborate the hypothesized 4-habit structure. Exploratory factor analysis resulted in two dimensions, with the merging of three conceptually related habits into a single dimension and substantial cross-loading for 15 out of 23 items. Median average absolute-agreement intra-class correlation coefficient estimates were 0.74 (range, 0.68–0.84) and 0.85 (range, 0.76–0.91) for inter- and intra-rater reliability of habit subscales, respectively.

Conclusion

The French version of the 4-HCS demonstrates satisfactory internal consistency but requires the use of two independent raters to achieve acceptable reliability. The underlying factor structure of the original US version and cross-cultural adaptations of the 4-HCS deserve further investigation.

Introduction

Physician communication skills are key components of effective medical consultations [1] and comprise core physician competences that are most desired by patients [2]. Evidence has accumulated, supporting the conclusion that high-quality communication relates with enhanced patient satisfaction [3], greater adherence to treatment [4], better health outcomes [5], and decreased risk of malpractice claims [6]. Many organizations have therefore implemented structured training programs and routinely assessed physicians’ communication skills [3, 7].

The Four Habits Coding Scheme (4-HCS) is a standardized instrument designed to assess 23 physician communication skills or behaviors from an external rater’s perspective, based on video-recorded consultations [8]. The 4-HCS is based on the conceptual framework of the “Four Habits Model,” a training program that was developed within the US Kaiser Permanente Health Maintenance Organization and implemented for teaching effective communication skills to thousands of clinicians in this organization over the two last decades [9]. The Four Habits Model refers to basic medical interview tasks that are organized within four dimensions for didactic purposes, namely, Invest in the beginning (six items), Elicit the patient’s perspective (three items), Demonstrate empathy (four items), and Invest in the end (ten items) [9].

The original US version of the 4-HCS demonstrated acceptable inter-rater reliability and evidence for construct validity despite moderate internal consistency, with Cronbach’s alpha coefficients ranging from 0.51 to 0.81 across the Four Habits, over 100 video-recorded physician–patient encounters [8]. Since its original development, the 4-HCS has been utilized outside the Kaiser Permanente system [9–13]. Cross-cultural adaptations of the 4-HCS have been published in different languages: Norwegian [14], German [15], and Brazilian Portuguese [16]. Given the potential of the 4-HCS for assessing baseline communication skills and measuring the effectiveness of a training program aiming to alter communication skills [9], there is a need for a French version that can be used with medical students during the 4-year competency-based communication curriculum. To our knowledge, only three studies examined the psychometric properties of the 4-HCS and none has investigated the underlying factor structure [8, 14, 15]. Although the findings of previous studies were promising, the authors recommended examining the validity and reliability of the 4-HCS further in different settings and populations [8].

In the present study, we aimed to perform the cross-cultural adaptation of the 4-HCS into French and to assess the psychometric properties of the adapted version, using the original data of video-recorded medical student consultations with standardized patients.

Materials and methods

Study design

A two-step procedure was used. Firstly, the 4-HCS was translated and cross-culturally adapted into French. Secondly, the psychometric properties of the French version were studied, with regard to internal consistency, validity, and reliability. The present study was conducted in the Grenoble Alpes University Hospital and the University Grenoble Alpes School of Medicine, France.

Cross-cultural adaptation of the 4-HCS into French

The translation and cross-cultural adaptation of the source version of the 4-HCS and its codebook were performed by Mapi Language Services, according to published guidelines [17]. Mapi Language Services is an international company with expertise in the field of translation and cross-cultural adaptation of patient reported outcome measures (www.mapigroup.com/services/language-services/).

The aim of the cross-cultural adaptation process was to obtain a French translation that was conceptually equivalent to the US source version, culturally relevant to the French context, and easily understood by the people who would use the instrument. For this purpose, we used a rigorous methodology involving input from the 4-HCS developer on conceptual issues and a centralized review process coordinated by a consultant with experience in the field. This process included a common understanding of the 4-HCS concepts by all participants involved in the project, quality control by translators, and discussion about translation decisions at each step.

Practically speaking, the 4-HCS developer (EK) was contacted to obtain permission to use and translate the instrument and to invite him to participate in the project. Two qualified native French-speaking translators independently translated the source version of the 4-HCS into French. A single version was obtained after a reconciliation meeting of the two translators. Then this version was back-translated into English by a third qualified native English speaker, who was blinded to the original US version. The back translation was reviewed for semantic and operational equivalence against the source version of the 4-HCS. We followed a universalist approach for equivalence, assessing conceptual, item, semantic, operational, measurement and functional equivalence [18]. After the resulting version was pretested by two raters using 63 video-recorded medical student consultations with standardized patients, minor adjustments were made to obtain the final version. The principal investigator checked the proofs of the final version and corrected any errors.

4-HCS scoring

In accordance with the source version of the 4-HCS, each item was rated on a 5-point Likert scale, ranging from 1 to 5, with higher scores denoting better performance. The midpoint (i.e., 3) and the two endpoints (i.e., 1 and 5) were anchored, with specific behavioral descriptions [8]. The raters were encouraged to use the midpoint and endpoint categories, with other categories (i.e., 2 and 4) to be used only if they thought that communication skills fell between the anchored points [8]. This approach ensured full use of the 5-point Likert scale [8].

An overall communication skill score was computed by summing ratings for the individual items, ranging from 23 (i.e., less effective) to 115 (i.e., more effective). Four subscale scores were also computed, each of them corresponding to a key dimension of communication skills (i.e., Invest in the beginning [range, 6–30], Elicit the patient’s perspective [range, 3–15], Demonstrate empathy [range, 4–20], and Invest in the end [range, 10–50]).

Physician–patient relationship competence assessment scale

The physician–patient relationship competence assessment scale was developed in French and validated in Canada [19]. This scale consists in 15 items exploring two dimensions of interpersonal skills, namely “understanding of the patient's disease experience” (eight items) and “efficient and respectful communication” (seven items). Each item could be rated on a 4-point Likert scale, ranging from 15 (i.e., less effective) to 60 (i.e., more effective).

Study sample and data collection

The study sample consisted of video-recorded medical student consultations with standardized patients. All 218 medical students were invited to participate as part of the 4-year competency-based communication curriculum. They were allocated in consecutive alphabetical order based on their surname to the 1^st or 2^nd semester sessions that took place in October 2017 and April 2018, respectively.

Standardized patients were 20 actors recruited at the Department of Performing Arts in Grenoble Alpes University. Seven standardized medical consultation scenarios were developed and combined with nine personality types or character traits. The actors were instructed on each medical consultation scenario by two clinicians during 2-h sessions. Then they were trained by their improvisation instructor on each personality type or character trait. The combinations of medical consultation scenarios with personality type or character trait were pretested as part of a pilot study.

All medical consultations with standardized patients were video-recorded using professional video equipment. Video-recording of consultations made it possible to overcome some of the challenges of direct observation [20]. Video-recording accurately recorded all events that occurred during consultations, allowing raters to verify their observations as many times as necessary [20]. Video-recordings could be rated by different observers without consultations being disrupted [21]. Finally, video-recording allowed for providing medical students with feedback on their own performance [20, 22].

Four raters were recruited for the project, including a full professor of medicine (PC) and a resident in medicine (AB), both with experience in teaching communication skills, and two medical students. They underwent a training session, which consisted in independently coding five video-recorded consultations with the 4-HCS and subsequently discussing these ratings. After completing the training, the raters independently rated consultations, with each consultation being rated by at least three different raters. Additionally, two of the raters evaluated each video-recorded medical student consultation twice, with the two ratings 2 months apart, in order to quantify intra-rater reliability.

Data for both the cross-cultural adaptation of the 4-HCS and the physician–patient relationship competence assessment scale were captured using an electronic case report form. The completion of each item was mandatory, so there were no missing values.

Sample size

A sample size of 200 video-recorded consultations was required for confirmatory factor analysis, based on previous simulations [23]. Assuming a Cronbach alpha coefficient point estimate close to 0.80, this sample size would provide a precision of ±0.07 (i.e., 95% confidence interval [CI] ranging from 0.73 to 0.87).

Assuming an intra-rater correlation coefficient point estimate of 0.80, we estimated that a sample of 117 video-recorded consultations would achieve 80% power to demonstrate that it would be higher than 0.70, with a 0.05 two-sided significance level [24]. Assuming an inter-rater correlation coefficient point estimate of 0.80 with four raters, we estimated that a sample of 68 video-recorded consultations would achieve 80% power to demonstrate that it would be higher than 0.70, with a 0.05 two-sided significance level [24].

Statistical analysis

Descriptive statistics

To account for inter-rater variability, we computed the students’ average 4-HCS overall and subscale rating scores [25]. The 4-HCS overall and subscale scores were reported as means along with standard deviations. The numbers and percentages of responses on anchor points for items and overall and subscale scores were examined to detect floor or ceiling effects. Floor and ceiling effects lower than 15% for subscale scores were considered acceptable [26].

Internal consistency

Internal consistency was evaluated through average inter-item correlation, item-rest correlation (i.e., the correlation between an item and the score that was formed by all other items in the subscale), and Cronbach’s alpha coefficient [27]. The internal consistency criterion was fulfilled for item-rest correlation >0.40, and Cronbach’s alpha >0.70 was considered satisfactory [26].

Internal structure

The internal structure of the cross-cultural adaptation of the 4-HCS into French was verified using confirmatory and exploratory factor analysis, following current guidelines [27]. First, structural equation modeling of the four predefined habits was carried out with the 23 items assigned to the intended habits to determine whether the video-recorded medical consultation data fit with the internal structure of the original US version of the 4-HCS. This structural equation modeling corresponded to an external model representing the relationships between latent variables (i.e., the four predefined habits) and the manifest variables (i.e., the 23 related items). Various goodness-of-fit statistics were obtained, including the comparative fit index (CFI) [28], the standardized root mean squared residual (SRMR), and the root mean square error of approximation (RMSEA) along with its 90% CI [29]. A CFI value of 0.90 or higher and a SRMR value lower than 0.08 were considered indicative of satisfactory model fit. A 90% CI lower bound for the RMSEA estimate lower than 0.05 would not reject the hypothesis that the fit was close. An upper bound higher than 0.10 would not reject the hypothesis that the fit was poor.

Second, exploratory factor analysis was performed in order to examine possible alternative structures to the original US version of the 4-HCS. An orthogonal rotation method (Varimax) of factors with eigenvalues higher than 1.00 was used, assuming that they were independent [30]. Primary loadings on intended dimensions higher than 0.40 with cross-loadings lower than 0.30 were considered satisfactory.

Construct validity

We assessed construct validity by comparing 4-HCS overall and subscale score values between first and second-semester video-recorded medical consultations. We hypothesized that mean scores were higher for medical consultations recorded during the second semester. Indeed, second semester students were assumed to be more experienced in basic medical interview tasks and more sensitized to communication skills than their counterparts evaluated during the first semester. We also examined convergent validity between the 4-HCS and the physician–patient relationship competence assessment scale, using Pearson correlation coefficients.

Reliability

Inter- and intra-rater reliability assessment complied with the Guidelines for Reporting Reliability and Agreement Studies (GRAAS) [31]. The reliability of the French version of the 4-HCS overall and subscale scores was quantified by the intra-class correlation coefficient (ICC) [32]. The ICC is suitable for reliability studies with unbalanced designs involving different sets of raters [33]. Both absolute- and consistency-of-agreement ICCs were computed. Under the absolute-agreement approach, the ratings were considered in absolute agreement if the 4-HCS scores from all raters matched exactly [32]. Under the consistency-of-agreement approach, the ratings were considered consistent if the 4-HCS scores from any two raters differed by the same constant value for all video-recorded consultations. This implied that raters gave the same ranking to all video-recorded consultations [34]. Individual and average ICCs were estimated, with average ICCs computed over two raters. Although the agreement measured between individual ratings is more common, the use of average ICCs was indicated in this study because the 4-HCS was intended to be used by teams of raters for assessing video-recorded medical consultations. An ICC value equal to or higher than 0.70 was indicative of satisfactory reliability [14].

The study protocol was approved by the Comité d’Ethique du Centre d’Investigation Clinique de Clermont-Ferrand, France (IRB 5891). All participants received information from the principal investigator about the study’s overall purpose and the confidentiality requirements and they then provided written informed consent.

Results

Mapi Language Services translated the 4-HCS scale in January 2018 and issued a translation validation certificate on February 19, 2018. They scrupulously complied with the protocol drawn up according to international standards, producing a cross-cultural adaptation of the 4-HCS scale in line with expectations. The French version of the 4-HCS is shown in S1 Appendix. The full version included a translation of the “codebook,” a detailed scoring guide for each item with examples of behaviors and suggested ratings.

Of 218 eligible medical students, 200 (92%) participated in the study. A total of 200 consultations with standardized patients were video-recorded, including 115 and 85 during the first and second semesters, respectively (Fig 1). The median duration was 8 min (range, 4–19 min) per video-recorded consultation. Inter-rater reliability assessment involved 800 ratings while 400 ratings contributed to intra-rater reliability assessment (Fig 1).

The mean 4-HCS score was 76.44 (standard deviation, 12.34) for 200 video-recorded medical consultations, with no floor or ceiling effects observed for subscales (Table 1). Yet the highest (5/5) and lowest values (1/5) accrued more than 15% of the respondents for three items and one item, respectively. Mean 4-HCS scores ranged from 66.97 (SD, 10.26) based on 200 video-recorded consultations for rater 1 (i.e., the least experienced rater) to 93.42 (SD 13.46) based on 85 video-recorded consultations for rater 4 (i.e., the most experienced rater).

Table 1. Summary statistics and internal consistency for the cross-cultural adaptation of the 4 Habit Coding Scheme into French (n = 200).

Habit [range]–Item	Mean score (SD)		Ceiling effect, n (%)		Floor effect, n (%)		Average inter-item correlation	Item-total correlation	Item-rest correlation^*	Cronbach Alpha^†
1. Invest in the beginning [6–30]	18.77	(2.80)	0	(0)	0	(0)	0.48	-	-	0.80
1. Show familiarity	3.09	(0.18)	0	(0)	0	(0)	0.43	0.55	0.50	0.81
2. Greet warmly	3.18	(0.27)	0	(0)	0	(0)	0.48	0.64	0.57	0.79
3. Engage in small talk	2.66	(0.89)	6	(3.0)	4	(2.0)	0.46	0.79	0.62	0.76
4. Question style	3.25	(0.74)	2	(1.0)	0	(0)	0.52	0.84	0.72	0.73
5. Expansion of concerns	3.51	(0.70)	0	(0)	4	(2.0)	0.50	0.82	0.70	0.73
6. Elicit full agenda	3.07	(0.81)	3	(1.5)	1	(0.5)	0.47	0.77	0.60	0.76
2. Elicit the patient’s perspective [3–15]	10.01	(2.06)	0	(0)	0	(0)	0.48	-	-	0.72
7. Patient’s understanding	3.70	(0.78)	0	(0)	4	(2.0)	0.48	0.79	0.55	0.62
8. Goals for visit	2.81	(0.79)	6	(3.0)	0	(0)	0.51	0.82	0.60	0.57
9. Impact on life	3.49	(0.98)	2	(1.0)	14	(7.0)	0.43	0.81	0.49	0.72
3. Demonstrate empathy [4–20]	13.75	(2.95)	0	(0)	3	(1.5)	0.52	-	-	0.87
10. Encourage emotional expression	3.33	(0.90)	1	(0.5)	9	(4.5)	0.61	0.90	0.84	0.81
11. Accept feelings	3.63	(0.73)	1	(0.5)	4	(2.0)	0.61	0.87	0.81	0.82
12. Identify feelings	2.60	(1.04)	26	(13.0)	23	(1.5)	0.55	0.84	0.73	0.84
13. Show good nonverbal behavior	4.19	(0.71)	0	(0)	42	(20.5)	0.53	0.76	0.66	0.84
4. Invest in the end [10–50]	33.91	(6.13)	0	(0)	0	(0)	0.46	-	-	0.88
14. Use patient’s frame of reference	3.60	(0.81)	0	(0)	15	(7.5)	0.45	0.70	0.56	0.86
15. Allow time to absorb	4.37	(0.56)	0	(0)	33	(16.5)	0.36	0.55	0.43	0.88
16. Give clear explanation	4.24	(0.76)	0	(0)	63	(31.5)	0.12	0.29	0.15	0.90
17. Offer rationale for tests	3.56	(0.95)	8	(4.0)	10	(5.0)	0.45	0.73	0.63	0.86
18. Test for comprehension	3.33	(0.96)	8	(4.0)	4	(2.0)	0.59	0.89	0.84	0.84
19. Involve in decision	3.01	(0.83)	7	(3.5)	2	(1.0)	0.57	0.86	0.81	0.84
20. Explore plan acceptability	3.21	(0.98)	6	(3.0)	3	(1.5)	0.58	0.87	0.82	0.84
21. Explore barriers	2.02	(0.84)	47	(23.5)	0	(0)	0.46	0.73	0.64	0.86
22. Encourage questions	3.56	(0.93)	1	(0.5)	25	(12.5)	0.44	0.72	0.61	0.86
23. Plan for follow-up	3.00	(0.95)	8	(4.0)	1	(0.5)	0.43	0.70	0.58	0.87
Overall [23–115]	76.44	(12.34)					0.42	-	-	0.94

Open in a new tab

Abbreviations: SD, standard deviation.

* Item-rest correlation was computed as the correlation between an item and the composite score that was formed by all other items in the habit.

† Item Cronbach alpha was computed for composite score that was formed by all other items in the habit.

Cronbach’s alpha was 0.94 (95% CI, 0.93–0.95) for the 4-HCS, ranging from 0.72 to 0.88 across habit subscales (Table 1). All but one item fulfilled the internal consistency criterion, with item-rest correlations higher than 0.40. The exception was item 16 with the item-rest correlation as low as 0.15. Removing this item from the “Invest in the end” habit subscale improved Cronbach’s alpha from 0.88 to 0.90.

In confirmatory factor analysis, structural equation modeling of 23 items apportioned in four latent factors yielded CFI (0.79) and SRMR (0.09) estimates that did not achieve recommended thresholds (CFI >0.90 and SRMR <0.08, respectively). The RMSEA estimate was 0.12 (90% CI, 0.10–0.13), with the 90% CI lower bound not rejecting the hypothesis that the fit was close and the 90% CI upper bound not rejecting the hypothesis that the fit was poor.

Exploratory factor analysis of the 23 items identified four principal components with eigenvalues higher than 1.0 and explaining 66.8% of overall variance (Table 2). Graphical assessment of the scree plot suggested that the instrument was close to unidimensionality (S2 Appendix), supporting the use of an overall 4-HCS score. Yet, the factorial structure for the French version of the 4-HCS departed from the hypothesized four-dimension structure. Twenty-one out of 23 items had primary factor loadings over 0.40 while 15 items yielded cross-loadings over 0.30. Exploratory factor analysis of the French version of the 4-HCS resulted in the merging of three habits (namely, Invest in the beginning [all six items], Elicit the patient’s perspective [two out of three items], and Demonstrate empathy [all four items]) in a single dimension.

Table 2. Exploratory factor analysis for the 23 items of the cross-cultural adaptation of the 4-Habit Coding Scheme into French after orthogonal Varimax rotation (n = 200)^*.

Habit–Item	Factor 1	Factor 2	Factor 3	Factor 4
1. Invest in the beginning
1. Show familiarity	.41			.40
2. Greet warmly	.40	.34		.33
3. Engage in small talk	.58
4. Question style	.58	.49	.32
5. Expansion of concerns	.74	.33
6. Elicit full agenda	.64	.32
2. Elicit the patient’s perspective
7. Patient’s understanding	.53	.61
8. Goals for visit	.48	.45	.47	.35
9. Impact on life	.58
3. Demonstrate empathy
10. Encourage emotional expression	.91
11. Accept feelings	.74	0.35
12. Identify feelings	.86
13. Show good nonverbal behavior	.54	.37
4. Invest in the end
14. Use patient’s frame of reference	.33	.73
15. Allow time to absorb	.36
16. Give clear explanation
17. Offer rationale for tests		.71
18. Test for comprehension	.31	.84	.42
19. Involve in decision	.39	.76	.67
20. Explore plan acceptability		.78	.63
21. Explore barriers		.61	.46
22. Encourage questions		.69
23. Plan for follow-up		.56	.47
Overall variance explained, %	46.5	8.8	6.7	4.8

Open in a new tab

* Values are item loadings ≥.30

As hypothesized, all mean habit scores were significantly higher for medical student consultations recorded at the second semester (Table 3). Most pairwise Pearson correlation coefficients between habit and physician–patient relationship competence assessment scale scores were higher than 0.70, indicating satisfactory convergent validity (Table 4).

Table 3. Comparison of 4-Habit Coding Scheme scores for medical student consultations recorded during first and second semesters.

Habit [range]	1^st semester (n = 115)		2^nd semester (n = 85)		P
1. Invest in the beginning [6–30]	17.54	(2.56)	19.83	(2.56)	< .001
2. Elicit the patient’s perspective [3–15]	8.88	(1.93)	10.98	(1.63)	< .001
3. Demonstrate empathy [4–20]	13.02	(3.34)	14.36	(2.43)	< .001
4. Invest in the end [10–50]	31.84	(5.95)	31.84	(35.70)	< .001
Overall [23–115]	71.28	(12.09)	80.87	(10.77)	< .001

Open in a new tab

* Values are mean (standard deviation)

Table 4. Correlation of the cross-cultural adaptation of the 4-Habit Coding Scheme into French and the physician–patient relationship competence assessment scale (n = 200)^*.

	Physician–patient relationship competence assessment scale
Habit [range]	1. Understanding of the patient’s experience	2. Communication, consultation management	Overall
1. Invest in the beginning	.85	.61	.81
2. Elicit the patient’s perspective	.92	.47	.79
3. Demonstrate empathy	.77	.64	.76
4. Invest in the end	.70	.79	.78
Overall	.90	.74	.90

Open in a new tab

* Values are Pearson correlation coefficients. All P-values were < .001.

None of the four habits fulfilled the 0.70 inter-rater reliability criterion, with individual absolute-agreement ICC point estimates ranging from 0.42 to 0.64 (Table 5). Interestingly, the average absolute-agreement ICC for the 4-HCS and three out of four habits fulfilled the inter-rater reliability criterion of 0.70.

Table 5. Absolute-agreement intra-class correlation coefficient estimates for inter- and intra-rater reliability for the cross-cultural adaptation of the 4-Habit Coding Scheme into French (n = 200).

	Inter-rater ICC (95% CI)				Intra-rater ICC (95% CI)
Habit	Individual		Average		Individual		Average
1. Invest in the beginning	.45	(.36 to .53)	.71	(.63 to .77)	.59	(.52 to .65)	.85	(.81 to .88)
2. Elicit the patient’s perspective	.42	(.33 to .60)	.68	(.60 to .75)	.45	(.37 to .52)	.76	(.71 to .81)
3. Demonstrate empathy	.53	(.45 to .60)	.77	(.71 to .82)	.58	(.51 to .64)	.85	(.81 to .88)
4. Invest in the end	.64	(.58 to .71)	.84	(.80 to .88)	.71	(.66 to .76)	.91	(.89 to .93)
Overall	.60	(.53 to .67)	.82	(.77 to .86)	.72	(.67 to .77)	.91	(.89 to .93)

Open in a new tab

Abbreviations: CI, confidence interval; ICC, intra-class correlation.

The individual absolute-agreement ICC was 0.72 for intra-rater reliability of the 4-HCS score, with point estimates ranging from 0.45 to 0.71 across habits. The average absolute-agreement ICC for the 4-HCS and all habits fulfilled the 0.70 intra-rater reliability criterion.

After restricting our analytical sample to the most experienced raters (raters #1 and #2), the individual absolute-agreement ICC was 0.83 for inter-rater reliability and 0.89 for intra-rater reliability.

Discussion

The use of validated standardized instruments is advocated for assessing physicians’ interpersonal skills [8]. Indeed, validated instruments are likely to accurately reflect the concept to be measured while standardized instruments allow large-scale comparisons of physicians’ communication skills across studies [30, 35]. The 4-HCS was therefore developed and validated from over 1,025 video-recorded medical consultations across various settings in the US and Western Europe (Table 6). Yet, our study was the first to examine the underlying factor structure of the 4-HCS and to report on its cross-cultural adaptation into French.

Table 6. Primary studies reporting on the development or cross-cultural adaptation of the 4-Habit Coding Scheme.

Author, year	Krupat, 2006	Fossli Jensen, 2010	Clayton, 2011	Scholl, 2014	Present study
Country	USA	Norway	USA	Germany	France
Setting	Hospital	Hospital	Family practice clinics	Primary and specialty consultations	School of medicine
Recruitment period	1994	2007–2008	-	2009–2010	2017–2018
Experience	Resident and senior staff	Resident and senior staff	Resident and senior staff	Senior staff	Medical students
No. physicians	50	71	21	22	200
Simulated consultations	No	No	No	No	Yes
Recording	Video-recorded	Video-recorded	Video-recorded	Audio-taped	Video-recorded
No. consultations	100	497	174	54	200
Mean score
1. Invest in the beginning	17.7	-	24.1	12.1	18.8
2. Elicit the patient’s perspective	7.6	-	11.5	4.5	10.0
3. Demonstrate empathy	11.3	-	14.5	-	13.7
4. Invest in the end	31.5	-	33.0	26.5	33.9
Overall	68.0	60.1	83.1	-	76.4
Exploratory factor analysis	Not performed	Not performed	Not performed	Not performed	4 principal components (66.8% of overall variance)
Cronbach’s alpha
1. Invest in the beginning	.71	-	-	.41	.80
2. Elicit the patient’s perspective	.51	-	-	.46	.72
3. Demonstrate empathy	.81	-	-	.38	.87
4. Invest in the end	.61	-	-	.31	.88
Overall	-	.85	-	-	.94
Inter-rater reliability^*
1. Invest in the beginning	.70	-	.48	.83	.45
2. Elicit the patient’s perspective	.80	-	.57	.79	.42
3. Demonstrate empathy	.71	-	.39	.85	.53
4. Invest in the end	.69	-	.65	.78	.64
Overall	.72	.78	.72	-	.60
Intra-rater reliability^*
1. Invest in the beginning	-	-	-	.87	.59
Author, year	Krupat, 2006	Fossli Jensen, 2010	Clayton, 2011	Scholl, 2014	Present study
2. Elicit the patient’s perspective	-	-	-	.72	.45
3. Demonstrate empathy	-	-	-	.84	.58
4. Invest in the end	-	-	-	.83	.71
Overall	-	-	-	-	.72

Open in a new tab

* Inter-rater reliability was quantified by the Pearson correlation coefficient in the studies by Krupat et al. and Clayton et al., and by the intra-class correlation coefficient in the study by Fossli Jensen et al.

‡ Inter- and intra-rater reliability scores were quantified by computing absolute agreement intra-class correlation coefficient in the study by Scholl et al. In the present study, individual absolute agreement intra-class correlation coefficient was used for assessing inter- and intra-rater reliability.

Each of the 200 video-recorded medical consultations in this study was rated without missing data by three out of four different raters, reflecting the acceptability and feasibility of the French version of the 4-HCS. Although the consultations were simulated with standardized patients and involved 4-year medical students in this study, the mean overall and subscale 4-HCS scores were consistent with previous reports (Table 6).

Surprisingly, the French version of the 4-HCS yields better performance regarding internal consistency than the original US instrument and previous cross-cultural adaptations. Cronbach’s alpha was higher than 0.70 across habit subscales (median 0.83; range, 0.72–0.88) and compared favorably with those reported in the original US development (median, 0.66; range, 0.51–0.81) and German cross-cultural adaptation (median, 0.39; range, 0.31–0.46) (Table 6). This finding contrasts with previous studies that usually report worse performance for cross-cultural adaptations in comparison with original standardized instruments [30]. Krupat et al. were not concerned by the moderate internal consistency of the original version of the 4-HCS and speculated that successful training in communication would result in more effective communication and therefore greater internal consistency [8]. Another potential explanation for this discrepancy may be that the conceptual framework of the 4-HCS lacks generalizability or robustness and therefore does not apply equally to all to target populations’ experience. Noticeably, the French version of the 4-HCS was used for rating medical student consultations with standardized patients while the original US version was used for rating resident and senior staff medical consultations (Table 6).

To our knowledge, no previous study examined the factor structure of the 4-HCS. In confirmatory factor analysis, goodness-of-fit statistics did not support the hypothesized 4-dimension structure for the French version of the 4-HCS. Exploratory factor analysis resulted in two dimensions, with the merging of three conceptually related habits (Invest in the beginning, Elicit the patient’s perspective, Demonstrate empathy) in a single dimension. Additionally, substantial cross-loading was observed for 15 out of 23 items, suggesting that the underlying factor structure of the French version of the 4-HCS was questionable.

These findings do not necessarily invalidate the postulated structure of the 4-HCS. Indeed, factor analysis can only discriminate uncorrelated constructs in a data set [36]. That three habits were correlated with each other in the present study sample does not imply that these scales measure the same concept [36]. The medical students participating in the current study were not trained with the Four Habit Model before their communication skills were assessed using the 4-HCS. This might explain why the factor structure for the French version departed from the postulated four-dimension structure of the 4-HCS. Yet, we cannot exclude that this issue is inherent to the original 4-HCS rather than being specific to our study sample. No previous study (including the original development study) examined the factor structure of the 4-HCS. Hence, there is a need for further investigation of the underlying factor structure of the original US version of the 4-HCS.

One item (16. Give clear explanation) was not allocated to any empirical dimension in exploratory factorial analysis and also yielded the lowest item-rest correlation, deteriorating the internal consistency of the corresponding habit scale. Altogether, these two observations question the relevance of this item, and therefore its removal from the French version of the 4-HCS should be discussed.

Evidence for convergent validity was provided by the expected correlation between the 4-HCS and physician–patient relationship competence assessment scale scores. Our observation that medical students more experienced in basic interview tasks yielded higher scores for all four habits supported the construct validity of the 4-HCS.

Comparisons of inter- and intra-reliability estimates were confounded by between-study heterogeneity in the types of correlation coefficients used. Only two studies, including the present one, used absolute-agreement ICC (Table 6). ICC is a recommended alternative to Pearson’s r coefficient correlation for assessing inter- or intra-reliability [14, 31]. The median absolute-agreement ICC for inter-rater (0.49, range, 0.42–0.64) and intra-rater (0.58, range, 0.45–0.71) reliability for the French version of the 4-HCS were lower than those reported in the German cross-cultural adaptation study (0.81, range, 0.78–0.85 and 0.83, range, 0.72–0.87, respectively). Lower inter-rater reliability might be explained by the use of simulated consultations with standardized patients, the use of video- rather than audio-taped consultations, or varying levels of rater experience in our study. We observed that the greater the level of rater experience, the higher the 4-HCS scores in our study. Inconsistent associations have been reported between experience or seniority and communication scores in objective structured clinical examinations [37]. After restricting our analytical sample to the most experienced raters (i.e., raters #1 and #2), ICC estimates were higher than 0.80 for inter-rater reliability.

Interestingly, substantial improvement in inter-rater (median, 0.74; range, 0.68–0.84) and intra-rater (0.85, range, 0.76–0.91) reliability of habit scores was achieved using average absolute-agreement ICC estimates (Table 5). This latter finding supports the need for using two independent raters to rate communication skills with the 4-HCS based on video-recorded medical consultations, in routine practice.

This study has potential implications for routine assessment of physician communication skills using the cross-cultural adaptation of the 4-HCS into French. First, this study provides evidence on the validity of the 4-HCS scale for simulated consultations with standardized patients. Second, the French version of the 4-HCS scale demonstrated internal consistency, which was even higher than the original US version, allowing international comparisons. Third, our study questioned the hypothesized underlying factor structure of the 4-HCS. Since the 4-HCS was originally developed in the US, the conceptual framework and factor structure may lack generalizability or robustness and therefore may not apply equally well to other countries. Investigating the factor structure of the original US version is required to address this issue. Fourth, the moderate reliability of the French version of the 4-HCS implies that communication skills should be assessed by two independent experienced raters.

This cross-cultural adaptation study has a few caveats that must be considered. First, real patient encounters would have been preferable to standardized patient encounters for assessing communication skills, because of their authenticity [38]. Indeed, simulated consultations with standardized patients differ from real patient consultations in many ways [39]. Simulated patients are not suffering from illness and only attempt to portray the same through their acting. Moreover, recruiting and training standardized patients is time consuming in order to produce a high-quality simulation [40]. Although our study did not explore real patient encounters, standardized patients allowed us to provide a large number of students with reproducible and consistent clinical scenarios of the same level of difficulty [41].

Second, our study was conducted with 4^th-year medical students at a single university-affiliated hospital and the findings may not apply to other settings. A broader spectrum of participants would strengthen the confidence in the psychometric properties of this cross-cultural adaptation of the 4-HCS into French.

Third, the 4-HCS and physician–patient relationship competence assessment scale were completed by the same rater so that a halo effect cannot be excluded when assessing convergent validity of the two instruments. Demonstrating that 4-HCS scores correlate with (standardized) patient-reported experience of physician communication skills would provide stronger evidence of construct validity.

Fourth, the one-semester interval separating the two groups of students might be too short to assess the relationship between 4-HCS scores and experience in communication and basic interview tasks, although the associations were significant. The ability of the 4-HCS to discriminate subjects with varying levels of communication skills warrants further investigation. The sensitivity of the French version of the 4-HCS to changes in communication skills also remains to be documented by way of longitudinal studies.

Conclusions

The French version of the 4-HCS demonstrates satisfactory internal consistency but moderate reliability, requiring the use of two independent raters to assess communication skills of medical students based on video-recorded consultations with standardized patients. The empirical factor structure of the French version does not conform with the hypothesized habits of the original 4-HCS. Whether this reflects a specific issue with our cross-cultural adaptation study sample or a more general problem with the instrument is unclear and deserves further investigation.

Supporting information

S1 Appendix. Cross-cultural adaptation of the 4-HCS scale into French.

(DOCX)

Click here for additional data file.^{(30.3KB, docx)}

S2 Appendix. Scree plot of the 23 items of the 4-HCS scale.

(TIF)

Click here for additional data file.^{(8.4MB, tif)}

Acknowledgments

The authors are indebted to Pauline Bouchet and Ariane Martinez, Grenoble Alpes University, for their collaboration in setting up simulated medical consultations with standardized patients, Noémie Kaladzé and Alexis Dechosal, Grenoble Alpes University, for rating video-recorded medical consultations. The authors thank Alison Foote, Grenoble Alpes University Hospital, and Linda Northrup for their help in editing the manuscript in English.

Data Availability

The dataset are available in Harvard Dataverse (view at https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/IZTLWR).

Funding Statement

This study was supported by unrestricted grants from the MACSF Corporate Foundation (19,951€, AB: https://www.macsf-exerciceprofessionnel.fr/Publications-actions-et-mecenat/Fondation-MACSF The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1.Ong LML, de Haes JCJM, Hoos AM, Lammes FB. Doctor-patient communication: A review of the literature. Soc Sci Med. 1995;40: 903–918. 10.1016/0277-9536(94)00155-m [DOI] [PubMed] [Google Scholar]
2.Simpson SA, Shaw C, McNamara R. What is the most effective way to maintain weight loss in adults? BMJ. 2011;343: d8042 10.1136/bmj.d8042 [DOI] [PubMed] [Google Scholar]
3.Boissy A, Windover AK, Bokar D, Karafa M, Neuendorf K, Frankel RM, et al. Communication Skills Training for Physicians Improves Patient Satisfaction. J Gen Intern Med. 2016;31: 755–761. 10.1007/s11606-016-3597-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Zolnierek KBH, Dimatteo MR. Physician communication and patient adherence to treatment: a meta-analysis. Med Care. 2009;47: 826–834. 10.1097/MLR.0b013e31819a5acc [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Kelley JM, Kraft-Todd G, Schapira L, Kossowsky J, Riess H. The influence of the patient-clinician relationship on healthcare outcomes: a systematic review and meta-analysis of randomized controlled trials. PloS One. 2014;9: e94207 10.1371/journal.pone.0094207 [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Levinson W, Roter DL, Mullooly JP, Dull VT, Frankel RM. Physician-patient communication. The relationship with malpractice claims among primary care physicians and surgeons. JAMA. 1997;277: 553–559. 10.1001/jama.277.7.553 [DOI] [PubMed] [Google Scholar]
7.Brown JB, Boles M, Mullooly JP, Levinson W. Effect of clinician communication skills training on patient satisfaction. A randomized, controlled trial. Ann Intern Med. 1999;131: 822–829. 10.7326/0003-4819-131-11-199912070-00004 [DOI] [PubMed] [Google Scholar]
8.Krupat E, Frankel R, Stein T, Irish J. The Four Habits Coding Scheme: validation of an instrument to assess clinicians’ communication behavior. Patient Educ Couns. 2006;62: 38–45. 10.1016/j.pec.2005.04.015 [DOI] [PubMed] [Google Scholar]
9.Stein T, Frankel RM, Krupat E. Enhancing clinician communication skills in a large healthcare organization: a longitudinal case study. Patient Educ Couns. 2005;58: 4–12. 10.1016/j.pec.2005.01.014 [DOI] [PubMed] [Google Scholar]
10.Fossli Jensen B, Gulbrandsen P, Dahl FA, Krupat E, Frankel RM, Finset A. Effectiveness of a short course in clinical communication skills for hospital doctors: results of a crossover randomized controlled trial (ISRCTN22153332). Patient Educ Couns. 2011;84: 163–169. 10.1016/j.pec.2010.08.028 [DOI] [PubMed] [Google Scholar]
11.Blanch-Hartigan D, Hall JA, Roter DL, Frankel RM. Gender bias in patients’ perceptions of patient-centered behaviors. Patient Educ Couns. 2010;80: 315–320. 10.1016/j.pec.2010.06.014 [DOI] [PubMed] [Google Scholar]
12.Gulbrandsen P, Madsen HB, Benth JS, Laerum E. Health care providers communicate less well with patients with chronic low back pain—a study of encounters at a back pain clinic in Denmark. Pain. 2010;150: 458–461. 10.1016/j.pain.2010.05.024 [DOI] [PubMed] [Google Scholar]
13.Clayton MF, Latimer S, Dunn TW, Haas L. Assessing patient-centered communication in a family practice setting: how do we measure it, and whose opinion matters? Patient Educ Couns. 2011;84: 294–302. 10.1016/j.pec.2011.05.027 [DOI] [PubMed] [Google Scholar]
14.Fossli Jensen B, Gulbrandsen P, Benth JS, Dahl FA, Krupat E, Finset A. Interrater reliability for the Four Habits Coding Scheme as part of a randomized controlled trial. Patient Educ Couns. 2010;80: 405–409. 10.1016/j.pec.2010.06.032 [DOI] [PubMed] [Google Scholar]
15.Scholl I, Nicolai J, Pahlke S, Kriston L, Krupat E, Härter M. The German version of the Four Habits Coding Scheme—association between physicians’ communication and shared decision making skills in the medical encounter. Patient Educ Couns. 2014;94: 224–229. 10.1016/j.pec.2013.10.006 [DOI] [PubMed] [Google Scholar]
16.Catani RR, Valadares EDS, Lacombe JB, Mendonça TM da S, Silva CHM da, Paro HBM da S. Cross-cultural adaptation of the Four Habits Coding Scheme (4HCS) for teaching and assessing patient-centered communication skills in Brazil. Cad Saude Publica. 2018;34: e00013918 10.1590/0102-311X00013918 [DOI] [PubMed] [Google Scholar]
17.Epstein J, Santo RM, Guillemin F. A review of guidelines for cross-cultural adaptation of questionnaires could not bring out a consensus. J Clin Epidemiol. 2015;68: 435–441. 10.1016/j.jclinepi.2014.11.021 [DOI] [PubMed] [Google Scholar]
18.Harachi TW, Choi Y, Abbott RD, Catalano RF, Bliesner SL. Examining Equivalence of Concepts and Measures in Diverse Samples. Prev Sci. 2006;7: 359–368. 10.1007/s11121-006-0039-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Côté L, Savard A, Bertrand R. [Evaluation of the physician-patient relationship competence. Development and validation of an assessment instrument]. Can Fam Physician. 2001;47: 512–518. [PMC free article] [PubMed] [Google Scholar]
20.Asan O, Montague E. Using video-based observation research methods in primary care health encounters to evaluate complex interactions. Inform Prim Care. 2014;21: 161–170. 10.14236/jhi.v21i4.72 [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Ram P, Grol R, Rethans JJ, Schouten B, van der Vleuten C, Kester A. Assessment of general practitioners by video observation of communicative and medical performance in daily practice: issues of validity, reliability and feasibility. Med Educ. 1999;33: 447–454. 10.1046/j.1365-2923.1999.00348.x [DOI] [PubMed] [Google Scholar]
22.Nilsen S, Baerheim A. Feedback on video recorded consultations in medical teaching: why students loathe and love it—a focus-group based qualitative study. BMC Med Educ. 2005;5: 28 10.1186/1472-6920-5-28 [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Rouquette A, Falissard B. Sample size requirements for the internal validation of psychiatric scales. Int J Methods Psychiatr Res. 2011;20: 235–249. 10.1002/mpr.352 [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Walter SD, Eliasziw M, Donner A. Sample size and optimal designs for reliability studies. Stat Med. 17: 101–110. [DOI] [PubMed] [Google Scholar]
25.Bland JM, Altman DG. Correlation, regression, and repeated data. BMJ. 1994;308: 896 10.1136/bmj.308.6933.896 [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Terwee CB, Bot SDM, de Boer MR, van der Windt DAWM, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60: 34–42. 10.1016/j.jclinepi.2006.03.012 [DOI] [PubMed] [Google Scholar]
27.Nunnally JC, Bernstein I. Psychometric Theory. 3rd ed. New York: McGraw-Hill; 1994. [Google Scholar]
28.Bentler PM. Comparative fit indexes in structural models. Psychol Bull. 1990;107: 238–246. 10.1037/0033-2909.107.2.238 [DOI] [PubMed] [Google Scholar]
29.Hancock GR, Mueller RO. Structural equation modeling: A second course CT: Information Age. Greenwich; 2006. [Google Scholar]
30.Boussat B, François P, Gandon G, Giai J, Seigneurin A, Perneger T, et al. Inconsistencies Between Two Cross-Cultural Adaptations of the Hospital Survey on Patient Safety Culture Into French. J Patient Saf. 2017. 10.1097/PTS.0000000000000443 [DOI] [PubMed] [Google Scholar]
31.Kottner J, Audigé L, Brorson S, Donner A, Gajewski BJ, Hróbjartsson A, et al. Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed. J Clin Epidemiol. 2011;64: 96–106. 10.1016/j.jclinepi.2010.03.002 [DOI] [PubMed] [Google Scholar]
32.Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86: 420–428. 10.1037//0033-2909.86.2.420 [DOI] [PubMed] [Google Scholar]
33.Hallgren KA. Computing Inter-Rater Reliability for Observational Data: An Overview and Tutorial. Tutor Quant Methods Psychol. 2012;8: 23–34. 10.20982/tqmp.08.1.p023 [DOI] [PMC free article] [PubMed] [Google Scholar]
34.McGraw KO, P S. Forming inferences about some intraclass correlation coefficients. Psychol Methods. 1996;1: 30–46. 10.1037/1082-989X.1.1.30 [DOI] [Google Scholar]
35.Gjersing L, Caplehorn JRM, Clausen T. Cross-cultural adaptation of research instruments: language, setting, time and statistical considerations. BMC Med Res Methodol. 2010;10: 13 10.1186/1471-2288-10-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Perneger TV, Leplège A, Etter JF. Cross-cultural adaptation of a psychometric instrument: two methods compared. J Clin Epidemiol. 1999;52: 1037–1046. 10.1016/s0895-4356(99)00088-8 [DOI] [PubMed] [Google Scholar]
37.Chong L, Taylor S, Haywood M, Adelstein B-A, Shulruf B. Examiner seniority and experience are associated with bias when scoring communication but not examination skills in Objective Structured Clinical Examinations (OSCEs) in Australia. J Educ Eval Health Prof. 2018. 10.3352/jeehp.2018.15.17 [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Bokken L, Rethans J-J, Jöbsis Q, Duvivier R, Scherpbier A, van der Vleuten C. Instructiveness of real patients and simulated patients in undergraduate medical education: a randomized experiment. Acad Med J Assoc Am Med Coll. 2010;85: 148–154. 10.1097/ACM.0b013e3181c48130 [DOI] [PubMed] [Google Scholar]
39.Bokken L, Rethans J-J, Scherpbier AJJA, van der Vleuten CPM. Strengths and weaknesses of simulated and real patients in the teaching of skills to medical students: a review. Simul Healthc J Soc Simul Healthc. 2008;3: 161–169. 10.1097/SIH.0b013e318182fc56 [DOI] [PubMed] [Google Scholar]
40.Barrows HS. An overview of the uses of standardized patients for teaching and evaluating clinical skills. AAMC. Acad Med J Assoc Am Med Coll. 1993;68: 443–453. 10.1097/00001888-199306000-00002 [DOI] [PubMed] [Google Scholar]
41.Collins RMH J. P. AMEE Medical Education Guide No. 13: real patients, simulated patients and simulators in clinical examinations. Med Teach. 1998;20: 508–521. 10.1080/01421599880210 [DOI] [Google Scholar]

PLoS One. doi: 10.1371/journal.pone.0230672.r001

Decision Letter 0

Gian Mauro Manzoni

22 Oct 2019

PONE-D-19-24879

Cross-cultural adaptation of the 4-Habits Coding Scheme into French to assess physician communication skills

PLOS ONE

Dear Dr. BELLIER,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

We would appreciate receiving your revised manuscript by Dec 06 2019 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Gian Mauro Manzoni, Ph.D., Psy.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This brief manuscript reports upon a French-language validation of an instrument for assessing physicians’ communication to patients (4-Habits Coding Scheme, 4HCS). The findings and the previous literature are clearly laid out in easily digested formats (e.g. informative tables). I only have three substantive concerns.

1. Unlike previous work with this coding scheme, the authors rightly use ICC to assess interrater reliability. However, it is problematic that those ICC reliability scores are so low. The low reliability implies that you may not be accurately measuring physician communication. Can you estimate the amount of noise in your ratings? That is, how much of the variance in the four habits might be due to inconsistent measurement? Also, I would like to hear more about the differences between raters, including greater contextualization of the ratings in the literature.

2. Actual patient interactions differ from enacted consultations in many important ways. The examination of simulated interactions in the current study is not necessarily a problem, but it is distinct. Thus, the research report should more fully explore how such an application of 4HCS contrasts with a genuine doctor-patient meeting.

3. Kaiser developed the four habits to guide physicians’ interactions with patients. Then, researchers assessed their communication with the 4HCS, studying the extent to which they employed those target strategies. How similar is that original context in which the 4HCS was developed to the medical school where your data were collected? To what extent does the curriculum include the four habits? I ask this question in light of the factor analysis finding only two, rather than the four, dimensions of the 4HCS. One might not hypothesize finding all four habits in a population where physician training differs substantially from the original setting.

4. Some minor suggestions. Line 59 “supporting that” is awkward. Try “supporting the conclusion that” or “suggesting that.” Lines 214-5: double negative maybe unavoidable here, but you could add a line to increase clarity. Same suggestion for line 290. Lines 219-220: Are these thresholds common? Cite someone to support their appropriateness.

Best of luck on your manuscript.

Reviewer #2: Thank you for allowing me to review your article. I find the manuscript overall in decent shape. The objectives and aims of your study are clear. The study is itself useful. I did have several issues and questions I summarize below that led me to recommend that you revise and resubmit.

1. Provide evidence for the cultural (in)variance of the 4-HCS. The tool has been translated into several languages. Can you briefly summarize the psychometric findings of these instruments (e.g., what were CFA results in these studies)? In yet other words, would you expect, on the basis of these past studies, to reproduce the factor structure that was generated with the US American samples?

2. What *is* the potential of the 4-HCS?

3. There is arguably a problematic issue with the 4-HCS: it does not help physicians learn communication skills, which are behaviors. The tool merely assesses whether trained observers can identify four stages of the physician-patient conversation. HOW these stages are enacted is neither coded, nor assessed; nor does the tool assess how "well* the physician, say, demonstrates empathy; that is what does "good" nonverbal behavior mean? Indeed, the generality and broadness of the scale is what makes it useful. I would certainly not use it to train communication competency.

4. What was the length of the standardized patient conversations?

5. I don't quite understand why you conducted the CFA first? It seems to me that we usually run a structural model first before we run a measurement model? It seems to me that if you are predicting four factors, you want to use CFA procedures. You may not need the EFA. But perhaps I am not quite understanding your analyses. I admit, I had a bit of a hard time following your data write-up since it's all not quite in APA style.

6. I am missing a discussion of the practical implications. So what about cross-cultural differences in US and French samples for the measures? What does that mean for physicians and patients and communication skills.

7. In all, the manuscript could probably profit from a really strong edit.

Thank you for letting me review this manuscript. I'd be delighted to review it once, should the editor extend a revise and resubmit.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2020 Apr 16;15(4):e0230672. doi: 10.1371/journal.pone.0230672.r002

Author response to Decision Letter 0

5 Dec 2019

Reviewer #1

This brief manuscript reports upon a French-language validation of an instrument for assessing physicians’ communication to patients (4-Habits Coding Scheme, 4HCS). The findings and the previous literature are clearly laid out in easily digested formats (e.g. informative tables). I only have three substantive concerns.

We are grateful to this reviewer for his/her kind assessment of our study.

We agree with this reviewer that ICC estimates were disappointingly low for interrater reliability in our study. This observation might reflect heterogeneity in the level of clinical experience across raters. Consistent with Krupat et al., we recruited senior physicians as well as medical students for rating interpersonal skills using the 4-HCS. A significant trend towards higher scores was found as the level of clinical experience increased, with mean overall scores ranging from 66.97 (SD 10.26) to 93.42 (SD 13.46) out of 115 for rater #4 (a medical student) and rater #1 (a senior physician), respectively. Noticeably, this trend was the opposite of that reported in a previous study (see Chong et al. J Educ Eval Health Prof 2018.). After restricting our analytical sample to the most experienced raters (raters #1 and #2), ICC estimates were higher than 0.80 for inter-rater reliability.

To address this reviewer’s comment, we incorporated this new information in the Results and Discussion sections of the revised manuscript (page 20, line 340 and page 24, line 427).

The Reviewer is right that actual physician–patient interactions differ from consultations with standardized patients in many ways (Cleland et al. 2009).

As suggested, we elaborated on this point in the Discussion section of the revised manuscript (page 22, line 371).

This reviewer is right that the 4 Habits Coding Scheme (4-HCS) is based on the conceptual framework of the Four Habits Model, a training program that was developed for teaching effective communication to clinicians. We acknowledge that the medical students participating in our study were not trained with the Four Habits Model before their communication skills were assessed using the 4-HCS. This might explain why the factor structure for the French version departed from the hypothesized 4-dimension structure of the 4-HCS. However, it should be underlined that no previous study (including the original development study of this scale) has examined the factor structure of the 4-HCS study. A paragraph addressing this issue was added to the Discussion section of the revised manuscript (page 23, line 400).

We have made the proposed corrections (page 4, line 59 and page 11, line 216).

5. Lines 219-220: Are these thresholds common? Cite someone to support their appropriateness.

The following reference supporting the threshold used for Varimax rotation was incorporated into the list of citations of the revised manuscript (page 11, line 221).

Reviewer #2

Thank you for allowing me to review your article. I find the manuscript overall in decent shape. The objectives and aims of your study are clear. The study is itself useful. I did have several issues and questions I summarize below that led me to recommend that you revise and resubmit.

We are grateful to this Reviewer for his/her kind assessment of our study.

Since the publication of the original US version, the 4-HCS has been translated into different languages, including Norwegian, German, and Brazilian Portuguese, and used across various settings. To our knowledge, only three studies (including the US development study) reported on the psychometric properties of the 4-HCS and none examined the underlying factor structure. The 4-HCS developers recommended further examining its validity and reliability in different settings and populations (Krupat E, et al. Patient Educ Couns 2006)

To address this reviewer’s concern, we summarized the psychometric properties for the original US version and the cross-cultural adaptations of the 4-HCS in Table 6. We also emphasized that our study was the first to examine the factor structure of the 4-HCS (page 5, line 83 and page 23, line 405).

2. What *is* the potential of the 4-HCS?

Because the 4-HCS scale is a standardized, validated, and reliable instrument, it has the potential for assessing baseline physician communication skills and/or measuring the effectiveness of a training program in altering physician communication skills. Importantly, the 4-HCS overcomes the challenges of unstructured assessment of physician–patient interaction during medical consultations, which is a complex task because of less tangible aspects of communication, often referred to as interpersonal skills (e.g., expression of humanistic attitudes such as understanding, supportiveness, and empathy). As suggested, we clarified this expression in the rationale of the revised manuscript (Page 4, line 80).

Actually, the 4-HCS scale is an assessment tool but not a training program by itself. The 4-HCS scale provides physicians with feedback on their own performance regarding basic medical interview tasks but does not help acquire communication skills. A training program remains to be developed and implemented for teaching effective communication skills.

This reviewer might have misunderstood that each item of the 4-HCS is rated on a 5-point Likert scale, ranging from 1 to 5, with higher scores denoting better performance. In the original US version, the midpoint (i.e., 3) and the two endpoints (i.e., 1 and 5) were anchored, with specific behavioral descriptions. To address this reviewer’s concern, we reported the translation of specific behavioral descriptions into French, in Appendix #1.

4. What was the length of the standardized patient conversations?

The median duration was 8 min (range, 4–19 min) per video-recorded consultation. This information was provided in the Results section of the original version of our manuscript (page 13, line 264).

According to current guidelines addressing the cross-cultural adaptation of standardized instruments (Nunnaly and Bernstein. 1994), we first performed structural equation modeling of the 23 items assigned to the four intended habits to determine whether our data fit with the hypothesized internal structure of the original US version of the 4-HCS. Then we performed exploratory factor analysis in order to examine possible alternative structures to the original US version. The following reference supporting our CFA and EFA strategy was incorporated into the list of citations of the revised manuscript (page 10, line 205 and page 11, line 221).

Our study has potential implications for routine assessment of physician communication skills using the cross-cultural adaptation of the 4-HCS into French. First, our study provides evidence on the validity of 4-HCS scale for assessing communication skills based on simulated consultations with standardized patients. Second, the French version of the 4-HCS scale demonstrated internal consistency, which was even higher than the original US version, allowing large-scale international comparisons of study results. Third, our study questioned the hypothesized underlying factor structure of the 4-HCS. Because the 4-HCS was originally developed in the US, the conceptual framework and factor structure may lack generalizability or robustness and therefore may not apply equally well to other countries. Investigating the factor structure of the original US version is required to address this issue. Fourth, the moderate reliability of the French version of the 4-HCS implies that communication skills should be assessed by two independent experienced raters. As requested, a paragraph was added summarizing the practical implications of our study in the Discussion section of the revised manuscript (page 25, line 438).

7. In all, the manuscript could probably profit from a really strong edit.

As suggested, the revised manuscript has been edited by a native English speaker.

PLoS One. doi: 10.1371/journal.pone.0230672.r003

Decision Letter 1

Gian Mauro Manzoni

16 Jan 2020

PONE-D-19-24879R1

Cross-cultural adaptation of the 4-Habits Coding Scheme into French to assess physician communication skills

PLOS ONE

Dear Dr. BELLIER,

We would appreciate receiving your revised manuscript by Mar 01 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

We look forward to receiving your revised manuscript.

Kind regards,

Gian Mauro Manzoni, Ph.D., Psy.D.

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Reviewer #1: Well done!

For my comment 2 (Actual patient interactions differ from enacted consultations) on the previous submission, the revision doesn’t address my concern, and I’m afraid that my comment was unclear. Let me explain what I meant. As an expert in interpersonal communication, I can say that scholars who study health communication will likely be interested in your research. It would help you reach these readers if you gave a brief assessment of your choice of methods in this study of physicians’ communication skills. In particular, I’m suggesting that the authors clearly differentiate their work from a study with normal patients. There are clear advantages of using standardized patients, but there are also limitations, which remain unmentioned in the revision. So, I’m requesting a sentence or two to justify the choice to study enacted as opposed to genuine interactions. Perhaps something along the lines of “Although the study did not explore actual patient-doctor interaction, standardized patients allowed us to...”

Reviewer #2: Admittedly, I had several critical questions in my initial review of the manuscript. In my view, the authors have done a *most* effective job in addressing not only my concerns but the other reviewer's concerns as well. I am wholly satisfied with these changes.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

PLoS One. 2020 Apr 16;15(4):e0230672. doi: 10.1371/journal.pone.0230672.r004

Author response to Decision Letter 1

27 Jan 2020

Reviewer #1

We apologize for our failure in addressing this Reviewer’s comment adequately in the first revision of our manuscript. We agree with this reviewer that real patient encounters differ from consultations with standardized patients in many ways. As suggested, we have incorporated a paragraph addressing this important limitation of our study in the discussion of the revised manuscript (pages 25-26, lines 455-465). The revised section reads as follows:

“First, real patient encounters would have been preferable to standardized patient encounters for assessing communication skills, because of their authenticity [38]. Indeed, simulated consultations with standardized patients differ from real patient consultations in many ways [39]. Simulated patients are not suffering from illness and only attempt to portray the same through their acting. Moreover, recruiting and training standardized patients is time consuming in order to produce a high-quality simulation [40]. Although our study did not explore real patient encounters, standardized patients allowed us to provide a large number of students with reproducible and consistent clinical scenarios of the same level of difficulty [41].”

Reviewer #2

Admittedly, I had several critical questions in my initial review of the manuscript. In my view, the authors have done a *most* effective job in addressing not only my concerns but the other reviewer's concerns as well. I am wholly satisfied with these changes.

We are grateful to this Reviewer for his/her kind assessment of our manuscript.

PLoS One. doi: 10.1371/journal.pone.0230672.r005

Decision Letter 2

Gian Mauro Manzoni

6 Mar 2020

Cross-cultural adaptation of the 4-Habits Coding Scheme into French to assess physician communication skills

PONE-D-19-24879R2

Dear Dr. BELLIER,

We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.

Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.

Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

With kind regards,

Gian Mauro Manzoni, Ph.D., Psy.D.

Academic Editor

PLOS ONE

PLoS One. doi: 10.1371/journal.pone.0230672.r006

Acceptance letter

Gian Mauro Manzoni

25 Mar 2020

PONE-D-19-24879R2

Cross-cultural adaptation of the 4-Habits Coding Scheme into French to assess physician communication skills

Dear Dr. Bellier:

I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

For any other questions or concerns, please email plosone@plos.org.

Thank you for submitting your work to PLOS ONE.

With kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Gian Mauro Manzoni

Academic Editor

PLOS ONE

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Appendix. Cross-cultural adaptation of the 4-HCS scale into French.

(DOCX)

Click here for additional data file.^{(30.3KB, docx)}

S2 Appendix. Scree plot of the 23 items of the 4-HCS scale.

(TIF)

Click here for additional data file.^{(8.4MB, tif)}

Data Availability Statement

The dataset are available in Harvard Dataverse (view at https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/IZTLWR).

[pone.0230672.ref001] 1.Ong LML, de Haes JCJM, Hoos AM, Lammes FB. Doctor-patient communication: A review of the literature. Soc Sci Med. 1995;40: 903–918. 10.1016/0277-9536(94)00155-m [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref002] 2.Simpson SA, Shaw C, McNamara R. What is the most effective way to maintain weight loss in adults? BMJ. 2011;343: d8042 10.1136/bmj.d8042 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref003] 3.Boissy A, Windover AK, Bokar D, Karafa M, Neuendorf K, Frankel RM, et al. Communication Skills Training for Physicians Improves Patient Satisfaction. J Gen Intern Med. 2016;31: 755–761. 10.1007/s11606-016-3597-2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0230672.ref004] 4.Zolnierek KBH, Dimatteo MR. Physician communication and patient adherence to treatment: a meta-analysis. Med Care. 2009;47: 826–834. 10.1097/MLR.0b013e31819a5acc [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0230672.ref005] 5.Kelley JM, Kraft-Todd G, Schapira L, Kossowsky J, Riess H. The influence of the patient-clinician relationship on healthcare outcomes: a systematic review and meta-analysis of randomized controlled trials. PloS One. 2014;9: e94207 10.1371/journal.pone.0094207 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0230672.ref006] 6.Levinson W, Roter DL, Mullooly JP, Dull VT, Frankel RM. Physician-patient communication. The relationship with malpractice claims among primary care physicians and surgeons. JAMA. 1997;277: 553–559. 10.1001/jama.277.7.553 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref007] 7.Brown JB, Boles M, Mullooly JP, Levinson W. Effect of clinician communication skills training on patient satisfaction. A randomized, controlled trial. Ann Intern Med. 1999;131: 822–829. 10.7326/0003-4819-131-11-199912070-00004 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref008] 8.Krupat E, Frankel R, Stein T, Irish J. The Four Habits Coding Scheme: validation of an instrument to assess clinicians’ communication behavior. Patient Educ Couns. 2006;62: 38–45. 10.1016/j.pec.2005.04.015 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref009] 9.Stein T, Frankel RM, Krupat E. Enhancing clinician communication skills in a large healthcare organization: a longitudinal case study. Patient Educ Couns. 2005;58: 4–12. 10.1016/j.pec.2005.01.014 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref010] 10.Fossli Jensen B, Gulbrandsen P, Dahl FA, Krupat E, Frankel RM, Finset A. Effectiveness of a short course in clinical communication skills for hospital doctors: results of a crossover randomized controlled trial (ISRCTN22153332). Patient Educ Couns. 2011;84: 163–169. 10.1016/j.pec.2010.08.028 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref011] 11.Blanch-Hartigan D, Hall JA, Roter DL, Frankel RM. Gender bias in patients’ perceptions of patient-centered behaviors. Patient Educ Couns. 2010;80: 315–320. 10.1016/j.pec.2010.06.014 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref012] 12.Gulbrandsen P, Madsen HB, Benth JS, Laerum E. Health care providers communicate less well with patients with chronic low back pain—a study of encounters at a back pain clinic in Denmark. Pain. 2010;150: 458–461. 10.1016/j.pain.2010.05.024 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref013] 13.Clayton MF, Latimer S, Dunn TW, Haas L. Assessing patient-centered communication in a family practice setting: how do we measure it, and whose opinion matters? Patient Educ Couns. 2011;84: 294–302. 10.1016/j.pec.2011.05.027 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref014] 14.Fossli Jensen B, Gulbrandsen P, Benth JS, Dahl FA, Krupat E, Finset A. Interrater reliability for the Four Habits Coding Scheme as part of a randomized controlled trial. Patient Educ Couns. 2010;80: 405–409. 10.1016/j.pec.2010.06.032 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref015] 15.Scholl I, Nicolai J, Pahlke S, Kriston L, Krupat E, Härter M. The German version of the Four Habits Coding Scheme—association between physicians’ communication and shared decision making skills in the medical encounter. Patient Educ Couns. 2014;94: 224–229. 10.1016/j.pec.2013.10.006 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref016] 16.Catani RR, Valadares EDS, Lacombe JB, Mendonça TM da S, Silva CHM da, Paro HBM da S. Cross-cultural adaptation of the Four Habits Coding Scheme (4HCS) for teaching and assessing patient-centered communication skills in Brazil. Cad Saude Publica. 2018;34: e00013918 10.1590/0102-311X00013918 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref017] 17.Epstein J, Santo RM, Guillemin F. A review of guidelines for cross-cultural adaptation of questionnaires could not bring out a consensus. J Clin Epidemiol. 2015;68: 435–441. 10.1016/j.jclinepi.2014.11.021 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref018] 18.Harachi TW, Choi Y, Abbott RD, Catalano RF, Bliesner SL. Examining Equivalence of Concepts and Measures in Diverse Samples. Prev Sci. 2006;7: 359–368. 10.1007/s11121-006-0039-0 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0230672.ref019] 19.Côté L, Savard A, Bertrand R. [Evaluation of the physician-patient relationship competence. Development and validation of an assessment instrument]. Can Fam Physician. 2001;47: 512–518. [PMC free article] [PubMed] [Google Scholar]

[pone.0230672.ref020] 20.Asan O, Montague E. Using video-based observation research methods in primary care health encounters to evaluate complex interactions. Inform Prim Care. 2014;21: 161–170. 10.14236/jhi.v21i4.72 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0230672.ref021] 21.Ram P, Grol R, Rethans JJ, Schouten B, van der Vleuten C, Kester A. Assessment of general practitioners by video observation of communicative and medical performance in daily practice: issues of validity, reliability and feasibility. Med Educ. 1999;33: 447–454. 10.1046/j.1365-2923.1999.00348.x [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref022] 22.Nilsen S, Baerheim A. Feedback on video recorded consultations in medical teaching: why students loathe and love it—a focus-group based qualitative study. BMC Med Educ. 2005;5: 28 10.1186/1472-6920-5-28 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0230672.ref023] 23.Rouquette A, Falissard B. Sample size requirements for the internal validation of psychiatric scales. Int J Methods Psychiatr Res. 2011;20: 235–249. 10.1002/mpr.352 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0230672.ref024] 24.Walter SD, Eliasziw M, Donner A. Sample size and optimal designs for reliability studies. Stat Med. 17: 101–110. [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref025] 25.Bland JM, Altman DG. Correlation, regression, and repeated data. BMJ. 1994;308: 896 10.1136/bmj.308.6933.896 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0230672.ref026] 26.Terwee CB, Bot SDM, de Boer MR, van der Windt DAWM, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60: 34–42. 10.1016/j.jclinepi.2006.03.012 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref027] 27.Nunnally JC, Bernstein I. Psychometric Theory. 3rd ed. New York: McGraw-Hill; 1994. [Google Scholar]

[pone.0230672.ref028] 28.Bentler PM. Comparative fit indexes in structural models. Psychol Bull. 1990;107: 238–246. 10.1037/0033-2909.107.2.238 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref029] 29.Hancock GR, Mueller RO. Structural equation modeling: A second course CT: Information Age. Greenwich; 2006. [Google Scholar]

[pone.0230672.ref030] 30.Boussat B, François P, Gandon G, Giai J, Seigneurin A, Perneger T, et al. Inconsistencies Between Two Cross-Cultural Adaptations of the Hospital Survey on Patient Safety Culture Into French. J Patient Saf. 2017. 10.1097/PTS.0000000000000443 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref031] 31.Kottner J, Audigé L, Brorson S, Donner A, Gajewski BJ, Hróbjartsson A, et al. Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed. J Clin Epidemiol. 2011;64: 96–106. 10.1016/j.jclinepi.2010.03.002 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref032] 32.Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86: 420–428. 10.1037//0033-2909.86.2.420 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref033] 33.Hallgren KA. Computing Inter-Rater Reliability for Observational Data: An Overview and Tutorial. Tutor Quant Methods Psychol. 2012;8: 23–34. 10.20982/tqmp.08.1.p023 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0230672.ref034] 34.McGraw KO, P S. Forming inferences about some intraclass correlation coefficients. Psychol Methods. 1996;1: 30–46. 10.1037/1082-989X.1.1.30 [DOI] [Google Scholar]

[pone.0230672.ref035] 35.Gjersing L, Caplehorn JRM, Clausen T. Cross-cultural adaptation of research instruments: language, setting, time and statistical considerations. BMC Med Res Methodol. 2010;10: 13 10.1186/1471-2288-10-13 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0230672.ref036] 36.Perneger TV, Leplège A, Etter JF. Cross-cultural adaptation of a psychometric instrument: two methods compared. J Clin Epidemiol. 1999;52: 1037–1046. 10.1016/s0895-4356(99)00088-8 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref037] 37.Chong L, Taylor S, Haywood M, Adelstein B-A, Shulruf B. Examiner seniority and experience are associated with bias when scoring communication but not examination skills in Objective Structured Clinical Examinations (OSCEs) in Australia. J Educ Eval Health Prof. 2018. 10.3352/jeehp.2018.15.17 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0230672.ref038] 38.Bokken L, Rethans J-J, Jöbsis Q, Duvivier R, Scherpbier A, van der Vleuten C. Instructiveness of real patients and simulated patients in undergraduate medical education: a randomized experiment. Acad Med J Assoc Am Med Coll. 2010;85: 148–154. 10.1097/ACM.0b013e3181c48130 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref039] 39.Bokken L, Rethans J-J, Scherpbier AJJA, van der Vleuten CPM. Strengths and weaknesses of simulated and real patients in the teaching of skills to medical students: a review. Simul Healthc J Soc Simul Healthc. 2008;3: 161–169. 10.1097/SIH.0b013e318182fc56 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref040] 40.Barrows HS. An overview of the uses of standardized patients for teaching and evaluating clinical skills. AAMC. Acad Med J Assoc Am Med Coll. 1993;68: 443–453. 10.1097/00001888-199306000-00002 [DOI] [PubMed] [Google Scholar]

[pone.0230672.ref041] 41.Collins RMH J. P. AMEE Medical Education Guide No. 13: real patients, simulated patients and simulators in clinical examinations. Med Teach. 1998;20: 508–521. 10.1080/01421599880210 [DOI] [Google Scholar]

PERMALINK

Cross-cultural adaptation of the 4-Habits Coding Scheme into French to assess physician communication skills

Alexandre Bellier

Philippe Chaffanjon

Edward Krupat

Patrice Francois

José Labarère

Roles

Abstract

Background

Objective

Methods

Results

Conclusion

Introduction

Materials and methods

Study design

Cross-cultural adaptation of the 4-HCS into French

4-HCS scoring

Physician–patient relationship competence assessment scale

Study sample and data collection

Sample size

Statistical analysis

Descriptive statistics

Internal consistency

Internal structure

Construct validity

Reliability

Results

Fig 1. Flow of medical students and raters throughout the study.

Table 1. Summary statistics and internal consistency for the cross-cultural adaptation of the 4 Habit Coding Scheme into French (n = 200).

Table 2. Exploratory factor analysis for the 23 items of the cross-cultural adaptation of the 4-Habit Coding Scheme into French after orthogonal Varimax rotation (n = 200)*.

Table 3. Comparison of 4-Habit Coding Scheme scores for medical student consultations recorded during first and second semesters.

Table 4. Correlation of the cross-cultural adaptation of the 4-Habit Coding Scheme into French and the physician–patient relationship competence assessment scale (n = 200)*.

Table 5. Absolute-agreement intra-class correlation coefficient estimates for inter- and intra-rater reliability for the cross-cultural adaptation of the 4-Habit Coding Scheme into French (n = 200).

Discussion

Table 6. Primary studies reporting on the development or cross-cultural adaptation of the 4-Habit Coding Scheme.

Conclusions

Supporting information

Acknowledgments

Data Availability

Funding Statement

References

Decision Letter 0

Gian Mauro Manzoni

Roles

Author response to Decision Letter 0

Decision Letter 1

Gian Mauro Manzoni

Roles

Author response to Decision Letter 1

Decision Letter 2

Gian Mauro Manzoni

Roles

Acceptance letter

Gian Mauro Manzoni

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Table 2. Exploratory factor analysis for the 23 items of the cross-cultural adaptation of the 4-Habit Coding Scheme into French after orthogonal Varimax rotation (n = 200)^*.

Table 4. Correlation of the cross-cultural adaptation of the 4-Habit Coding Scheme into French and the physician–patient relationship competence assessment scale (n = 200)^*.