Br J Health Psychol. 2017 Aug 1;22(4):872–903. doi: 10.1111/bjhp.12260

Table 4.

Qualities, category, and the number of studies in which each quality was reported

| Group of quality | Quality | Category | Number of studies | Fidelity studies | Engagement studies |
|---|---|---|---|---|---|
| **Psychometric qualities** | | | | | |
| Use of multiple researchers | Coding | R | 11 | 20, 26, 27, 29, 33, 34, 45, 51, 58, 64 | 47 |
| | Data collection | | 3 | 6, 29, 31 | |
| | Develop measures | | 3 | 14, 26, 60 | |
| | Data analysis | | 2 | 10, 42 | |
| | Data entry | | 1 | 26 | |
| | Validate coding frame | | 1 | 26 | |
| Validity of measures | Validated | V | 9 | 21, 22, 34, 48, 51 | 4, 17, 25, 51 |
| | Not validated | | 8 | 2, 10, 34, 35, 41, 42, 50 | 13 |
| Use of independent researchers | Used – coding | R | 12 | 20, 22, 26, 27, 29, 34, 38, 45, 51, 55, 63, 64 | |
| | Not used – coding | | 1 | 58 | |
| | Used – develop measures | | 1 | 14 | |
| | Used – analysis | | 1 | 42 | |
| | Not used | V | 1 | 20 | |
| Measurement of conditions | All conditions (result output) | V | 8 | 7, 50 | 4, 13, 17, 18, 51, 53 |
| | All conditions (reported) | | 5 | 2, 48, 51 | 2, 3, 35 |
| | Intervention only | | 3 | 2, 24 | 24, 25 |
| Reliability of measures | Reliable | R | 6 | 21, 22, 48 | 4, 17, 51 |
| | Not reliable | | 5 | 2, 14, 23, 34, 50 | 2, 23 |
| Random selection of data | Randomly selected | V | 9 | 31, 40, 51, 55, 57, 58, 63, 64 | 52 (data entry) |
| | Not randomly selected | | 2 | 45, 48 | |
| Reporting of inter‐rater agreement | Reported – high | R | 3 | 26, 59 | 17 |
| | Not reported | | 2 | 29, 33 | |
| | Reported – poor to fair | | 2 | 27, 58 | |
| | Reported – fair to excellent | | 1 | 58 | |
| | Reported – no coder drift | | 1 | 26 | |
| Coding of sessions | A percentage | V | 7 | 33, 45, 51, 55, 57, 58, 63 | |
| | All | | 1 | 27 | |
| | Calculated inter‐rater agreement | R | 8 | 20, 26, 27, 29, 33, 58, 59 | 17 |
| Use of experts | Coding | V | 5 | 10, 21, 22, 36, 38 | |
| | Develop measures | | 1 | 27 | |
| | Not used – coding | | 1 | 27 | |
| | Checked % of data input | R | 1 | 10 | |
| Blinding | Coders | V | 3 | 7, 26, 48 | |
| | Not blinded | | 2 | 2 | 52 |
| | Researchers | | 1 | 15 | |
| | Participants | | 1 | 2 | |
| Measurement of content of intervention | Some aspects of intervention | V | 3 | 20, 38 | 36, 38 |
| | All aspects of intervention | | 2 | 33, 63 | |
| Problems with scoring criteria | Scoring criteria not sensitive | V | 2 | 20, 26 | |
| | No success cut‐off point | | 1 | 14 | |
| | Dichotomized responses reduce variability | | 1 | 25 | |
| | Measures may capture different aspects of fidelity | | 1 | 26 | |
| Standardization of procedure | Script | V | 2 | 34, 66 | |
| | Data entry | | 1 | | 52 |
| | Coding guidelines | | 1 | 64 | |
| | Not used standardized procedure | | 1 | 33 | |
| | Not used standardized measure | | 1 | | 52 |
| Self‐report bias | | V | 4 | 10, 26, 26, 30 | |
| | | R | 2 | 5 | 4 |
| Sampling | Across all providers | V | 2 | 27, 45 | |
| | Across all sites | | 1 | 10 | |
| | Across all sites (purposively) | | 1 | 33 | |
| | Across all participants | | 1 | 27 | |
| | Balanced facilitator and gender (purposively) | | 1 | 26 | |
| Audit | Data collection | R | 1 | 6 | |
| | Data analysis | | 1 | 6 | |
| | Coding | | 1 | 20 | 20 |
| | Data entry | V | 1 | 23 | |
| | Recordings | | 1 | 40 | |
| Missing responses | Missing responses | V | 1 | 15 | |
| Trained researchers | Trained coders | V | 3 | 7, 27, 58 | |
| | Trained researcher (data collection) | | 1 | | 52 |
| Observation effects | | V | 4 | 22, 26, 27, 34 | |
| Use of one researcher | Coding | R | 1 | 38 | |
| | Trained observers | | 1 | 34 | |
| Revised coding guidelines | | R | 3 | 20, 26, 48 | |
| | | V | 1 | 33 | |
| Team meetings | | R | 4 | 1, 6, 23, 36 | 23 |
| Recording of sessions | All sessions | V | 2 | 40, 55 | |
| | % of sessions | | 1 | 35 | |
| Triangulation | Method | V | 2 | 34, 42 | |
| | Researcher | | 1 | 42 | |
| Problems with analysis plan | Did not control for provider | V | 1 | 36 | |
| | Missing responses excluded | | 1 | 10 | |
| Social desirability | | V | 3 | 22 | 13, 52 |
| Objective verification | | V | 2 | 15, 43 | |
| | | R | 1 | 12 | |
| Used coding guidelines | | R | 2 | 20, 27 | |
| Analysis consideration – coded missing responses as no adherence | | V | 1 | 15 | |
| Independently validated coding frame | | V | 1 | 26 | |
| Measurement differences – observation and self‐report | | V | 1 | 26 | |
| Measurement period – year after intervention | | V | 1 | 25 | |
| Piloted coding guidelines | | V | 1 | 26 | |
| Practice period before recording | | V | 1 | 27 | |
| Pre‐specified dates for recordings | | V | 1 | 27 | |
| Statistician involved in sampling (stratified) | | V | 1 | 10 | |
| Training before recording may overestimate adherence | | V | 1 | 58 | |
| Piloted measure | | V | 1 | 34 | |
| Provided a reason for inter‐rater agreement | | R | 1 | 27 | |
| Supervision | | R | 1 | 58 | |
| Measures were internally consistent indicating content validity | | R+V | 1 | 27 | |
| **Implementation qualities** | | | | | |
| Resource challenges | Time restrictions | P | 4 | 5, 20, 27, 62 | |
| | Technical difficulties | P | 3 | 5, 5, 58 | |
| | Financial restrictions | P | 2 | 5, 27 | |
| | Sharing Dictaphones | P | 1 | 45 | |
| Providers’ attitudes | Dislike paperwork | A | 1 | 10 | |
| | Fear of discouraging participants | A | 1 | 27 | |
| | Nerves | A | 1 | 27 | |
| | Report participants behaving differently | A | 1 | 27 | |
| | Positive attitudes | A | 1 | 42 | |
| | Additional work | A | 1 | 62 | |
| | Not enthusiastic | A | 1 | 62 | |
| Measurement of content of intervention | Telephone calls not assessed due to difficulty | P | 1 | 38 | |
| | Measure cannot capture non‐verbal data | P | 1 | 20 | |
| Problems with documentation | No record of responses | P | 2 | 10, 58 | |
| | Providers did not document everything | | 1 | 10 | |
| | No record of refusals | A+P | 1 | 27 | |
| Missing responses | Missing responses | P | 1 | 10, 10 (different aspects) | |
| Problems with sampling | Low recruitment | P | 1 | 60 | |
| Problems with analysis plan | Analysis not feasible | P | 1 | 10 | |
| Incentives | Incentives used | P | 2 | 15, 52 | |
| | Incentives required | P | 1 | 62 | |
| Feedback to providers | | P | 2 | 21, 27 | |
| Feedback delay | | P | 1 | 38 | |
| Forgetting to return data | | P | 1 | 15 | |
| Logbook showed that not all steps were applied | | P | 1 | 42 | |
| Paper and digital version of measures given | | P | 1 | 5 | |
| Need simpler coding guidelines to achieve agreement | | P | 1 | 27 | |
| Reviewed fidelity after trial | | P | 1 | 45 | |
| Participants – dislike paperwork | | A | 1 | 15 | |
| Did not do a cost analysis | | C | 1 | 13 | |
| Cost of materials | | C | 1 | 37 | |
| **Both psychometric and implementation qualities** | | | | | |
| Problems with scoring criteria | Lack of clarity on items | V+P | 1 | 25 | |
| Missing responses | Missing responses | V+P | 1 | 58 | |
| Use of one researcher | Data collection | R+P | 2 | 5 | 52 |
| Problems with sampling | Selection bias | V+A | 1 | 2 | 2 |
| | Not randomly selected | V+P | 1 | 27 | |

This table is ordered by the number of studies reporting a quality within each ‘group of quality’ (e.g., ‘use of multiple researchers’), from most to least frequent. The numbers in this table will not add up to the total number of included studies, as some studies reported information on multiple qualities.

R = reliability; V = validity; A = acceptability; P = practicality; C = cost.
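
Several rows above (e.g., ‘Reporting of inter‐rater agreement’ and ‘Calculated inter‐rater agreement’) refer to agreement statistics computed between coders. As an illustrative aside, not taken from any of the reviewed studies, the sketch below shows how percent agreement and Cohen’s kappa are commonly computed for two coders; the coder names and ratings are hypothetical.

```python
from collections import Counter

def percent_agreement(ratings_a, ratings_b):
    """Proportion of items on which two coders gave the same rating."""
    matches = sum(a == b for a, b in zip(ratings_a, ratings_b))
    return matches / len(ratings_a)

def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(ratings_a)
    p_observed = percent_agreement(ratings_a, ratings_b)
    freq_a = Counter(ratings_a)
    freq_b = Counter(ratings_b)
    # Chance agreement: probability both coders pick the same category
    # if each rated independently at their own base rates.
    p_chance = sum(
        (freq_a[cat] / n) * (freq_b[cat] / n)
        for cat in set(freq_a) | set(freq_b)
    )
    return (p_observed - p_chance) / (1 - p_chance)

# Hypothetical adherence codes for six recorded sessions.
coder_1 = ["adherent", "adherent", "not", "adherent", "not", "adherent"]
coder_2 = ["adherent", "not", "not", "adherent", "not", "adherent"]

print(f"Percent agreement: {percent_agreement(coder_1, coder_2):.2f}")  # 0.83
print(f"Cohen's kappa:     {cohens_kappa(coder_1, coder_2):.2f}")       # 0.67
```

Because kappa discounts the agreement expected by chance, a study can report high raw agreement yet only a ‘poor to fair’ kappa, which is consistent with the spread of agreement levels shown in the table.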
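
The final psychometric row notes that one study’s measures were ‘internally consistent’. Internal consistency is usually quantified with Cronbach’s alpha; a minimal sketch follows, again with made-up item scores rather than data from the review.

```python
def cronbachs_alpha(item_scores):
    """Cronbach's alpha for item_scores: one inner list per item,
    aligned across the same respondents."""
    k = len(item_scores)           # number of items
    n = len(item_scores[0])        # number of respondents

    def variance(xs):
        mean = sum(xs) / len(xs)
        return sum((x - mean) ** 2 for x in xs) / (len(xs) - 1)

    # Total score per respondent across all items.
    totals = [sum(item[i] for item in item_scores) for i in range(n)]
    item_variance_sum = sum(variance(item) for item in item_scores)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))

# Hypothetical scores on a 3-item fidelity checklist for 4 respondents.
items = [
    [2, 3, 4, 4],
    [2, 4, 3, 5],
    [3, 3, 4, 4],
]
print(f"Cronbach's alpha: {cronbachs_alpha(items):.2f}")  # 0.80
```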