Guaiac‐based faecal occult blood tests versus faecal immunochemical tests for colorectal cancer screening in average‐risk individuals

Esmée J Grobbee; Pieter HA Wisse; Eline H Schreuders; Aafke Roon; Leonie Dam; Ann G Zauber; Iris Lansdorp-Vogelaar; Wichor Bramer; Sarah Berhane; Jonathan J Deeks; Ewout W Steyerberg; Monique E Leerdam; Manon CW Spaander; Ernst J Kuipers

doi:10.1002/14651858.CD009276.pub2

. 2022 Jun 6;2022(6):CD009276. doi: 10.1002/14651858.CD009276.pub2

Guaiac‐based faecal occult blood tests versus faecal immunochemical tests for colorectal cancer screening in average‐risk individuals

Esmée J Grobbee ^1,^✉, Pieter HA Wisse ¹, Eline H Schreuders ¹, Aafke Roon ², Leonie Dam ¹, Ann G Zauber ³, Iris Lansdorp-Vogelaar ⁴, Wichor Bramer ⁵, Sarah Berhane ⁶, Jonathan J Deeks ⁶, Ewout W Steyerberg ⁴, Monique E Leerdam ¹, Manon CW Spaander ¹, Ernst J Kuipers ¹

Editor: Cochrane Colorectal Group

PMCID: PMC9169237 PMID: 35665911

Abstract

Background

Worldwide, many countries have adopted colorectal cancer (CRC) screening programmes, often based on faecal occult blood tests (FOBTs). CRC screening aims to detect advanced neoplasia (AN), which is defined as CRC or advanced adenomas. FOBTs fall into two categories based on detection technique and the detected blood component: qualitative guaiac‐based FOBTs (gFOBTs) and faecal immunochemical tests (FITs), which can be qualitative and quantitative. Screening with gFOBTs reduces CRC‐related mortality.

Objectives

To compare the diagnostic test accuracy of gFOBT and FIT screening for detecting advanced colorectal neoplasia in average‐risk individuals.

Search methods

We searched CENTRAL, MEDLINE, Embase, BIOSIS Citation Index, Science Citation Index Expanded, and Google Scholar. We searched the reference lists and PubMed‐related articles of included studies to identify additional studies.

Selection criteria

We included prospective and retrospective studies that provided the number of true positives, false positives, false negatives, and true negatives for gFOBTs, FITs, or both, with colonoscopy as reference standard. We excluded case‐control studies. We included studies in which all participants underwent both index test and reference standard ("reference standard: all"), and studies in which only participants with a positive index test underwent the reference standard while participants with a negative test were followed for at least one year for development of interval carcinomas ("reference standard: positive"). The target population consisted of asymptomatic, average‐risk individuals undergoing CRC screening. The target conditions were CRC and advanced neoplasia (advanced adenomas and CRC combined).

Data collection and analysis

Two review authors independently screened and selected studies for inclusion. In case of disagreement, a third review author made the final decision. We used the Rutter and Gatsonis hierarchical summary receiver operating characteristic model to explore differences between tests and identify potential sources of heterogeneity, and the bivariate hierarchical model to estimate sensitivity and specificity at common thresholds: 10 µg haemoglobin (Hb)/g faeces and 20 µg Hb/g faeces. We performed indirect comparisons of the accuracy of the two tests and direct comparisons when both index tests were evaluated in the same population.

Main results

We ran the initial search on 25 June 2019, which yielded 63 studies for inclusion. We ran a top‐up search on 14 September 2021, which yielded one potentially eligible study, currently awaiting classification.

We included a total of 33 "reference standard: all" published articles involving 104,640 participants. Six studies evaluated only gFOBTs, 23 studies evaluated only FITs, and four studies included both gFOBTs and FITs. The cut‐off for positivity of FITs varied between 2.4 μg and 50 µg Hb/g faeces. For each Quality Assessment of Diagnostic Accuracy Studies (QUADAS)‐2 domain, we assessed risk of bias as high in less than 20% of studies. The summary curve showed that FITs had a higher discriminative ability than gFOBTs for AN (P < 0.001) and CRC (P = 0.004). For the detection of AN, the summary sensitivity of gFOBTs was 15% (95% confidence interval (CI) 12% to 20%), which was significantly lower than FITs at both 10 μg and 20 μg Hb/g cut‐offs with summary sensitivities of 33% (95% CI 27% to 40%; P < 0.001) and 26% (95% CI 21% to 31%, P = 0.002), respectively. Results were simulated in a hypothetical cohort of 10,000 screening participants with 1% CRC prevalence and 10% AN prevalence. Out of 1000 participants with AN, gFOBTs missed 850, while FITs missed 670 (10 μg Hb/g cut‐off) and 740 (20 μg Hb/g cut‐off). No significant differences in summary specificity for AN detection were found between gFOBTs (94%; 95% CI 92% to 96%), and FITs at 10 μg Hb/g cut‐off (93%; 95% CI 90% to 95%) and at 20 μg Hb/g cut‐off (97%; 95% CI 95% to 98%). So, among 9000 participants without AN, 540 were offered (unnecessary) colonoscopy with gFOBTs compared to 630 (10 μg Hb/g) and 270 (20 μg Hb/g) with FITs. Similarly, for the detection of CRC, the summary sensitivity of gFOBTs, 39% (95% CI 25% to 55%), was significantly lower than FITs at 10 μg and 20 μg Hb/g cut‐offs: 76% (95% CI 57% to 88%: P = 0.001) and 65% (95% CI 46% to 80%; P = 0.035), respectively. So, out of 100 participants with CRC, gFOBTs missed 61, and FITs missed 24 (10 μg Hb/g) and 35 (20 μg Hb/g). No significant differences in summary specificity for CRC were found between gFOBTs (94%; 95% CI 91% to 96%), and FITs at the 10 μg Hb/g cut‐off (94%; 95% CI 87% to 97%) and 20 μg Hb/g cut‐off (96%; 95% CI 91% to 98%). So, out of 9900 participants without CRC, 594 were offered (unnecessary) colonoscopy with gFOBTs versus 594 (10 μg Hb/g) and 396 (20 μg Hb/g) with FITs.

In five studies that compared FITs and gFOBTs in the same population, FITs showed a higher discriminative ability for AN than gFOBTs (P = 0.003).

We included a total of 30 "reference standard: positive" studies involving 3,664,934 participants. Of these, eight were gFOBT‐only studies, 18 were FIT‐only studies, and four studies combined both gFOBTs and FITs. The cut‐off for positivity of FITs varied between 5 µg to 250 µg Hb/g faeces. For each QUADAS‐2 domain, we assessed risk of bias as high in less than 20% of studies. The summary curve showed that FITs had a higher discriminative ability for detecting CRC than gFOBTs (P < 0.001). The summary sensitivity for CRC of gFOBTs, 59% (95% CI 55% to 64%), was significantly lower than FITs at the 10 μg Hb/g cut‐off, 89% (95% CI 80% to 95%; P < 0.001) and the 20 μg Hb/g cut‐off, 89% (95% CI 85% to 92%; P < 0.001). So, in the hypothetical cohort with 100 participants with CRC, gFOBTs missed 41, while FITs missed 11 (10 μg Hb/g) and 11 (20 μg Hb/g). The summary specificity of gFOBTs was 98% (95% CI 98% to 99%), which was higher than FITs at both 10 μg and 20 μg Hb/g cut‐offs: 94% (95% CI 92% to 95%; P < 0.001) and 95% (95% CI 94% to 96%; P < 0.001), respectively. So, out of 9900 participants without CRC, 198 were offered (unnecessary) colonoscopy with gFOBTs compared to 594 (10 μg Hb/g) and 495 (20 μg Hb/g) with FITs. At a specificity of 90% and 95%, FITs had a higher sensitivity than gFOBTs.

Authors' conclusions

FITs are superior to gFOBTs in detecting AN and CRC in average‐risk individuals. Specificity of both tests was similar in "reference standard: all" studies, whereas specificity was significantly higher for gFOBTs than FITs in "reference standard: positive" studies. However, at pre‐specified specificities, the sensitivity of FITs was significantly higher than gFOBTs.

Keywords: Humans, Adenoma, Adenoma/diagnosis, Colorectal Neoplasms, Colorectal Neoplasms/diagnosis, Early Detection of Cancer, Early Detection of Cancer/methods, Guaiac, Hemoglobins, Occult Blood, Prospective Studies, Retrospective Studies, Sensitivity and Specificity

Plain language summary

Which faecal blood test is more accurate in detecting bowel cancer and large polyps in population screening?

Background One of the most common types of cancer diagnosed is large bowel or colorectal cancer (CRC). Early detection, before symptoms appear, makes it easier to treat bowel cancer and increases the chance of survival. Taking part in a bowel cancer screening program can lead to early detection and removal of large or advanced polyps (advanced adenomas), which are considered to be a precursor to bowel cancer. Simple faecal tests are used to detect the presence of blood in stool, which could be an early sign of bowel cancer or polyps. Two types of faecal blood tests used in population screening are: guaiac‐based faecal occult blood tests (gFOBTs) and faecal immunochemical tests (FITs). Large, older studies have shown that screening with gFOBTs can reduce mortality. In a systematic review of the literature, we compared the accuracy of these two tests in order to assess which test gives the best results in population screening for bowel cancer, and, secondarily, for advanced neoplasia (which comprises bowel cancer and advanced polyps together).

Study characteristics We carried out a detailed search of online databases for studies that evaluated or compared (one of) these two tests in CRC screening. The review included only studies in average‐risk individuals over 40 years of age without symptoms. The reference standard to compare the test results with was a total endoscopic examination of the large bowel with a camera on a flexible tube passed through the anus (colonoscopy). We reviewed two types of studies: those in which all participants underwent both the stool test and colonoscopy; and those in which only participants with an unfavourable result on the stool test underwent colonoscopy (in these studies, participants who did not have a colonoscopy after the stool test were followed for at least one year to see if they would be diagnosed with colorectal cancer). The evidence is current until 25 June 2019. We ran a top‐up search on 14 September 2021, which yielded only one potentially eligible study, currently awaiting classification.

Test characteristics

The gFOBT 'screenees' – i.e. those who participate in screening – are instructed to collect two faecal samples from three consecutive bowel movements and to smear this on six stool panels. If there is blood in the stool, the panel changes colour. The number of coloured panels for referral to colonoscopy varies between screening programs. In most programs, a single coloured panel is sufficient for referral; however, in others, the number of panels is set at five out of six.

The FIT screenees are instructed to collect one faecal sample from one bowel movement, and to collect this with a brush or spatula into a tube. This tube is then send to a laboratory where the concentration of blood in the stool can be measured. Depending on the height of this concentration, above or below the so‐called cut‐off or threshold, the screenee is referred for colonoscopy. This cut‐off differs per screening program.

Key results We analysed 63 studies including almost 4 million individuals. The results of this review indicate that if, in theory, 10,000 people take part in screening with a faecal blood test and 100 people in this group have CRC:

‐ out of the 100 people with CRC, 24 will be missed in those being screened with FITs.

‐ out of the 100 people with CRC, 61 will be missed in those being screened with gFOBTs.

We also looked at participants with large polyps, CRC, or both. If, in theory, 10,000 people take part in screening with a faecal blood test and 1000 people in this group have large polyps, CRC, or both:

‐ out of the 1000 people with large polyps, CRC, or both, 850 will be missed in those being screened with gFOBTs.

‐ out of the 1000 people with large polyps, CRC, or both, 670 will be missed in those being screened with FITs.

In this theoretical group of 10,000 screenees:

‐ 594 people being screened with FITs will be offered an 'unnecessary' colonoscopy – unnecessary because they do not have CRC; and

‐ 594 people being screened with gFOBTs will be offered an 'unnecessary' colonoscopy.

From the results described above, we can see that FITs miss less CRC than gFOBTs, while an equal number of screenees from each type of blood test undergo an unnecessary colonoscopy.

How reliable are the results of the studies in this review? The results of the studies are reliable, as the included studies mostly met the quality criteria we specified before commencing the review.

Future research More research is needed to investigate whether, in the long term, FIT screening can reduce the number of bowel cancer cases and deaths, and to compare these findings with those from gFOBT screening.

Summary of findings

Summary of findings 1. Diagnostic accuracy of gFOBTs compared to FITs.

Diagnostic accuracy of gFOBTs compared to FITs
Participants/ population	Asymptomatic, average‐risk individuals over the age of 40 years undergoing colorectal cancer (CRC) screening
Prior testing	Only the results of the first screening round were included in this analysis
Settings	Population‐based colorectal cancer screening
Index test	Guaiac‐based faecal occult blood test (gFOBT) or faecal immunochemical test (FIT)*
Importance	Many screening programmes worldwide are currently changing from gFOBT‐ to FIT‐based screening
Reference standard	Colonoscopy is the reference standard for the diagnosis of colorectal cancer. If a colonoscopy was not completed, a CT‐colonography (or double‐contrast barium enema) was used as a surrogate.
Studies	Prospective and retrospective studies including average‐risk individuals invited for colorectal cancer screening "Reference standard: all": all screenees underwent both the index test and colonoscopy (n = 33) "Reference standard: positive": only screenees with a positive index test underwent colonoscopy and all screen negative participants were followed for at least one year (n = 30).
Quality concerns	Due to strict inclusion criteria, most studies were of high quality. Few studies had an unclear risk of bias due to poor reporting of a pre‐specified cut‐off value. Only three studies had a high risk of bias regarding the selection of study population. For these studies, sensitivity analyses showed no significant differences in outcome when excluding these studies from analyses.
Test/subgroup*
	Studies (participants)	Summary sensitivity (%, 95% CI)	Summary specificity (%, 95% CI)	Implications*
"Reference standard: all" studies
gFOBT advanced neoplasia	11 (17,622)	15 (12 to 20)	94 (92 to 96)	Out of 1000 participants with AN, 850 will be missed. Among those without AN, 540 will be offered an (unnecessary) colonoscopy.
FIT** advanced neoplasia	16 (49,081)	33 (27 to 40)	93 (90 to 95)	Out of 1000 participants with AN, 670 will be missed. Among those without AN, 630 will be offered an (unnecessary) colonoscopy.
gFOBT colorectal cancer	9 (17,340)	39 (25 to 55)	94 (91 to 96)	Out of 100 participants with CRC, 61 will be missed. Among those without CRC, 594 will be offered an (unnecessary) colonoscopy.
FIT** colorectal cancer	13 (42,335)	76 (57 to 88)	94 (87 to 97)	Out of 100 participants with CRC, 24 will be missed. Among those without CRC, 594 will be offered an (unnecessary) colonoscopy.
"Reference standard: positive" studies
gFOBT colorectal cancer	12 (1,349,890)	59 (55 to 64)	98 (98 to 99)	Out of 100 participants with CRC, 41 will be missed. Among those without CRC, 198 will be offered an (unnecessary) colonoscopy.
FIT** colorectal cancer	10 (1,274,115)	89 (85 to 92)	94 (92 to 95)	Out of 100 participants with CRC, 11 will be missed. Among those without CRC, 594 will be offered an (unnecessary) colonoscopy.
Conclusion	FITs have a higher sensitivity and similar specificity for both AN and CRC compared to gFOBTs in an average‐risk population. In "reference standard: positive" participants, sensitivity for CRC was higher with FITs, but specificity was higher with gFOBTs.
CAUTION: the results in this table should not be interpreted in isolation from the results of the individual included studies contributing to each summary test accuracy measure. These are reported in the main body of the review.
* In a hypothetical situation with prevalence of CRC of 1%, prevalence of AN of 10% and an assumed 100% participation rate in a population of n = 10,000 ** Results for a FIT cut‐off of 10 µg Hb/g faeces are shown
AN = advanced neoplasia; CI = confidence interval; CRC = colorectal cancer; FIT = faecal immunochemical test; gFOBT = guaiac‐based faecal occult blood test; Hb = haemoglobin

Study	Country	Test brand	gFOBT	FIT	FIT quant./quali.	FIT 10 µg	FIT 20 µg	AN	CRC	Other cut‐off	No. of stools
Ahlquist 2008a	USA	Hemoccult	+					+	+		3
Ahlquist 2008b	USA	Hemoccult Sensa	+					+	+		3
Aniwan 2015	Thailand	SD Bioloine FOB		+	Qualitative	+		+	+	50 ng Hb/mL = 10 µg/g	1
Aniwan 2017	Thailand	OC‐Sensor		+	Qualitative	+	+	+	+		1
Brenner 2012	Germany	HemoCARE (gFOBT)	+					+	+		3
	Germany	ImmoCARE‐C (FIT)		+	Qualitative			+	+	Unknown	3
Brenner 2013	Germany	Hemoccult	+					+	+		1
Brenner 2018	Germany	FOB‐Gold		+	Qualitative	+	+	+	+		1
Chang 2017	Taiwan	OC‐Sensor		+	Qualitative	+	+	+			1
Chen 2014	Taiwan	OC‐Light		+	Qualitative	+		+	+		1
Cheng 2002	Taiwan	OC‐Hemodia		+	Qualitative	+		+	+		1
Chiu 2013	Taiwan	OC‐Light		+	Qualitative	+		+	+		1
Chiu 2016a	Taiwan	OC‐Sensor		+	Qualitative		+	+	+		1
Cruz‐Correa 2007	USA	Hemoccult II	+					+	‐		3
De Wijkerslooth 2012	Netherlands	OC‐Sensor		+	Quantitative	+	+	+	+		1
Graser 2009	Germany	gFOBT brand not specified	+					+	+		3
	Germany	FOB‐Gold		+	Quantitative			+	+	14 ng Hb/mL = 2.4 µg/g	2
Haug 2011	Germany	RIDASCREEN		+	Quantitative	+	+	+	+		1
Hernandez 2014	Spain	OC‐Sensor		+	Quantitative	+	+	+	+		1
Hoepffner 2006		Hemoccult	+					+	‐		1
		Hb ELISA Immunodiagnostik		+	Unknown	+		+	‐	10 µg/g	1
Imperiale 2004	USA	Hemoccult II	+					+	+		3
Imperiale 2014	Canada and USA	OC‐Sensor		+	Quantitative		+	+	+		1
Khalid‐de Bakker 2011	Netherlands	OC‐Sensor		+	Quantitative	+	+	+			1
Kim 2017	South Korea	OC‐Sensor		+	Qualitative		+	+	+		1
Levy 2014b	USA	Inverness Clearview		+	Qualitative			+	+	50 µg/g faeces	1
Levy 2014c		Alere Clearview		+	Qualitative			+	+	6 µg/g faeces	1
Levy 2014a		Polymedco OC‐Light		+	Qualitative	+		+	+		1
Levy 2014d		Quidel QuickVue		+	Qualitative			+	+	50 µg/g faeces	1
Liebermann 2001	USA	Hemoccult II	+					+	+		3
Nakama 2000	Japan	Iatro Hemcheck		+	Qualitative				+	Not reported	2
Nakazato 2006	Japan	OC‐Hemodia			Quantitative			+	+	Not reported	2
Omata 2011	Japan	OC‐Micro		+	Quantitative	+	+	+	+		1
Park 2010	South Korea	Hemoccult II	+					+	+		3
	South Korea	OC‐Sensa		+	Quantitative	+	+	+	+		3
Ribbing Wilen 2019	Sweden	OC‐Sensor		+	Qualitative	+		+	+		2
Siripongpreeda 2016	Thailand	ABON Biopharm		+	Qualitative			+	+	6 µg/g faeces	1
Sohn 2005	South Korea	OC‐Hemodia		+	Quantitative		+		+	100 ng/mL = 20 µg/g faeces	1
Sung 2003	China	Hemoccult II	+					+	+		3
Wong 2014	Hong Kong	Hemosure		+	Qualitative			+	+	50 ng/mL = 50 µg/g faeces	1
Wu 2014	Taiwan	ACON Laboratories		+	Qualitative			+	+	50 ng/mL = 6 µg/g faeces	1

Test	Sensitivity at 90% specificity	Sensitivity at 95% specificity	Difference in accuracy (LR test, P value)
gFOBT	0.21 (0.17, 0.27)	0.14 (0.11, 0.18)	< 0.0001
FIT	0.36 (0.32, 0.41)	0.26 (0.22, 0.29)	< 0.0001

Date	Event	Description
13 January 2022	Feedback has been incorporated	Revisions incorporated
14 September 2021	New search has been performed	Top‐up search 14 September 2021

DOMAIN	PATIENT SELECTION	INDEX TEST	REFERENCE STANDARD	FLOW AND TIMING
Description	Describe methods of participant selection: explain selection of invitees (identified from general practitioner records or population registers Describe included patients (prior testing, presentation, intended use of index test and setting)	Describe the index test and how it was conducted and interpreted	Describe the reference standard and how it was conducted and interpreted. Describe if definition of advanced neoplasia is according to Cochrane Review protocol	Describe enrolment of study participants, randomisation (if applicable). Describe any participants who were excluded from the 2 x 2 table: describe the time interval between index test(s) and reference standard
Signalling questions (yes/no/unclear)	Was a consecutive or random sample of patients enrolled?	If a threshold was used, was it pre‐specified?	Were the reference standard results interpreted without knowledge of the results of the index tests?	Was there an appropriate interval between index test and reference standard?
	Was a case‐control design avoided?	Were non‐interpretable test results reported for the index test?	Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Were all participants included in the analysis?
	Did the study avoid inappropriate exclusions?			Were withdrawals from the study explained?
	Was the spectrum of invitees representative of CRC screening? (e.g. avoid not‐average‐risk participants)			Did screened persons receive the same reference standard irrespective of index test result?
Risk of bias: high/low/unclear	Could the selection of participants have introduced bias?	Could the conduct or interpretation of the index test have introduced bias?	Could the reference standard, its conduct, or its interpretation have introduced bias?	Could the participant flow have introduced bias?
Concerns regarding applicability: high/low/ unclear	Are there concerns that the included participants do not match the review question?	Are there concerns that the index test, its conduct, or interpretation differ from the review question?	Are there concerns that the target condition as defined by the reference standard does not match the review question?

Question	Response and weighting	Explanation
Patient selection
Was case‐control design avoided?	No = high risk of bias Yes = low risk of bias Unclear = unclear risk of bias	Diagnostic case‐control studies were considered inappropriate for this review because such studies are likely to overestimate diagnostic performance (Deeks 2013). Moreover, literature suggests that measures of accuracy may vary with the prevalence and stage‐distribution of the target condition (Leeflang 2009). For instance, the sensitivity of a test will often vary according to the severity of the detected disease (e.g. advanced CRCs are more easily detected with FOBTs than early‐stage tumours). This item was scored as YES if case‐control design was avoided, NO if the study was clearly a case‐design study or if this was mentioned in the article, and UNCLEAR if design of the study was unclear.
Did the study avoid inappropriate exclusions?	No = high risk of bias Yes = low risk of bias Unclear = unclear risk of bias	Inappropriate exclusions may lead to a potential bias; for example, overoptimistic estimates of diagnostic accuracy. Therefore, a study should preferably enrol all consecutive, or a random sample of, eligible participants with suspected disease.
Was the spectrum of invitees representative of CRC screening?	No = high risk of bias Yes = low risk of bias Unclear = unclear risk of bias	It was determined how invitees were recruited from the general population and whether this was representative for a nationwide CRC screening program. The item was scored as YES if the spectrum of invitees and the method of recruitment of study participants fulfilled the pre‐stated requirements which were described under the subheading 'Participants' in the Methods section. Studies were still eligible for inclusion and scored as YES if high‐risk individuals or screened participants aged < 40 years represented the minority of participants examined (i.e. < 5%), or if they formed an identifiable subset that could be excluded during data extraction. The item was scored as NO if the spectrum of invitees did not fulfil the pre‐stated requirements which are described under the subheading 'Participants' in the Methods section. The item was scored as UNCLEAR when there was insufficient information available to make a judgement either about the spectrum of invitees or the method of recruitment.
Index test
If a threshold was used, was it pre‐specified?	No = high risk of bias Yes = low risk of bias Unclear = unclear risk of bias	Selecting the test threshold to optimise sensitivity or specificity, or both, may lead to overoptimistic estimates of test performance, which is likely to be poorer in an independent sample of participants in whom the same threshold is used (Leeflang 2008).
Were non‐interpretable test results reported for the index test?	No = high risk of bias Yes = low risk of bias Unclear = unclear risk of bias	A FOBT may, for example, be non‐interpretable because the stool was applied erroneously to the test cards, or because the time between faecal sampling and arrival of the test at the laboratory was too long. Usually, in that case, a new test set will be sent to participants. This item was scored as YES if the number of non‐interpretable test results was stated, or if the number of index test results reported was in accordance with the number of participants. This item was scored as NO if it was stated that non‐interpretable test results occurred or were excluded, and if it was not reported how many. This item was scored as UNCLEAR if it was not possible to work out whether non‐interpretable test results occurred.
Reference standard
Were the reference standard results interpreted without knowledge of the results of the index test?	No = high risk of bias Yes = low risk of bias Unclear = unclear risk of bias	It could be hypothesised that knowing the index test result may have implications for more or less extensive searching for advanced neoplasia when the reference standard is being performed. This might positively influence the sensitivity of the index test and therefore diagnostic review bias may occur. Therefore, this item was scored as YES if the study clearly stated that the reference standard results were interpreted blind to the results of the index test. This item was scored as NO if it was clear that the reference standard was performed with knowledge of the index test. This was per definition the case for all included "reference type: positive" studies in which only screened individuals with a positive test result were referred for the reference standard. This item was scored as UNCLEAR if this information was not reported by the study.
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	No = high risk of bias Yes = low risk of bias Unclear = unclear risk of bias	The availability or absence of participant information and screenee characteristics such as age, gender, family history, presence or severity of symptoms, may influence the performance of the reference standard. Therefore, this item was scored as YES if it was clearly described that clinical data concerning the screened individual were available to the physician during performance of the reference standard. This item was scored as NO if clinical data were withheld or if more information than normally available was provided. This item was scored as UNCLEAR if this information about the availability of clinical data was not stated.
Flow and timing
Was there an appropriate time between index test(s) and reference standard to be sure that the target condition did not change?	No = high risk of bias Yes = low risk of bias Unclear = unclear risk of bias	With a polyp dwell time (i.e. defined as the average time for the transformation from a small adenomatous polyp to cancer) of approximately 10 years, CRC is generally a slow‐growing tumour (Winawer 1997). Therefore, disease progression bias is not likely to occur in a screening setting. The item was scored as YES if the delay between the performance of the index test and reference standard was on average 3 months, with a maximum of 12 months, and if for "reference type: positive" studies (detection of interval carcinomas for those with a negative index test), the follow‐up time after a negative test result was at least 12 months. The item was scored as NO if the time period between the performance of the index test and the reference standard was sufficiently long (i.e. more than 12 months) to assume that the disease status may have changed between the performance of the two tests or in case of "reference type: positive" studies, if the follow‐up time after a negative test result was shorter than 12 months. This item was scored as UNCLEAR if it was not clearly stated what the time period was between the reference standard and index test.
Did screened persons receive the same reference standard irrespective of index test result?	No = high risk of bias Yes = low risk of bias Unclear = unclear risk of bias	This item was scored as YES if the same reference standard was used in all screened individuals irrespective of the FOBT result. We scored this item as NO if the choice of reference standard varied between individuals. This is per definition the case for all included "reference type: positive" studies in which only individuals with a positive test index test were referred for the reference standard, and index negative screenees were followed by interval carcinomas. This item was scored as UNCLEAR if this information was not reported by the study.
Were all participants included in the analysis?	No = high risk of bias Yes = low risk of bias Unclear = unclear risk of bias	Not including a number of participants for analysis in the 2 x 2 table could potentially lead to bias. When participants that were not included differ systematically from those that were analysed, diagnostic test accuracy may differ.
Were withdrawals from the study explained?	No = high risk of bias Yes = low risk of bias Unclear = unclear risk of bias	Withdrawals included participants who dropped out from the study before the results of either the index test or reference standard were known, or when they were lost to follow‐up in case of verification of test results by interval carcinomas. This item was scored YES when it was clear what happened to all individuals from the moment of invitation until the results of the reference standard were available. This item was scored as NO when it appeared that some of the participants who entered the study did not complete the study, and these participants were not accounted for. This item was scored as UNCLEAR if it was not clear how many participants entered and, hence, if there were withdrawals.

Test	No. of studies	No. of participants
1 "Reference standard: all" gFOBT AN	11	17622
2 "Reference standard: positive" gFOBT CRC	12	1349890
3 "Reference standard: all" FIT10 AN	16	49018
4 "Reference standard: all" FIT20 AN	13	52318
5 "Reference standard: all" gFOBT CRC	9	17340
6 "Reference standard: all" FIT CRC	25	105744
7 "Reference standard: all" FIT10 CRC	13	42335
8 "Reference standard: all" FIT20 CRC	11	45823
9 "Reference standard: all" gFOBT_Hemoccult_II_AN	5	6781
10 "Reference standard: all" gFOBT_Hemoccult_Sensa_AN	1	3764
11 "Reference standard: all" gFOBT_Hemoccult_II_CRC	5	8767
12 "Reference standard: all" gFOBT_Hemoccult_Sensa_CRC	1	3764
13 "Reference standard: all" gFOBT_Hemoccult_AN	3	6155
14 "Reference standard: all" FIT_Iatro_Hemcheck_CRC	1	17664
15 "Reference standard: all" FIT AN	29	94849
16 "Reference standard: all" gFOBT_Hemoccult_CRC	2	5999
17 "Reference standard: all" FIT_OC_Sensor_50_AN	6	10847
18 "Reference standard: all" FIT_OC_Sensor_50_CRC	4	4320
19 "Reference standard: all" FIT_OC_Sensor_100_AN	7	31144
20 "Reference standard: all" FIT_OC_Sensor_100_CRC	5	24904
21 "Reference standard: all" FIT_OC_Light_50_AN	3	24609
22 "Reference standard: all" FIT_OC_Light_100_AN	0	0
23 "Reference standard: all" FIT_OC_Light_100_CRC	0	0
24 "Reference standard: all" FIT_OC_Light_50_CRC	3	24609
25 "Reference standard: all" FIT_OC_Sensa_100_CRC	1	770
26 "Reference standard: all" FIT_OC_Sensa_50_CRC	1	770
27 "Reference standard: all" FIT_OC_Sensa_100_AN	1	770
28 "Reference standard: all" FIT_OC_Sensa_50_AN	1	770
29 "Reference standard: all" FIT_OC_Micro_50_AN	1	1085
30 "Reference standard: all" FIT_ELISA_Immunodiagnostik	1	156
31 "Reference standard: all" FIT_RIDASCREEN_50_AN	1	2325
32 "Reference standard: all" FIT_RIDASCREEN_50_CRC	1	2325
33 "Reference standard: all" FIT_RIDASCREEN_100_AN	1	2325
34 "Reference standard: all" FIT_RIDASCREEN_100_CRC	1	2325
35 "Reference standard: all" FIT_OC_Hemodia_AN	3	11951
36 "Reference standard: all" FIT_OC_Hemodia_CRC	3	11951
37 "Reference standard: positive" FIT CRC	23	2318203
38 "Reference standard: positive" FIT_OC_Hemodia_CRC	3	177693
39 "Reference standard: positive" FIT 50 CRC	4	36620
40 "Reference standard: positive" FIT 100 CRC	10	1274115
41 "Reference standard: positive" gFOBT_Hemoccult_II_CRC	4	202822
42 "Reference standard: positive" gFOBT_Hemoccult_Sensa_CRC	1	2241
43 "Reference standard: positive" FIT_OC‐Micro_CRC	3	277059
44 "Reference standard: positive" gFOBT_Hemoccult_CRC	2	38922
45 "Reference standard: positive" FIT_OC_Sensor_CRC	10	1257867
46 "Reference standard: positive" FIT_Hemeselect_CRC	1	1489
47 "Reference standard: positive" FIT_Magstream_CRC	1	7355
48 "Reference standard: positive" gFOBT_Hema‐screen_CRC	3	1102123
49 "Reference standard: positive" FIT_HM‐Jack_CRC	2	247282
50 "Reference standard: positive" FIT_Monohaem_CRC	1	3365
51 "Reference standard: all" FIT_OCHEMCHECK_100_CRC	1	9989
52 "Reference standard: all" FIT_OCHEMCHECK_100_AN	1	9989
53 "Reference standard: all" FIT_Hemosure_50_AN	1	4127
54 "Reference standard: all" FIT_Hemosure_50_CRC	1	4127
55 "Reference standard: all" FIT_FOBGold_AN	2	3496
56 "Reference standard: all" FIT_FOBGold_CRC	2	3496
57 "Reference standard: all" FIT_InvernessClearview_AN	1	44
58 "Reference standard: all" FIT_InvernessClearview_CRC	0	0
59 "Reference standard: all" FIT_AlereClearview_AN	1	308
60 "Reference standard: all" FIT_AlereClearview_CRC	1	308
61 "Reference standard: all" FIT_ Quidel _QuickVue_AN	1	52
62 "Reference standard: all" FIT_ Quidel _QuickVue_CRC	0	0
63 "Reference standard: all" FIT_ImmoCARE‐C_AN	1	646
64 "Reference standard: all" FIT_ImmoCARE‐C_CRC	1	646
65 "Reference standard: all" gFOBT_HemoCARE_AN	1	646
66 "Reference standard: all" gFOBT_hemoCARE_CRC	1	646
67 "Reference standard: positive" FIT_167ug_CRC	1	7383
68 "Reference standard: positive" FIT_250ug_CRC	1	7397
69 "Reference standard: all" FIT_OC_Micro_50_CRC	1	1085
70 "Reference standard: all" FIT_OC_Micro_100_AN	1	1085
71 "Reference standard: all" FIT_OC_Micro_100_CRC	1	1085
72 "Reference standard: positive" gFOBT_Hemofec_CRC	1	1670
73 "Reference standard: all" Linked‐ROC FIT AN	5	4182
74 "Reference standard: all" Linked‐ROC gFOBT AN	5	4073
75 "Reference standard: positive" FIT_OC_FIT_FIT‐CHEK_CRC	1	319424
76 "Reference standard: positive" FIT_FOBGOLD	1	21245
77 "Reference standard: all" linked‐ROC gFOBT CRC	3	1682
78 "Reference standard: all" linked‐ROC all FIT CRC	3	1701
79 "Reference standard: positive" linked‐ROC gFOBT CRC	2	3159
80 "Reference standard: positive" linked‐ROC all FIT CRC	2	3205

*Study characteristics*
Patient Sampling	Design: prospective, blinded, multicenter, cross‐sectional study Setting: invitation through health care systems
Patient characteristics and setting	Population: asymptomatic, age 50 to 80 years, 48% male Country: USA Time period: 2001 to 2007
Index tests	Index test: gFOBT Brand: Hemoccult Sensa Method of collection: 3 consecutive stools, collected by using plastic buckets in toilet seat. After collection, performance of test. Multiple tests were conducted on the same group of individuals. Execution: interpretation by trained technicians. Rehydration not mentioned. Positivity threshold: 1/6 panels
Target condition and reference standard(s)	Reference test: colonoscopy performed by: experienced endoscopists % that underwent colonoscopy: 100 Definition advanced neoplasia: curable stage colorectal cancer, high‐grade dysplasia, adenomas >1 cm
Flow and timing	Enrolment and exclusions: 4482 enrolled. Excluded: 477 because of protocol violations, 171 because of incomplete colonoscopy, 68 stool not collected within 120 days, 2 distant metastasis CRC Number analysed: 3764
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	Yes
Could the conduct or interpretation of the index test have introduced bias?		Low risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Unclear
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Unclear
Were all patients included in the analysis?	No
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		High risk

*Study characteristics*
Patient Sampling	Design: cross‐sectional study Setting: participants of health promotion program performed FIT before screening colonoscopy in a university hospital
Patient characteristics and setting	Population (including mean age, % men): asymptomatic participants aged 50 to 75 years, mean age 61 years, 35% males Country: Bangkok, Thailand Time period: February 2013 to July 2014 Number analysed: 948
Index tests	Index test: qualitative FIT Brand: SD Bioline FOB, Standard Diagnostics Method of collection: participants collected a one‐time stool sample at home within 3 days before colonoscopy and returned the stool container on the day of colonoscopy. Execution: the FIT was interpreted by the medical laboratory scientist, who was blinded to the colonoscopy result Positivity threshold: 50 ng/mL
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: experienced endoscopists (> 1000 colonoscopies), blinded to FIT results % that underwent colonoscopy: 100% of analysed population Definition of advanced neoplasia: according to protocol
Flow and timing	Enrolment and exclusions: of the 963 eligible participants, 6 were excluded because of missing FIT and 9 because of poor bowel preparation. Number analysed: 948
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Unclear risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Unclear
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Unclear risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Unclear
Were all patients included in the analysis?	Yes
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Low risk

*Study characteristics*
Patient Sampling	Design: cross‐sectional study. Setting: participants of health promotion program performed FIT before screening colonoscopy in six university hospitals
Patient characteristics and setting	Population (including mean age, % men): asymptomatic participants aged 50 to 75 years, mean age 60 years, 38% males Country: Bangkok, Thailand Time period: December 2014 to June 2016 Number analysed: 1479
Index tests	Index test: qualitative FIT Brand: OC‐Sensor (Eiken, Japan) Method of collection: participants collected a one‐time stool sample at home within 3 days before colonoscopy and returned the stool container on the day of colonoscopy. No diet restrictions Execution: the FIT was analysed within 7 days after collection. The FIT was interpreted by the medical laboratory scientist, who was blinded to the colonoscopy result. Positivity threshold: multiple thresholds are reported
Target condition and reference standard(s)	Reference test: colonoscopy performed by: experienced endoscopists (> 1000 colonoscopies) % that underwent colonoscopy: 100% of analysed population Definition of advanced neoplasia: adenoma with high‐grade dysplasia, or villous adenoma (> 25 %), or > 10 mm size or CRC
Flow and timing	Enrolment and exclusions: of 1580 screened, 60 were excluded due to age > 75 years, missed stool collection and poor bowel preparation. Number analysed: 1520
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Unclear risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Unclear
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Unclear
Were all patients included in the analysis?	Yes
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Low risk

*Study characteristics*
Patient Sampling	Design: prospective cohort study Setting: Basque Colorectal Cancer Screening Programme
Patient characteristics and setting	Population: participants aged 50 to 69 and resident. Mean age and % male not given for complete cohort Country: Spain (Basque Country) Time period: 2009 to 2012
Index tests	Index test: quantitative FIT Brand: OC‐Sensor Micro and FOB‐Gold in the first screening round. Afterwards only OC‐Sensor Method of collection: not specified Execution: processed and analysed in centralised public laboratories under strict total quality management systems Positivity threshold: 20 µg Hb/g faeces for both tests
Target condition and reference standard(s)	Reference test: colonoscopy in case of positive test Performed by: expert specialists in referral public hospitals % that underwent colonoscopy: 94.7% of positive participants
Flow and timing	Enrolment and exclusions: 296,378 participants participated in screening, with 18,273 positive tests. 17,304 of 18,273 (94.7%) screen‐positive participants had colonoscopy. 1441 FIT‐positive participants underwent colonoscopy in FOB‐Gold group Number analysed: 295,409. 17,304 FIT‐positive participants underwent colonoscopy, of which 1441 in the FOB‐Gold group Identification of interval cancers: interval cancers, defined as CRC prior to a subsequent invitation, were identified from linkage to hospital discharges and population‐based cancer registries and pathology systems. Also, information provided about interval CRC within one or two years after negative test. Follow‐up time: 2 years
Comparative
Notes	Multiple tests were conducted on the same group of individuals
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Unclear risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	No
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	Yes
Were withdrawals from the study explained?	Unclear
Did screened persons receive the same reference standard irrespective of index test result?	No
Could the patient flow have introduced bias?		Unclear risk

Test	Sensitivity at 90% specificity	Sensitivity at 95% specificity	Difference in accuracy (LR test, P value)
gFOBT	0.23 (0.10, 0.42)	0.12 (0.07, 0.22)	0.003
FIT	0.47 (0.29, 0.66)	0.30 (0.18, 0.46)	0.003

Test	Sensitivity at 90% specificity	Sensitivity at 95% specificity	Difference in accuracy (LR test, P value)
gFOBT	0.80 (0.67, 0.89)	0.73 (0.63, 0.80)	0.009*
FIT	0.97 (0.95, 0.98)	0.89 (0.86, 0.91)	0.009*
* HSROC curves have different shape parameters

*Study characteristics*
Patient Sampling	Design: population‐based screening study Setting: National Health Service (NHS) bowel cancer screening program in England
Patient characteristics and setting	Population: women aged 60 to 74 and resident in England and registered in NHS. Mean age not given for complete cohort. 0% men Country: England Time period: 1 December 2006 to 31 March 2012
Index tests	Index test: qualitative gFOBT Brand: Hemascreen Method of collection: three separate stools deliver two small faecal samples each. In total six windows are tested. Execution: no dietary restrictions Positivity threshold: > 4 out of 6 windows positive is considered positive. When 1 to 4 windows are positive, a second (and third) test is offered. If at least 1 window is positive in this second or third collection, follow‐up evaluation is offered.
Target condition and reference standard(s)	Reference test: colonoscopy in case of positive test (default), flexible sigmoidoscopy and radiological investigations. In > 97% of FOB+ participants, colonoscopy is considered as most appropriate first follow‐up evaluation. Performed by: not specified % that underwent colonoscopy: 87% of positive participants (7911 out of 9133 participants)
Flow and timing	Enrolment and exclusions: 628,976 participants participated in the first round of screening. 1222 of 9133 screen‐positive participants had no colonoscopy. Number analysed: 627,754. 628,976 participants participated in the first round of screening. 7911 of 9133 positive participants underwent colonoscopy. Identification of interval cancers: interval cancers, defined as CRC within two years of a negative result, were identified from the national cancer registry. Follow‐up time: 2 years
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	No
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Unclear risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Unclear
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Unclear
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Unclear
Were all patients included in the analysis?	Yes
Were withdrawals from the study explained?	Unclear
Did screened persons receive the same reference standard irrespective of index test result?	No
Could the patient flow have introduced bias?		Unclear risk

*Study characteristics*
Patient Sampling	Design: prospective population‐based study Setting: 1st round of screening in six areas of department of Calvados, France. Invitation via general practitioners and occupational doctors
Patient characteristics and setting	Population: 165,000 people, aged 45 to 74 years, 51.1% male Country: France Time period: April 1991 to December 1994
Index tests	Index test: gFOBT Brand: Haemocullt II Method of collection: no dietary or drug restrictions were required Execution: all tests were mailed to a single centre and were processed without rehydration Positivity threshold: 1/6 panels
Target condition and reference standard(s)	Reference test: colonoscopy and in case of incomplete colonoscopy, DCBE, or follow‐up Performed by: not specified % that underwent colonoscopy: 63.2%. A total of 2020 positive tests, 1603 had a colonoscopy and DCBE (79.4%, of which 1277 had only colonoscopy (63.2%).
Flow and timing	Enrolment and exclusions: all inhabitants of selected department, no exclusions. 22 cancers were excluded because of missing data; not clear if this is only from participants with negative test or also from non‐participants / reference group. 71,307 completed the test (43.3%). Number analysed: 71,307 Identification of interval cancers: recorded by the local digestive cancer registry Definition of interval cancers: all the cancers diagnosed between 1991 and 1995 in people living in the department, whether they occurred in a person participating in the screening or not. Follow‐up time: at least 12 months for all and 24 months for 90.5%
Comparative
Notes	Only 63% colonoscopy, additional 16% had DBCE Calculated incidence based on FU
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Unclear risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	No
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	Unclear
Were withdrawals from the study explained?	No
Did screened persons receive the same reference standard irrespective of index test result?	No
Could the patient flow have introduced bias?		Unclear risk

*Study characteristics*
Patient Sampling	Design: observational study in 13 gastroenterological centres Setting: recruited participants who requested screening colonoscopy
Patient characteristics and setting	Population: asymptomatic population, mean age 62 years, 50.3% male Country: Germany Time period: 2008 to 2009
Index tests	Index test: qualitative FIT and gFOBT (participants underwent both tests) Brand: immoCARE‐C (FIT) hemoCARE (gFOBT) Method of collection: both tests were performed on three different stool samples; for the FIT test, three test sticks were given Execution: not specified Positivity threshold: not specified
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: 13 gastroenterology physician practices % that underwent colonoscopy: 100% Definition advanced neoplasia: adenomas > 10 mm, villous adenoma or carcinoma
Flow and timing	Enrolment and exclusions: 925 participants; 279 excluded because of missing data on colonoscopy or stool test results. Maximum of 310 days between index test and colonoscopy Number analysed: 646
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Unclear
Were non‐interpretable test results reported for the index test?	Yes
Could the conduct or interpretation of the index test have introduced bias?		Unclear risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Yes
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	No
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	No
Were all patients included in the analysis?	No
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Unclear risk

*Study characteristics*
Patient Sampling	Design: prospective study Setting: participants of a screening colonoscopy study (Blitz) in Germany, invited for additional stool tests
Patient characteristics and setting	Population: asymptomatic first‐time participants aged 50 to 79 years, mean age 63 years, 49% male Country: Germany Time period: 2005 to 2009
Index tests	Index test: gFOBT (+3 brands of FIT) Brand: HemOccult (Beckman Coulter, Krefeld Germany) Method of collection: no dietary restrictions, stool from one bowel movement, apply to 2 windows of 1 gFOBT test card Execution: by practice personnel, on average 4 days after stool collection and storage at room temperature. Rehydration not mentioned Positivity threshold: 1/2 windows of 1 slide (index test not conducted as usual: only one stool and only one of 3 cards used)
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: not specified % that underwent colonoscopy: 98% Definition advanced neoplasia: CRC + advanced adenoma (advanced adenoma not defined )
Flow and timing	Enrolment and exclusions: 2414 first‐time participants of screening colonoscopy for whom results of gFOBT, FIT, and colonoscopy were available. 102 excluded because stool sample on day of/after colonoscopy, 77 with histologically undefined polyps. Number analysed: 2235
Comparative
Notes	FIT data from this article not included in review as these data are included in the article by Haug 2011 Multiple tests were conducted on the same group of individuals in these studies (Haug 2011, Brenner 2013)
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		High risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Unclear
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Yes
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Unclear
Were all patients included in the analysis?	No
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Low risk

*Study characteristics*
Patient Sampling	Design: national screening program Setting: participants living in 19 municipalities in the Province of Florence attending FOBT screening
Patient characteristics and setting	Population: participants aged 50 to 70 years, 48% male Country: Italy Time period: January 2000 to December 2002
Index tests	Index test: quantitative FIT (latex agglutination test) Brand: OC‐Hemodia, developed with the OC‐Sensor Method of collection: not described Execution: not described Positivity threshold: 100 ng Hb/mL (20 µg Hb/g faeces)
Target condition and reference standard(s)	Reference test: colonoscopy or follow‐up Performed by: not specified % that underwent colonoscopy: out of 1097 positives, 959 (87%) accepted further workup with colonoscopy. In 171 cases, colonoscopy was incomplete; in 166 of these DCBE was performed. So 72% total underwent colonoscopy.
Flow and timing	Enrolment and exclusions: 24,913 participants performed 27,503 FITs; they were excluded in case of positive FIT and no colonoscopy and cancer after more than 1 year Number analysed: 27,365 Identification of interval cancers: through linkage to Tuscany cancer registry Definition ofinterval cancers: all cancers after negative FIT or positive FIT with negative colonoscopy Follow‐up time: 1 to 2 years (2000 to 2003)
Comparative
Notes	24,913 participants, performed 27,503 tests; 2590 extra tests are unexplained
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Low risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Unclear
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Yes
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	No
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	No
Were withdrawals from the study explained?	Unclear
Did screened persons receive the same reference standard irrespective of index test result?	No
Could the patient flow have introduced bias?		Unclear risk

*Study characteristics*
Patient Sampling	Design: prospective Setting: screening colonoscopies and FITS at the Health Management Center of National Taiwan University Hospital
Patient characteristics and setting	Population: asymptomatic participants (only ethnic Chinese), mean age 59 years (SD 7.0 years), 51% male Country: Taiwan Time period: August 2010 to November 2014
Index tests	Index test: quantitative FIT Brand: OC‐Sensor (Eiken, Japan) Method of collection: no dietary restrictions, stool from one bowel movement Execution: by screenees at home 2 days before the colonoscopy using the collecting stick, and to refrigerate the samples until colonoscopy. Samples were sent the same day to the clinical laboratory Positivity threshold: 100 ng Hb/mL (20 µg Hb/g faeces)
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: experienced endoscopists ( > 5000 colonoscopies performed) % that underwent colonoscopy: 100% Definition advanced neoplasia: only adenomas included, advanced was defined as > 1 cm in size, tubulovillous or villous components and high‐grade dysplasia (WHO criteria)
Flow and timing	Enrolment and exclusions: 36,720 screenees received a screening colonoscopy during the study period, the majority (n = 21,605) had received a screening colonoscopy in the past 5 years. People with family or personal history of CRC, those younger than 50 years, those who did not submit a stool sample, those with an incomplete colonoscopy or those with invasive cancer (n = 3) were excluded. Number analysed: 6198
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Unclear risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Yes
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	Yes
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Low risk

*Study characteristics*
Patient Sampling	Design: retrospective analysis of prospectively enrolled population Setting: asymptomatic Taiwanese population that underwent health checkup
Patient characteristics and setting	Population: asymptomatic population, 40 years or older, mean age 53.6 years old, 56% men. Country: Taiwan Time period: 2008 to 2009
Index tests	Index test: qualitative FIT Brand: OC‐Light (Eiken, Japan) Method of collection: one stool sample Execution: automated analysis Positivity threshold: 10 µg Hb/g faeces (50 ng/mL buffer)
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: experienced endoscopists % that underwent colonoscopy: 100% Definition of advanced neoplasia: same as protocol
Flow and timing	Enrolment and exclusions: 8258 consecutively enrolled, 2162 excluded. Exclusion criteria: under 40 years old (n = 1584), incomplete or missing information (n = 578), CRC, IBD, visible rectal bleeding or menstruation during time of stool collection. Number analysed: 6096
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Low risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Yes
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	No
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Low risk

*Study characteristics*
Patient Sampling	Design: cross‐sectional analysis of voluntary screening program Setting: asymptomatic adults who underwent colonoscopy in a health screening program
Patient characteristics and setting	Population: asymptomatic adults, mean age 47 years (we used sub‐analysis of participants 40+ years old), 81% male Country: Taiwan Time period: 1997 to 2000
Index tests	Index test: qualitative FIT Brand: OC‐Hemodia Method of collection: diet instructions for 3 days, stool collection on day of colonoscopy Execution: not specified Positivity threshold: not specified (qualitative test so pre‐specified)
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: well‐trained and board‐certified gastroenterologists % that underwent colonoscopy: 100% Definition advanced neoplasia: advanced neoplasm: polyp larger than 1 cm, polyps with villous or severe dysplastic features and cancer
Flow and timing	Enrolment and exclusions: 7617 examined: 206 excluded because of exclusion criteria, 7411 examined for study. We used sub‐analysis of participants 40+ years old. Exclusion criteria: known history of colorectal cancer, IBD, rectal bleeding, recent changes in bowel habits, weight loss, anaemia, or positive FOBT on prior examinations. Individuals with a personal history of colonic polyps or a family history of colon cancer were not excluded. Number analysed: 5067
Comparative
Notes	Data for participants > 40 years received from authors (n = 5067) CFOBB test results not extracted (= toluidine test, not guaiac)
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Unclear
Could the selection of patients have introduced bias?		Unclear risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Low risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Unclear
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	No
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Unclear
Were all patients included in the analysis?	No
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Low risk

*Study characteristics*
Patient Sampling	Design: nationwide CRC screening program Setting: residents were invited for biennial FIT screening at 810 screening sites
Patient characteristics and setting	Population: aged 50 to 69 years, mean age 58.42 (OC‐Sensor), 38.4% males Country: Taiwan Time period: 1 January 2004 to 31 December 2009
Index tests	Index test: quantitative FIT Brand: OC‐Sensor Method of collection: 1 test at home in 1 stool Execution: in one of the 125 qualified laboratories Positivity threshold: 20 µg Hb/g faeces
Target condition and reference standard(s)	Reference test: colonoscopy and DCBE, or follow‐up Performed by: not described % that underwent colonoscopy: 86.2% Definition of advanced neoplasia: according to protocol
Flow and timing	Enrolment and exclusions: 747,076 participants underwent screening; exclusions not clearly described Number analysed: 747,076 Identification of interval cancers: Taiwan cancer registry Definition of interval cancers: invasive CRC diagnosed after a negative FIT and < 2 years to the next screen Follow‐up time: > 3 years
Comparative
Notes	Multiple tests were conducted on the same group of individuals
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Low risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	No
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Unclear
Were all patients included in the analysis?	Yes
Were withdrawals from the study explained?	No
Did screened persons receive the same reference standard irrespective of index test result?	No
Could the patient flow have introduced bias?		Unclear risk

*Study characteristics*
Patient Sampling	Design: prospective screening study Setting: asymptomatic adults invited for health checkup
Patient characteristics and setting	Population: asymptomatic, age 50 years and older, mean age 59.8 years, 59% male Country: Taiwan Time period: 2005 to 2010
Index tests	Index test: qualitative FIT Brand: OC‐LIGHT V‐PC50 and V‐PH80 Method of collection: 1 stool sample 1 day before colonoscopy, no diet restrictions Execution: automated analysis with qualitative result at cut‐off of 50 ng Hb/mL Positivity threshold: 10 µg Hb/g faeces (50 ng Hb/mL)
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: experienced endoscopists % that underwent colonoscopy: 100% Definition advanced neoplasia: according to protocol
Flow and timing	Enrolment and exclusions: target population was 33,263. Participants who were younger than 50 years old (n = 12.997), had CRC, and received colectomy (n = 96), did not submit faecal samples (n = 1555) or had incomplete colonoscopy (n = 319) were excluded. Number analysed: 18,296
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Unclear
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	Unclear
Could the conduct or interpretation of the index test have introduced bias?		Low risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Yes
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	Yes
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Low risk

PERMALINK

Guaiac‐based faecal occult blood tests versus faecal immunochemical tests for colorectal cancer screening in average‐risk individuals

Esmée J Grobbee

Pieter HA Wisse

Eline H Schreuders

Aafke Roon

Leonie Dam

Ann G Zauber

Iris Lansdorp-Vogelaar

Wichor Bramer

Sarah Berhane

Jonathan J Deeks

Ewout W Steyerberg

Monique E Leerdam

Manon CW Spaander

Ernst J Kuipers

Abstract

Background

Objectives

Search methods

Selection criteria

Data collection and analysis

Main results

Authors' conclusions

Plain language summary

Summary of findings

Summary of findings 1. Diagnostic accuracy of gFOBTs compared to FITs.

Background

Target condition being diagnosed

Index test(s)

Guaiac‐based faecal occult blood tests

Faecal immunochemical tests (FITs)

Qualitative FITs

Quantitative FITs

Clinical pathway

Prior test(s)

Role of index test(s)

Alternative test(s)

Rationale

Objectives

Methods

Criteria for considering studies for this review

Types of studies

Participants

Index tests

Comparator test

Target conditions

Reference standards

Exclusion criteria

Search methods for identification of studies

Electronic searches

Searching other resources

Data collection and analysis

Selection of studies

Data extraction and management

Assessment of methodological quality

1. Overview of test characteristics per study for "reference standard: all" studies.

2. Overview of test characteristics per study for "reference standard: positive" studies.

Statistical analysis and data synthesis

Descriptive analysis

Inferential statistics

Investigations of heterogeneity

Sensitivity analyses

Assessment of reporting bias

Other published versions of this review

Summary of findings

Results

Results of the search

Initial searches

1.

Top‐up search

Included studies

Studies using "reference standard: all"

Studies using "reference standard: positive"

Methodological quality of included studies

Studies using "reference standard: all"

2.

3.

Patient selection domain

Index test domain

*Study characteristics*
Patient Sampling	Design: multicentre prospective study Setting: participants visiting bowel cancer screening centres or general medical outpatient clinics
Patient characteristics and setting	Population: asymptomatic individuals aged > 40 years, mean age 57.8 years, 51% male. Participants had to fill out the Asia‐Pacific Colorectal Cancer Screening score and were divided in to low‐, medium‐, and high‐risk groups, according to the score. Country: 15 sites, including Australia, Brunei, China, Hong Kong, Japan, South Korea, Malaysia, Pakistan, Philipines, Singapore, Taiwan, and Thailand Time period: December 2011 to December 2013
Index tests	Index test: quantitative FIT Brand: OC‐Sensor (Eiken, Japan). However, the OC‐Sensor was not available in all countries; therefore, other available FIT kits were used. Method of collection: not described Execution: by screenees following the manual's instructions Positivity threshold: for OC‐Sensor, 20 µg Hb/g faeces; for other FIT kits, the cut‐off value as recommended by the manufacturer was used (not further specified in article)
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: not described % that underwent colonoscopy: 100% Definition advanced neoplasia: CRC or adenomas which were > 1 cm in size, tubulovillous or villous components or high‐grade dysplasia (WHO criteria)
Flow and timing	Enrolment and exclusions: exclusion criteria included history of colonic diseases that increase the risk of CRC; a CRC screening examination in the past 5 years; severe premorbid illness Number analysed: 5657
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		High risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Unclear
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Yes
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Unclear
Were all patients included in the analysis?	Unclear
Were withdrawals from the study explained?	Unclear
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		High risk

*Study characteristics*
Patient Sampling	Design: prospective cross‐sectional study Setting: university hospital, referred for colonoscopy
Patient characteristics and setting	Population: asymptomatic people referred for colonoscopy, age 50 years and older, 37% male Country: USA Time period: 2002 to 2003
Index tests	Index test: 2x gFOBT Brand: EZ‐direct and Hemoccult II Method of collection: 2 consecutive bowel movements 1 week before colonoscopy, diet restrictions 3 days prior to test Execution: non‐rehydrated, interpretation by 2 investigators independently Positivity threshold: EZ‐direct: 1/3 cards; Hemoccult: 1/6 windows positive
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: endoscopists % that underwent colonoscopy: 100% Definition of advanced neoplasia: according to protocol
Flow and timing	Enrolment and exclusions: 207 analysed in study, only 126 screening participants used Number analysed: 126
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	No
Could the selection of patients have introduced bias?		High risk
Are there concerns that the included patients and setting do not match the review question?			High
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Low risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Yes
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	Yes
Were withdrawals from the study explained?	No
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Unclear risk

*Study characteristics*
Patient Sampling	Design: RCT screening study (randomisation between gFOBT and FIT) Setting: population‐based screening pilot
Patient characteristics and setting	Population: average risk, 50 to 74 years living in the catchment area of the screening program. Mean age 60 years, 43% male Country: the Netherlands Time period: 2008
Index tests	Index test: gFOBT Brand: Hemoccult II Method of collection: gFOBT: No dietary instructions. Participants were instructed to collect 2 samples of 3 consecutive bowel movements. Execution: gFOBT: cards were not rehydrated and read by 2 trained laboratory technicians. Positivity threshold: gFOBT: 1/6 windows
Target condition and reference standard(s)	Reference test: colonoscopy or follow‐up Performed by: experienced endoscopists % that underwent colonoscopy: 82% of positives Definition advanced neoplasia: according to protocol
Flow and timing	Enrolment and exclusions: 10,054 invited, 4990 participants Number analysed: gFOBT: 2112 (FIT: 2824) Identification of interval cancers: interval cancers were identified through cross‐linkage of the screening pilot database with the Dutch cancer registry. Definition of interval cancers: defined as proportion of cancers diagnosed in first‐round participants (both guaiac FOBT and FIT) outside the screening protocol but within the screening interval. Follow‐up time: 2 years
Comparative
Notes	Multiple tests were conducted on the same group of individuals
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Low risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	No
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	Yes
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	No
Could the patient flow have introduced bias?		Low risk

*Study characteristics*
Patient Sampling	Design: prospective study, retrospective analysis Setting: population‐based screening program
Patient characteristics and setting	Population: asymptomatic individuals, 50 to 74 years, living in the Isère department. Mean age 65 years, 62% male Country: France Time period: 2002 to 2006 Number analysed: 84,897
Index tests	Index test: gFOBT Brand: Hemoccult II Method of collection: two samples from each of three stools, diet not specified Execution: in analysis centre, (non)rehydration not specified Positivity threshold: not specified
Target condition and reference standard(s)	Reference test: colonoscopy or follow‐up (in case of negative test) Performed by: not specified % that underwent colonoscopy: not specified
Flow and timing	Enrolment and exclusions: not specified. GP provided FOBT after checking for exclusion criteria. Exclusion criteria: personal or family history of CRC, history of adenomatous polyps, symptoms suggestive of cancer, colonoscopy in past 5 years. Number analysed: 86,750 Identification of interval cancers: via cancer registry database Definition of interval cancers: CRC diagnosed within 2 years of a negative test or a test that could not be analysed Follow‐up time: 2 years
Comparative
Notes	Article only describes CRCs; data for 2x2 table received from authors
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Unclear
Could the selection of patients have introduced bias?		Unclear risk
Are there concerns that the included patients and setting do not match the review question?			Unclear
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Unclear
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Unclear risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Unclear
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Unclear
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Unclear
Could the reference standard, its conduct, or its interpretation have introduced bias?		Unclear risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	No
Were withdrawals from the study explained?	No
Did screened persons receive the same reference standard irrespective of index test result?	No
Could the patient flow have introduced bias?		High risk

*Study characteristics*
Patient Sampling	Design: prospectiveSetting: not described
Patient characteristics and setting	Population: average‐risk participants older than 50 years; not clear how consulted. 54.9% male Country: Germany Time period: not specified Number invited: 311 Number analysed: 276
Index tests	Index test: quantitative FIT and gFOBT (participants underwent both tests) Brand: FOBT‐Gold and for gFOBT not specified Method of collection: 2 FITs on 2 different parts from the same stool and gFOBT on 3 consecutive stools Execution: FIT automated analysis and gFOBT judged by staff Positivity threshold: FIT 14 ng Hb/mL (2.8 µg Hb/g faeces), the highest value of both tests was used. For gFOBT 1/3 samples
Target condition and reference standard(s)	Reference test: colonoscopy (and CT‐colonography) Performed by: six experienced gastroenterologists % that underwent colonoscopy: 100 Definition of advanced neoplasia: not defined, but characteristics of all polyps are listed
Flow and timing	Enrolment and exclusions: 311 enrolled, 4 withdrawals, FIT results available for 258, gFOBT for 276 participants Number analysed: 285
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Unclear
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Unclear
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	Yes
Could the conduct or interpretation of the index test have introduced bias?		Unclear risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Unclear
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	No
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	No
Were withdrawals from the study explained?	No
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Unclear risk

*Study characteristics*
Patient Sampling	Design: prospective colonoscopy‐screening study Setting: opportunistic colonoscopy screening
Patient characteristics and setting	Population: asymptomatic, age 55 to 80 years, mean age and male/female ratio of total not specified Country: Germany Time period: 2006 to 2009
Index tests	Index test: quantitative FIT Brand: RIDASCREEN Execution: automated analysis Method of collection: stool from 1 bowel movement, without diet restrictions Positivity threshold: 10 µg Hb/g and 20 µg Hb/g faeces
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: experienced endoscopists % that underwent colonoscopy: 100% Definition of advanced neoplasia: according to protocol
Flow and timing	Enrolment and exclusions: 3077 consented to screening colonoscopy; 752 excluded, 15 excluded because advanced neoplasia in both left and right colon (these are re‐included in this analysis) Number analysed: 2310
Comparative
Notes	2x2 table received from authors including 15 excluded because of advanced neoplasia in both left and right colon Multiple tests were conducted on the same group of individuals in these studies (Haug 2011, Brenner 2013)
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	No
Were non‐interpretable test results reported for the index test?	Unclear
Could the conduct or interpretation of the index test have introduced bias?		Low risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Yes
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	No
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Low risk

*Study characteristics*
Patient Sampling	Design: multicentre, prospective, double‐blind study (COLONPREV study) Setting: average‐risk individuals submitted to screening colonoscopy in 3 tertiary hospitals in Spain
Patient characteristics and setting	Population: average‐risk individuals aged 50 to 69 years, 49.6% males, mean age 57.5 years Country: Spain Time period: 1 January 2010 to 30 June 20111
Index tests	Index test: quantitative FIT Brand: OC‐Sensor Method of collection: 2 stool samples from 2 consecutive days the week before the colonoscopy was scheduled, without diet or medication restrictions. Execution: automated analysis. Faecal haemoglobin concentration was determined in the first sample (FIT1) and the highest level of both samples (FITmax). We only used results of FIT1. Positivity threshold: all, specified for 50 ng Hb/mL and 100 ng Hb/mL
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: not reported % that underwent colonoscopy: 94% (54 out of 851 did not complete colonoscopy) Definition of advanced neoplasia: according to protocol
Flow and timing	Enrolment and exclusions: 851 were included and 779 were analysed (completed both FIT and colonoscopy) Number analysed: 779
Comparative
Notes	Two stool samples from 2 consecutive days were collected; we only used results of the first sample (FIT1).
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Low risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Yes
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	Yes
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Low risk

*Study characteristics*
Patient Sampling	Design: prospective multicentre studySetting: participants referred for colonoscopy at 4 different centres
Patient characteristics and setting	Population: participants referred for colonoscopy (both symptomatic as well as asymptomatic); only screened population used for this review (48% males) Country: Germany Time period: not specified
Index tests	Index test: gFOBT, ELISA iFOBT and bedside test strip device Prevent ID CC (latter not used in review) (participants underwent all tests) Brand: Haemocullt and Hb ELISA (Immunodiagnostik) Execution: gFOBT non‐hydrated and iFOBT by automated analysis Method of collection: one stool sample Positivity threshold: 1/3 for gFOBT and 10 µg Hb/g stool for iFOBT
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: experienced endoscopists ('experienced' not defined) % that underwent colonoscopy: 100% Definition of advanced neoplasia: CRC + large adenomas (> 10 mm)
Flow and timing	Enrolment and exclusions: 237 symptomatic participants and 150 healthy participants undergoing CRC screening were enrolled. 387 participants underwent 407 tests. All participants with repeat tests were known IBD cases, according to manuscript. Nevertheless, results for 156 screening participants given. Number analysed: 156
Comparative
Notes	Authors did not respond to emails with request for clarification. Manufacturer of the iFOBT clarified the correct cut‐off (10 µg Hb/g stool instead of 10 g Hb/mL stool)
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	No
Could the conduct or interpretation of the index test have introduced bias?		Low risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Yes
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	No
Were withdrawals from the study explained?	Unclear
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		High risk

*Study characteristics*
Patient Sampling	Design: cross‐sectional prospective study Setting: 90 sites throughout the USA and Canada, including private practice and academic settings
Patient characteristics and setting	Population: asymptomatic persons 50 to 84 years old were enrolled to undergo screening colonoscopy. Mean age of evaluable subgroup 64.2 years; 46.3% male. Country: Canada and USA Time period: June 2011 to November 2012
Index tests	Index test: quantitative FIT Brand: OC FIT‐CHECK (Polymedco) Method of collection: one stool sample before bowel preparation, collected in container. No diet restrictions. Execution: performed in stool from container according to manufacturer's instructions Positivity threshold: 100 ng Hb/mL (20 µg Hb/g faeces)
Target condition and reference standard(s)	Reference test: colonoscopy Performed by: not described % that underwent colonoscopy: 100% Definition of advanced neoplasia: according to protocol plus sessile serrated polyps measuring 1 cm or more
Flow and timing	Enrolment and exclusions: 12,776 were enrolled; 11,016 could be evaluated and 9989 had results that could be fully evaluated Number analysed: 9989
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	Yes
Could the conduct or interpretation of the index test have introduced bias?		Low risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Low concern
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	Yes
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		Low risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			Low concern
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	Yes
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	Yes
Could the patient flow have introduced bias?		Low risk

*Study characteristics*
Patient Sampling	Design: retrospective cohort study Setting: population‐based screening in Kaiser Permanente Northern and Southern California (KPNC, KPSC)
Patient characteristics and setting	Population: a total of 323,349 health plan members aged 50 to 70 years. 46.4% male Country: California, USA Time period: January 2007 to December 2013
Index tests	Index test: quantitative FIT Brand: OC FIT‐CHEK Method of collection: not specified Execution: OC‐Sensor‐Diana automated system (Polymedco) Positivity threshold: 20 µg Hb/g faeces
Target condition and reference standard(s)	Reference test: the FIT results recorded within 1 year of each mail date, and colonoscopies performed and adenomas or CRC diagnosed within 1 year after FIT results were considered part of a single screening episode for the round when the FIT was distributed. Performed by: not described Percentage that underwent colonoscopy: out of 16,037 first‐round FIT‐positive participants, 12,112 underwent colonoscopy (75.5%) within one year of positive result
Flow and timing	Enrolment and exclusions: population of 323,349. Follow‐up colonoscopy for 12,113 (out of 16,037) FIT positives. However, CRC results only reported for group consisting of 16,037 participants Number analysed: 319,424 Identification of interval cancers: colorectal adenocarcinomas and disease stage were obtained from the KPNC and KPSC cancer registries Follow‐up time: interval cancers after first round defined as CRC diagnosed within one year after FIT result
Comparative
Notes
*Methodological quality*
Item	Authors' judgement	Risk of bias	Applicability concerns
DOMAIN 1: Patient Selection
Was a consecutive or random sample of patients enrolled?	Yes
Was a case‐control design avoided?	Yes
Was the spectrum of invitees representative of CRC screening?	Yes
Could the selection of patients have introduced bias?		Low risk
Are there concerns that the included patients and setting do not match the review question?			Low concern
DOMAIN 2: Index Test (All tests)
If a threshold was used, was it pre‐specified?	Yes
Were non‐interpretable test results reported for the index test?	Unclear
Could the conduct or interpretation of the index test have introduced bias?		Unclear risk
Are there concerns that the index test, its conduct, or interpretation differ from the review question?			Unclear
DOMAIN 3: Reference Standard
Were the reference standard results interpreted without knowledge of the results of the index tests?	No
Were the same clinical data available when test results were interpreted as would be available when test is used in practice?	Yes
Could the reference standard, its conduct, or its interpretation have introduced bias?		High risk
Are there concerns that the target condition as defined by the reference standard does not match the question?			High
DOMAIN 4: Flow and Timing
Was there an appropriate interval between index test and reference standard?	Yes
Were all patients included in the analysis?	Yes
Were withdrawals from the study explained?	Yes
Did screened persons receive the same reference standard irrespective of index test result?	No
Could the patient flow have introduced bias?		Unclear risk