Skip to main content
Microbiology Spectrum logoLink to Microbiology Spectrum
. 2024 Feb 7;12(3):e02210-23. doi: 10.1128/spectrum.02210-23

Inter-laboratory ring trial to compare four quantitative polymerase chain reaction assays employed for detection of Mycobacterium avium subspecies paratuberculosis

L Worsley 1,, P L Davies 1
Editor: Sadjia Bekal2
Reviewed by: Fazeela Arshad3, Asra'a Abdul-Jalil4, Muthuraj Muthaiah5
PMCID: PMC10913479  PMID: 38323833

ABSTRACT

Johne’s disease is an infectious enteric disease caused by Mycobacterium avium subspecies paratuberculosis (MAP) affecting ruminant species worldwide. In Project 1, an independent performance comparison ring trail was conducted between three different commercial MAP quantitative polymerase chain reaction (qPCR) assay services (B, C, and D) currently marketed in Great Britain by three separate laboratories against each other and against a fourth assay (A) not available commercially in Great Britain. A total of 205 individual ovine and bovine samples from five farms were analyzed to give 41 sets of pooled results (pool size five) from each laboratory according to their specific protocols. The numbers of positive pools for assays A–D were 18, 12, 11, and 1 (43.9%, 29.2%, 26.8%, and 2.4%), respectively. Assessment of interrater reliability produced a Fleiss’ kappa coefficient of 0.15, indicating very poor overall agreement between the four laboratories. Laboratories A–D diagnosed 4, 3, 2, and 1 flocks at the farm level, respectively, as MAP positive. In Project 2, 38 pooled ovine samples from 10 flocks were analyzed to compare the performance of laboratories A and B. The numbers of positive results for laboratories A and B were 24 (63.1%) and 17 (44.7%), respectively (Cohen’s kappa 0.54), indicating that laboratory A was more sensitive than B in line with results from Project 1. Variation between laboratories offering MAP qPCR assays is a significant concern, and further work is warranted to validate and standardize the performance of assays between laboratories for both ovine and bovine samples.

IMPORTANCE

Our study reports the findings of an inter-laboratory ring trial comparing the performance of four different quantitative polymerase chain reaction (qPCR) assay services for detecting Mycobacterium avium subspecies paratuberculosis (MAP) infection in cattle and sheep. MAP is the causative agent of Johne’s disease (also known as paratuberculosis), a significant production-limiting disease in livestock populations with a worldwide distribution. The content of this paper is significant and novel as it is the first to highlight the marked variation between the diagnostic sensitivity and reproducibility of the three principal commercial laboratories offering MAP qPCR diagnostic and screening services in Great Britain. The low sensitivity and high variability between the laboratories are of great concern and relevance to veterinary practitioners and livestock producers.

KEYWORDS: ring trial, paratuberculosis, Johne's, ovine, bovine

INTRODUCTION

Johne’s disease (JD), or paratuberculosis, is an important production-limiting chronic wasting disease caused by Mycobacterium avium subspecies paratuberculosis (MAP), affecting predominantly ruminants with a worldwide distribution. The disease is chronic and insidious, with a long incubation period during which infected animals shed MAP in their feces intermittently and at low levels initially (1, 2), progressing to persistent and heavy shedding as the disease progresses towards the clinical stages for those animals that are not capable of clearing the infection (3, 4). In addition to negatively impacting animal welfare, JD is responsible for economic losses through reduced productivity, and increased replacement costs associated with premature culling and increased mortality (5, 6). To this end, disease control programs have been developed in many countries, including Great Britain (7). These programs require robust and reliable diagnostic and screening tests to establish herd or flock MAP infection status, to identify infected individual animals for culling, and make informed decisions regarding retention of their progeny, identify uninfected herds and flocks that are free from disease as sources of replacement stock, and also to satisfy specific health scheme requirements. Paratuberculosis also raises public health concerns with ongoing contention around MAP’s suspected role in Crohn’s disease in humans. This emphasizes the importance of implementing on-farm JD control programs (8).

Antemortem diagnosis of MAP infection in the early sub-clinical stages of the disease is challenging due to the lack of sufficiently sensitive and cost-effective tests. Additionally, while commercially available MAP quantitative polymerase chain reaction (qPCR) assays are validated for bovine samples in GB, the same laboratories do not offer validated equivalent tests for ovine samples. The consensus is that fecal qPCR and culture are more sensitive than antibody tests, especially in the early stages of the disease as MAP fecal shedding precedes a detectable antibody response in sheep and cattle (911).

Antibody enzyme-linked immunosorbent assays (ELISA) are most appropriate in the more advanced stages of disease associated with increasingly discernible clinical signs (1). However, despite their low sensitivity in the early stages of infection, ELISA assays are frequently employed as herd and flock screening tests because of their practicality and low cost. Indeed, whole herd serological screening using ELISA tests is routinely used for official JD control programs for GB beef herds and, less commonly, sheep flocks.

While fecal culture is still often considered the “gold standard” mainly due to its very high specificity (12), sensitivity estimates vary considerably depending on the MAP strain in question and stage of infection and, thus, the magnitude of fecal shedding in the individual or population being tested (13, 14). In contrast to C-strains which are relatively amenable to laboratory culture techniques, S-strains display slow and fastidious growth and are highly susceptible to various decontamination procedures and certain antibiotics (15, 16). As a result, it’s plausible that the sensitivity of fecal culture is underestimated where S strains predominate (14). How relevant this is to GB is unknown, given the lack of information regarding circulating MAP strains in our sheep and cattle populations. Fecal real-time qPCR assays have been shown to demonstrate superior sensitivity to fecal culture (14, 1719). Plausible explanations for this include the ability of qPCR to detect both C- and S- strains and also live and dead MAP organisms (14).

To the author’s knowledge, there have been no published independent trials comparing the performance of the different MAP qPCR assays commercially available in GB. This study aimed to establish the comparable performance of the pooled fecal MAP qPCR services that are commercially available as screening tests for veterinarians in clinical practice. The inter-laboratory ring trial study design encompasses the combined effect of intrinsic biochemical and extrinsic procedural factors that influence the sensitivity and specificity of the testing service provided to the clinician.

MATERIALS AND METHODS

Two projects were conducted as part of this inter-laboratory ring trial study. Project 1 compared the performance of three pooled fecal MAP qPCR tests (assay services B, C, and D) as provided by the three prominent laboratories (B, C, and D, respectively) offering this assay in GB against each other and also against a fourth qPCR (A) (20) not currently available commercially in GB and performed at the University of Liverpool (laboratory A). Laboratories B and C are government subsidized, while laboratory D is a private commercial enterprise. Project 2 compared the performance of laboratory A against laboratory B using an additional set of samples.

Sample selection

In Project 1, fecal samples originated from a convenience sample of five farms participating in a concurrent MAP prevalence study (in press). Samples were initially selected based on sufficient mass to allow division of the sample to provide adequate mass for each of the laboratories according to their specific requirements. Subsequent random allocation gave a total of 205 individual samples, grouped in 41 pools of five for analysis by each laboratory. A pool size of five was chosen as this is the maximum pool size laboratories B, C, and D offer as part of their commercial services. All five farms were sheep and beef mixed species farms. The sample set comprised 50 bovine and 155 ovine samples. The number of ovine and bovine pools tested from each farm is shown in Table 1.

TABLE 1.

Summary of total number of ovine and bovine pools analyzed from farms V–Z

Farm ID Number of ovine pools tested Number of bovine pools tested Total number of pools tested
V 5 2 7
W 5 0 5
X 8 0 8
Y 7 4 11
Z 6 4 10
Total 31 10 41

Samples were collected between August and October 2021 by the first author. All samples were kept at 4°C and sent to the University of Liverpool within two to three days of being collected, where they were stored at −70°C. Individual samples were identified with a unique ID, and instructions on the individual pool composition were provided to each laboratory to ensure the same pre-defined pools were tested by each laboratory. Samples were shipped directly to laboratories B, C, and D from the University of Liverpool.

For Project 2, samples originated from a second, independent convenience sample of 190 ovine fecal samples collected from 10 flocks. These were pooled according to the same parameters as previously described into 38 pools of five individuals per pool and were analyzed by laboratories A and B only due to limited sample mass. Both projects were performed as blind studies.

Laboratory assays

Laboratory A

Samples were analyzed by the first author at the University of Liverpool research laboratories using the DNA extraction kit and RL-PCR assay developed by Kawaji et al. (20).

Sample preparation

Individual samples were pooled into groups of five in the laboratory by creating individual fecal suspensions before pooling, as described by Mita et al. (21). Briefly, for each sample, 1 g of feces was placed into a 50-mL centrifuge tube containing 20 mL of sterile distilled water, mixed and shaken vigorously with a vortex for 30 sec before allowing the suspension to stand for 30 min, and repeated for all samples contributing to a pool of five. One milliliter of each fecal suspension was collected and pooled into a fresh 15-mL tube, followed by centrifugation at 900 × g for 30 min at room temperature. All but approximately 1.5 mL of supernatant was removed, the fecal pellet was re-suspended in the remaining supernatant, and the entire volume was transferred to a tube containing zirconia beads provided with the Johne-PureSpin kit (FASMAC Ltd.).

DNA extraction

DNA extraction was performed using the Johne-PureSpin kit (FASMAC Ltd.) according to the manufacturer’s instructions. Briefly, 1 mL of supernatant from the prepared fecal suspension was transferred to the bead tube provided and centrifuged at 17,000 × g for 6 min at room temperature. After removal of the supernatant, 400 µL of lysis buffer 1-A was added, and samples were pulverized for 20 min at 30 Hz using a tissue lyser (Qiagen Ltd.). Following centrifugation at 17,000 × g for 6 min, the supernatant was transferred to a 1.5-mL tube containing 200 µL of lysis buffer 1-B and 75 µL of extraction buffer 2. After centrifugation at 17,000 × g for 12 min, 500 µL of supernatant was mixed with 400 µL of binding buffer 3. The mixture was transferred to a spin column and centrifuged at 13,000 × g for 1 min. The column was washed once with 600 µL of washing buffer 4, centrifuged at 13,000 × g for 1 min, and then placed into a new 1.5-mL tube. The wash step was repeated if the sample was still discolored after the initial wash. DNA samples were eluted with 50 µL of elution buffer 5 by centrifugation at 13,000 × g for 1 min and stored at –20°C if qPCR could not be performed the same day.

qPCR assay

This was performed using a Reso-Light (RL) IS900 qPCR assay as described by Kawaji et al. (20). Briefly, the reaction mixture comprised 5 µL of template DNA, 25 µL of 2× GeneAce RL qPCR mix (Nippon Gene), 1 µL of 25 pmol of forward (IS900-3) and reverse (IS900-32) primers, 1 µL of 0.4fg/μL internal control (IC), 0.5 µL of 1 U/µL uracil-DNA glycosylase (UDG; Nippon Gene) and 16.5 µL of nuclease-free water to make up a 50 µL total reaction volume. Reaction mixtures were aliquoted with a QIAgility instrument (Qiagen Ltd.), and qPCR runs were performed on a Rotor-Gene Q machine (Qiagen Ltd.) on an SYBR reading setting using the following reaction program: initial incubation for UDG at 50°C for 2 min, followed by activation step of 10 min at 95°C, then 45 cycles of PCR amplification at 95°C for 30 secs and 68°C for 60 secs. After PCR amplification, dissociation curve data were collected to analyze melting temperature (Tm) peaks, as shown in Fig. 1. Tm peaks for the target (91.5°C ± 1.5°C) indicated the presence of MAP IS900, while the Tm peak for the IC (85.5°C ± 1.5°C) indicated a reaction without inhibition. If neither Tm peak was detected, the result was interpreted as invalid due to PCR inhibition. DNA extracts were run in singular with PCR-positive and negative controls and DNA extraction-negative controls.

Fig 1.

Fig 1

Melt curve analysis for the Reso-Light (RL) IS900 qPCR assay used by laboratory A. Melting temperature peaks for the target IS900 and internal control and expected results for the strong and weak positive and negative controls.

Laboratory B

Samples were analyzed according to laboratory B’s specific protocols. The detailed specification of the protocol is not in the public domain. Based on personal communication with a laboratory representative, the author’s understanding of the protocol is as follows: 2 g of fecal sample is required from each individual animal contributing to the pool of five. Fecal material is pooled, and 2 g of the composite fecal sample undergoes purification steps with water before centrifugation and resuspension in TE buffer in preparation for magnetic bead DNA extraction using a MagMax pathogen RNA/DNA kit (Applied Biosystems). qPCR is performed in duplicate for 45 cycles using the primer pair MP10-1 and MP11-11, designed to target the MAP IS900 element. Interpretation based on Ct values and melt curve value (Tm) is as follows: positive samples have a Ct value <45 combined with a Tm of between 87°C and 90°C inclusive. Negative samples have a Ct value >45 or <45 combined with an incorrect Tm product. Both replicates must be analyzed for all positive samples showing a Ct value of 35 or higher. If the replicate is also positive, it will be reported as such. If the replicate is negative, then it should be reported as negative. If there is still a discrepancy, e.g., one replicate has the correct Tm product and Ct value below 35 and the other replicate is clear negative, then, it is repeated a third time, and that result is reported. Repeats are done on the extracted DNA from the original extraction process. Samples are always reported as either positive or negative.

Laboratory C

Samples were analyzed according to laboratory C’s specific protocols. The detailed specification of the protocol is not in the public domain. Based on personal communication with a representative of the laboratory, the author’s understanding of the protocol is as follows: 5 g of the fecal sample from each animal contributing to the pool of five is added to an EZ PREP bottle containing 30 mL of a proprietary buffer (IDVet) and then vortexed for 30 seconds. Two milliliter from each fecal suspension is then squeezed through a filter, located in the cap of the EZ Prep lid, into a container, and the resulting 10 mL is mixed by vortexing. Of the pooled suspension, 1.5 mL is added to the bead tube for magnetic bead DNA extraction using the ID Gene Mag Fast extraction kit (Innovative Diagnostics). qPCR is performed using the ID Gene Paratuberculosis Duplex kit (Innovative Diagnostics), which includes an internal control. DNA extracts are run in singular for 40 cycles. The interpretation is as follows: if Ct < 33, record the sample as positive, not re-tested. If Ct value > 40, record as negative. If Ct ≥ 33 and < 40, repeat from the beginning of the extraction process. If Ct < 40 on repeat, record the sample as positive. If Ct > 40 on repeat, record the sample as inconclusive. Statistical analyses were performed with inconclusive results coded as both positive (C1) and negative (C2) to assess interrater reliability.

Laboratory D

Samples were analyzed according to laboratory D’s specific protocols. The detailed specification of the protocol is not in the public domain. Based on personal communication with a representative of the laboratory, the author’s understanding of the protocol is as follows: 1 g of feces from each animal contributing to the pool of five is mixed into 30 mL of water, of which 1 mL is used for DNA extraction using a MagMAX CORE Mechanical Lysis kit (Applied Biosystems). qPCR is performed using the VetMAX MAP Screening kit (Applied Biosystems). DNA extracts are run in singular for 45 cycles. The interpretation is as follows: if Ct < 37, record the sample as positive, not re-tested. If Ct value > 45, record as negative. Ct ≥ 37 and < 45, repeat from the beginning of the extraction process if sufficient amounts of original fecal sample remain, otherwise with saved material from part way through the extraction process. If Ct < 37 on repeat, record as positive. If Ct ≥ 37 and < 45 on repeat, record as inconclusive or positive depending on clinical history. If Ct value > 45 on repeat, repeat extraction and record the best of three results.

Statistical analysis

Interrater reliability was tested using Cohen’s and Fleiss’ kappa coefficients in Minitab19 Statistical Software (Minitab, LLC) to assess pairwise and overall agreement between all four laboratories, respectively, at the pool level.

RESULTS

Project 1

The numbers of positive pools for assays A, B, C1, C2, and D were 18, 12, 11, 2, and 1 (43.9%, 29.3%, 26.8%, 4.9%, and 2.4%), respectively. Figure 2 shows a heat map of the 41 raw results for each of the laboratories for farms V–Z for both ovine and bovine samples.

Fig 2.

Fig 2

Heat map of the raw pooled results for laboratories A, B, C1, C2, and D for farms V–Z for both ovine (O) and bovine (B) samples (n = 41). 1, positive; 0, negative; i, inconclusive.

Figure 3 shows the results of each laboratory for ovine and bovine pools.

Fig 3.

Fig 3

Summary of number of positive ovine (black) and bovine (red) pools as detected by laboratories A, B, C1, C2, and D for farms V–Z.

The Fleiss’ kappa coefficient between all four laboratories for both bovine and ovine pools was 0.15 (P = 0.011 and 0.008 for C1 and C2, respectively). Cohen’s kappa scores ranged between −0.05 and 0.66 when considering all 41 samples, as shown in Table 2.

TABLE 2.

Agreement analysis between laboratories A, B, C1, C2, and D for all ovine and bovine samples (n = 41)

Laboratory A B C1 C2 D
A X 0.59a(0)b 0.22 (0.062) 0.12 (0.051) 0.06 (0.126)
B X −0.03 (0.568) 0.07 (0.254) −0.05 (0.743)
C1 X X 0.13 (0.047)
C2 X X 0.66 (0)
D X
a

Cohen’s kappa statistic.

b

P (vs <0) of ĸ.

Fleiss’ kappa coefficient values between all four laboratories for ovine samples were 0.19 (P = 0.005) and 0.12 (P = 0.050) for C1 and C2, respectively. Cohen’s kappa scores ranged between −0.06 and 0.65 when considering all 31 samples, as shown in Table 3.

TABLE 3.

Agreement analysis between laboratories A, B, C1, C2, and D for ovine samples (n = 31)

Laboratory A B C1 C2 D
A X 0.56a (0)b 0.39 (0.003) 0.11 (0.092) 0.05 (0.178)
B X 0.12 (0.248) 0.07 (0.290) −0.06 (0.759)
C1 X X 0.21 (0.030)
C2 X X 0.65 (0)
D X
a

Cohen’s kappa statistic.

b

P (vs <0) of ĸ.

The Fleiss’ kappa coefficient between all four laboratories for bovine samples was −0.10 (P = 0.773) and 0.16 (P = 0.109) for C1 and C2, respectively. Cohen’s kappa scores ranged between −0.36 and 0.62 when considering all 10 samples, as shown in Table 4.

TABLE 4.

Agreement analysis between laboratories A, B C1, C2, and D for bovine samples (n = 10)

Laboratory A B C1 C2 D
A X 0.62a (0.018)b −0.19 (0.805) 0 (*) 0(*)
B X −0.36 (0.902) 0 (*) 0 (*)
C1 X X 0(*)
C2 X X N/A
D X
a

Cohen’s kappa statistic.

b

P (vs <0) of ĸ.

For the diagnosis of MAP infection at the farm level, laboratory A detected four out of the five flocks as MAP positive, laboratory B detected three, laboratory C detected two (for both interpretations of inconclusive results), and laboratory D detected only one flock as MAP positive, as shown in Fig. 4.

Fig 4.

Fig 4

Number of positive ovine pools detected by each of laboratories A–D for farms V–Z. Highlighted cells show positive flock diagnoses.

For diagnosis of MAP infection in the three beef herds in the study, laboratory A detected one herd as MAP positive, laboratories B and C1 detected two (but not the same two), while C2 and D did not detect any herds as MAP positive, as shown in Fig. 5.

Fig 5.

Fig 5

Number of positive bovine pools detected by each of laboratories A–D for farms V–Z. Highlighted cells show positive herd diagnoses.

Project 2

The number of positive pools detected by laboratories A and B were 24 (63.1%) and 17 (44.7%), respectively, as shown in Table 5, with a Cohen’s kappa score of 0.54. These results were similar to the ovine results of Project 1, whereby agreement between laboratories A and B was moderate. Furthermore, laboratory A’s assay was again more sensitive than laboratory B’s. When considering samples from Projects 1 and 2 together, Cohen’s kappa scores for all (n = 79) and ovine-only (n = 69) samples were 0.57 and 0.54, respectively.

TABLE 5.

Summary of number of positive pools as detected by laboratories A and B for farms A–J

Farm Number of pools tested Number of positive pools
Laboratory A Laboratory B
A 7 7 6
B 3 2 1
C 3 3 3
D 5 4 5
E 1 1 1
F 3 1 0
G 4 2 1
H 2 2 0
I 4 1 0
J 6 1 0
Total 38 24 17

For diagnosis of MAP infection at the farm level, laboratory A diagnosed all 10 flocks as MAP positive, while laboratory B detected just six as MAP positive.

DISCUSSION

Agreement between the four laboratories varied greatly and was generally remarkably poor for both ovine and bovine samples. Several kappa values below zero were recorded between laboratories B–D, representing agreement worse than expected by chance alone or disagreement. Low negative scores (0 to −0.10) may generally be interpreted as “no agreement,” but large negative kappa values, such as −0.36, seen between laboratories B and C1 for bovine samples, represents substantial disagreement between these two operators. The highest interrater reliability was consistently seen between laboratories C2 and D, which was attributable to the high proportion of negative pooled results for both of these laboratories in both species. The most likely explanation for this agreement is the poor sensitivity of laboratory D and, to a slightly lesser extent, C2. These two results evoke a false sense of high interrater reliability and should be interpreted cautiously. Unfortunately, no “gold standard” diagnostic test was available to assess the “correctness” of the results in this ring trial. However, the high specificity performance demonstrated previously for the primer pairs employed by the assays performed by laboratories A (20) and B (17) confers a significant degree of confidence that positive results from these two assays are valid. For ovine samples, laboratory A detected the highest number of positive pools, followed by laboratory B. Agreement between these two sets of results was moderate (0.56 and 0.62 kappa values for ovine and bovine pools, respectively, for Project 1 and 0.54 for Project 2) and broadly in line with the 0.6 kappa score level considered to be the minimum acceptable interrater agreement for clinical and diagnostic settings (22). Laboratory C1 detected the highest number of positive pools for bovine samples, followed by laboratory B. However, the agreement between these two sets of results for bovine samples was remarkably poor, with a significant negative kappa value (−0.36). Furthermore, there wasn’t sufficient fecal mass to assess repeatability by each laboratory, but this would have been another interesting test performance characteristic to investigate.

The marked difference in MAP detection rates at the flock and herd levels between the four laboratories highlights the clinical consequences of poor qPCR performance. Results from this ring trial indicate that the chance of diagnosing MAP infection in a flock is significantly affected by which laboratory is selected by submitting veterinary clinicians and the number of pools analyzed. This was also true for diagnosing MAP infection in beef herds, which is potentially even more concerning as laboratories B, C, and D all claim to have done significantly more internal validation and proficiency testing for MAP qPCR on bovine versus ovine feces. However, the relatively small sample size of bovine pools used in this ring trial makes it difficult to assess the significance of this observation.

Some degree of disagreement between the four laboratories was expected, especially for samples containing low amounts of MAP DNA. MAP is known to “clump” in feces (17), and, for stochastic reasons, this non-homogenous distribution could explain a lower detection rate for samples with low MAP levels. All four laboratories use qualitative qPCR assays; thus, quantifying the bacterial load of the samples in this study was not possible to explore this hypothesis. The pooling process and resulting dilution factor would have exasperated low MAP DNA levels. It’s also possible that MAP quantities varied between different sub-samples used by each laboratory from the same original individual animal, despite mixing the original sample before creating sub-samples to be sent to laboratories B−D.

It’s also plausible that other factors may have contributed to the wide range of apparent sensitivities resulting in the poor levels of inter-laboratory agreement. Potential explanations for the wide variation in results between laboratories include differences in sample volume, pooling methodology, DNA extraction protocols, and primer pairs employed, all of which may affect sensitivity and interpretation of results.

Stipulated individual fecal sample volumes for assays A−D were 1, 2, 5, and 1 g, respectively. Interestingly, both the most and least sensitive assays (A and D, respectively) required the smallest volumes of feces to be submitted, which indicates variation in performance is not attributable to this factor alone. However, the low volume of feces required for assay D may have contributed to overall poor performance.

An appropriate DNA extraction kit is essential for maximizing the final eluted DNA concentrations and purity (23). Efficient extraction and DNA isolation are particularly challenging with a complex biological matrix such as feces and an organism such as MAP. Feces contain PCR-inhibiting substances such as polysaccharides, heme, plant components, phytic acid, polyphenols, bile salts, and Ca2+ ions, which can lead to false-negative results by inhibiting DNA amplification in the PCR (24), thus compromising the PCR reaction. Additionally, MAP bacteria possess a thick waxy cell wall with a high lipid content that renders them inherently difficult to be lysed, resulting in low DNA recovery and reduced PCR sensitivity (23, 25). Direct comparisons between the performance of the extraction kits used by laboratories A−D were not possible here, although it’s feasible that this affected the overall diagnostic sensitivity.

The variation in sensitivity performance observed for laboratories A−D may also be due to the target sequence of each qPCR. Laboratories A, B, and C use primers to detect the insertion sequence IS900, which is considered a MAP-specific element with 15–20 copies present in a single MAP genome (26). However, IS900-like sequences do exist in non-MAP mycobacteria, which may contribute to false-positive results if the PCR method employs common probes and primers (27). This can be overcome by designing and employing specific primer sets. The study which first described the primer pair (MP10-1 and MP11-1) used by laboratory B reported no cross-reaction when conducting specificity experiments using 51 non-MAP mycobacterial species, including 10 that contained IS900-like sequences (17). Similarly, the study which first described the primer pair (IS900-3 and IS900-32) used by laboratory A also reported 100% specificity when evaluating against 57 other non-MAP mycobacterial strains (20). Furthermore, laboratories A and B used assessment of melting temperatures as part of their interpretation, providing further reassurance regarding specificity. Laboratory D’s target gene is ISMAP02 which has a lower copy number (six copies) in the MAP genome (28) than IS900, which may explain the lower diagnostic sensitivity.

Differences in pooling methodology may also have affected laboratory performance. To ensure all individuals in a pool are represented in the sample that is used for DNA extraction, it is mandatory that pooled samples are thoroughly mixed, in particular, to mitigate MAP’s clumping tendencies. It is arguably more difficult to effectively mix solid feces (as for the protocols for laboratories B and D) versus liquid fecal suspensions (laboratories A and C).

The MAP qPCR assays offered by two of the four laboratories involved in this ring trial study are accredited by the United Kingdom Accreditation Service (UKAS) and Cattle Health Certification Standards (CHECS) for bovine fecal samples and can be used interchangeably by farmers enrolled in CHECS-licensed herd health schemes. CHECS is the body that sets standards, quality controls, and certifies the official JD control programs in GB. This ring trial focused on assessing the performance of MAP qPCRs as a screening test on pooled samples, while these control programs most commonly employ MAP qPCR assays for individual bovine samples. However, it’s plausible that the poor performance of a particular laboratory on pooled samples also translates to poor performance on individual samples, particularly for low-shedding animals. If this is the case, the results of this ring trial would suggest that the extremely poor sensitivity of assay D raises the question of whether it is fit for purpose as an approved diagnostic test for such health schemes. Indeed, anecdotal reports from cattle farmers and veterinary practitioners raising concerns over the divergent performance of MAP qPCR assays from laboratories C and D are not uncommon. The inability to diagnose MAP-positive individuals leads to unintentionally retaining infected animals and their progeny within the herd, maintaining a reservoir of infection that will propagate disease spread. This thwarts costly efforts made at JD control and decreases consumer confidence. Perversely, laboratory B demonstrated the highest diagnostic sensitivity of the three commercially available assays assessed in this ring trial, and yet, the MAP qPCR assay offered by laboratory B is neither UKAS nor CHECS accredited.

The dilution effects of pooling samples may have reduced the sensitivity of the assays assessed in this ring trial. However, commercial laboratories should offer a qPCR assay capable of detecting low amounts of MAP DNA in pooled samples as a cost-effective screening option for livestock producers. The pool size of five used in this study is the maximum pool size laboratories B–D will run, and yet, other countries routinely offer MAP diagnostic tests in larger pool sizes. The Australian Johne’s Disease Market Assurance Program for Sheep (SheepMAP) uses pool sizes of 50 animals for pooled fecal culture (29), and there a several reports in the literature of pool sizes of 10 having no adverse effect on qPCR sensitivity compared with individual testing (20, 30).

Conclusion

Variation between commercial laboratories offering MAP qPCR assays is a significant concern. The three main MAP qPCR assays currently available commercially in GB appear to have inferior sensitivity to other DNA extraction and qPCR kit combinations described in the literature and commercially available on the broader global market. A lack of diagnostic sensitivity will lead to the underdiagnosis of this important disease and subsequently a loss of consumer confidence in the laboratory tests and the health schemes that endorse them. Further work is required to standardize the performance of assays within and between laboratories for both bovine and ovine samples, with specific validation and proficiency studies undertaken for each species. This would assist farmers and veterinarians in making more informed decisions on screening, diagnosis, and control of Johne’s disease on GB livestock farms.

Supplementary Material

Reviewer comments
reviewer-comments.pdf (1.3MB, pdf)

ACKNOWLEDGMENTS

Funding was provided by Hybu Cig Cymru, Virbac Ltd., and the University of Liverpool.

Contributor Information

L. Worsley, Email: L.Taylor11@liverpool.ac.uk.

Sadjia Bekal, Institut National de Santé Publique du Québec, Sainte-Anne-de-Bellevue, Québec, Canada.

Fazeela Arshad, National Institute for Biotechnology and Genetic Engineering, Faisalabad, Punjab, Pakistan.

Asra'a Abdul-Jalil, University of Anbar, Sulaimaniya, Iraq.

Muthuraj Muthaiah, Government Hospital for Chest Diseases, Puducherry, Puducherry, India.

SUPPLEMENTAL MATERIAL

The following material is available online at https://doi.org/10.1128/spectrum.02210-23.

OPEN PEER REVIEW. reviewer-comments.pdf.

An accounting of the reviewer comments and feedback.

reviewer-comments.pdf (1.3MB, pdf)
DOI: 10.1128/spectrum.02210-23.SuF1

ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.

REFERENCES

  • 1. Nielsen SS, Toft N. 2008. Ante mortem diagnosis of paratuberculosis: a review of accuracies of ELISA, interferon-gamma assay and faecal culture techniques. Vet Microbiol 129:217–235. doi: 10.1016/j.vetmic.2007.12.011 [DOI] [PubMed] [Google Scholar]
  • 2. Whittington RJ, Reddacliff LA, Marsh I, McAllister S, Saunders V. 2000. Temporal patterns and quantification of excretion of Mycobacterium avium subsp paratuberculosis in sheep with Johne’s disease. Aust Vet J 78:34–37. doi: 10.1111/j.1751-0813.2000.tb10355.x [DOI] [PubMed] [Google Scholar]
  • 3. Begg D. WR. 2010. Paratuberculosis in sheep, p 157–164. In Behr M, Collins D (ed), Paratuberculosis: organism, disease, control, 1st ed. CABI, Oxfordshire, UK. [Google Scholar]
  • 4. de Silva K, Plain K, Purdie A, Begg D, Whittington R. 2018. Defining resilience to mycobacterial disease: characteristics of survivors of ovine paratuberculosis. Vet Immunol Immunopathol 195:56–64. doi: 10.1016/j.vetimm.2017.11.008 [DOI] [PubMed] [Google Scholar]
  • 5. Morris CA, Hickey SM, Henderson HV. 2006. The effect of Johne’s disease on production traits in Romney, Merino and Merino x Romney-cross ewes. N Z Vet J 54:204–209. doi: 10.1080/00480169.2006.36698 [DOI] [PubMed] [Google Scholar]
  • 6. Gautam M, Anderson P, Ridler A, Wilson P, Heuer C. 2018. Economic cost of ovine Johne’s disease in clinically affected New Zealand flocks and benefit-cost of vaccination. Vet Sci 5:16. doi: 10.3390/vetsci5010016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Whittington R, Donat K, Weber MF, Kelton D, Nielsen SS, Eisenberg S, Arrigoni N, Juste R, Sáez JL, Dhand N, et al. 2019. Control of paratuberculosis: who, why and how. A review of 48 countries. BMC Vet Res 15:198. doi: 10.1186/s12917-019-1943-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Waddell LA, Rajić A, Stärk KDC, McEwen SA. 2016. The potential public health impact of Mycobacterium avium ssp. paratuberculosis: global opinion survey of topic specialists. Zoonoses Public Health 63:212–222. doi: 10.1111/zph.12221 [DOI] [PubMed] [Google Scholar]
  • 9. Chaitaweesub P, Abbott KA, Whittington R, Marshall DJ. 1999. Shedding of organisms and sub-clinical effects on production in pre-clinical Merino sheep affected with ovine paratuberculosis Proceedings of the Sixth International Colloquium on Paratuberculosis, p 126–131International Association for Paratuberculosis, Madison, Wis [Google Scholar]
  • 10. Kawaji S, Begg DJ, Plain KM, Whittington RJ. 2011. A longitudinal study to evaluate the diagnostic potential of a direct faecal quantitative PCR test for Johne’s disease in sheep. Vet Microbiol 148:35–44. doi: 10.1016/j.vetmic.2010.07.022 [DOI] [PubMed] [Google Scholar]
  • 11. Yamamoto T, Murai K, Hayama Y, Kobayashi S, Nagata R, Kawaji S, Osaki M, Sakakibara SI, Tsutsui T. 2018. Evaluation of fecal shedding and antibody response in dairy cattle infected with paratuberculosis using national surveillance data in Japan. Prev Vet Med 149:38–46. doi: 10.1016/j.prevetmed.2017.10.009 [DOI] [PubMed] [Google Scholar]
  • 12. Whitlock RH, Wells SJ, Sweeney RW, Van Tiem J. 2000. ELISA and fecal culture for paratuberculosis (Johne’s disease): sensitivity and specificity of each method. Vet Microbiol 77:387–398. doi: 10.1016/s0378-1135(00)00324-2 [DOI] [PubMed] [Google Scholar]
  • 13. Whittington RJ, Sergeant ES. 2001. Progress towards understanding the spread, detection and control of Mycobacterium avium subsp paratuberculosis in animal populations. Aust Vet J 79:267–278. doi: 10.1111/j.1751-0813.2001.tb11980.x [DOI] [PubMed] [Google Scholar]
  • 14. Bauman CA, Jones-Bitton A, Jansen J, Kelton D, Menzies P. 2016. Evaluation of fecal culture and fecal RT-PCR to detect Mycobacterium avium ssp. paratuberculosis fecal shedding in dairy goats and dairy sheep using latent class Bayesian modeling. BMC Vet Res 12:212. doi: 10.1186/s12917-016-0814-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Reddacliff LA, Vadali A, Whittington RJ. 2003. The effect of decontamination protocols on the numbers of sheep strain Mycobacterium avium subsp. paratuberculosis isolated from tissues and faeces. Vet Microbiol 95:271–282. doi: 10.1016/s0378-1135(03)00181-0 [DOI] [PubMed] [Google Scholar]
  • 16. Stevenson K, Alvarez J, Bakker D, Biet F, de Juan L, Denham S, Dimareli Z, Dohmann K, Gerlach GF, Heron I, Kopecna M, May L, Pavlik I, Sharp JM, Thibault VC, Willemsen P, Zadoks RN, Greig A. 2009. Occurrence of Mycobacterium avium subspecies paratuberculosis across host species and European countries with evidence for transmission between wildlife and domestic ruminants. BMC Microbiol 9:212. doi: 10.1186/1471-2180-9-212 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Kawaji S, Taylor DL, Mori Y, Whittington RJ. 2007. Detection of Mycobacterium avium subsp paratuberculosis in ovine faeces by direct quantitative PCR has similar or greater sensitivity compared to radiometric culture. Vet Microbiol 125:36–48. doi: 10.1016/j.vetmic.2007.05.002 [DOI] [PubMed] [Google Scholar]
  • 18. Arsenault J, Singh Sohal J, Leboeuf A, Hélie P, Fecteau G, Robinson Y, L’Homme Y. 2019. Validation of an in-house real-time PCR fecal assay and comparison with two commercial assays for the antemortem detection of Mycobacterium avium subsp. paratuberculosis infection in culled sheep. J Vet Diagn Invest 31:58–68. doi: 10.1177/1040638718810744 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Plain KM, Marsh IB, Waldron AM, Galea F, Whittington A-M, Saunders VF, Begg DJ, de Silva K, Purdie AC, Whittington RJ. 2014. High-throughput direct fecal PCR assay for detection of Mycobacterium avium subsp. paratuberculosis in sheep and cattle. J Clin Microbiol 52:745–757. doi: 10.1128/JCM.03233-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Kawaji S, Nagata R, Minegishi Y, Saruyama Y, Mita A, Kishizuka S, Saito M, Mori Y. 2020. A novel real-time PCR-based screening test with pooled faecal samples for bovine Johne’s disease. J Clin Microbiol 58:e01761-20. doi: 10.1128/JCM.01761-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Mita A, Mori Y, Nakagawa T, Tasaki T, Utiyama K, Mori H. 2016. Comparison of fecal pooling methods and DNA extraction kits for the detection of Mycobacterium avium subspecies paratuberculosis. Microbiologyopen 5:134–142. doi: 10.1002/mbo3.318 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. McHugh ML. 2012. Interrater reliability: The Kappa Statistic. Biochem Med (Zagreb) 22:276–282. doi: 10.11613/bm.2012.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Leite FL, Stokes KD, Robbe-Austerman S, Stabel JR. 2013. Comparison of fecal DNA extraction kits for the detection of Mycobacterium avium subsp. paratuberculosis by polymerase chain reaction. J Vet Diagn Invest 25:27–34. doi: 10.1177/1040638712466395 [DOI] [PubMed] [Google Scholar]
  • 24. Christopher-Hennings J, Dammen MA, Weeks SR, Epperson WB, Singh SN, Steinlicht GL, Fang Y, Skaare JL, Larsen JL, Payeur JB, Nelson EA. 2003. Comparison of two DNA extractions and nested PCR, real-time PCR, a new commercial PCR assay, and bacterial culture for detection of Mycobacterium avium subsp. paratuberculosis in bovine feces. J Vet Diagn Invest 15:87–93. doi: 10.1177/104063870301500201 [DOI] [PubMed] [Google Scholar]
  • 25. Logar K, Kopinč R, Bandelj P, Starič J, Lapanje A, Ocepek M. 2012. Evaluation of combined high-efficiency DNA extraction and real-time PCR for detection of Mycobacterium avium subsp. paratuberculosis in subclinically infected dairy cattle: comparison with faecal culture, milk real-time PCR and milk ELISA. BMC Vet Res 8:49. doi: 10.1186/1746-6148-8-49 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Green EP, Tizard ML, Moss MT, Thompson J, Winterbourne DJ, McFadden JJ, Hermon-Taylor J. 1989. Sequence and characteristics of IS900, an insertion element identified in a human Crohn’s disease isolate of Mycobacterium paratuberculosis. Nucleic Acids Res 17:9063–9073. doi: 10.1093/nar/17.22.9063 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Cousins DV, Whittington R, Marsh I, Masters A, Evans RJ, Kluver P. 1999. Mycobacteria distenct from Mycobacterium avium subsp. paratuberculosis isolated from the faeces of ruminants possess IS900-like sequences detectable IS900 polymerase chain reaction: implications for diagnosis. Mol Cell Probes 13:431–442. doi: 10.1006/mcpr.1999.0275 [DOI] [PubMed] [Google Scholar]
  • 28. Stabel JR, Bannantine JP. 2005. Development of a nested PCR method targeting a unique multicopy element, ISMap02, for detection of Mycobacterium avium subsp. paratuberculosis in fecal samples. J Clin Microbiol 43:4744–4750. doi: 10.1128/JCM.43.9.4744-4750.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Australia AH. 2022. Market assurance schemes, SheepMAP
  • 30. Ly A, Dhand NK, Sergeant ESG, Marsh I, Plain KM, Buczinski S. 2019. Determining an optimal pool size for testing beef herds for Johne’s disease in Australia. PLoS One 14:e0225524. doi: 10.1371/journal.pone.0225524 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Reviewer comments
reviewer-comments.pdf (1.3MB, pdf)
OPEN PEER REVIEW. reviewer-comments.pdf.

An accounting of the reviewer comments and feedback.

reviewer-comments.pdf (1.3MB, pdf)
DOI: 10.1128/spectrum.02210-23.SuF1

Articles from Microbiology Spectrum are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES