Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2021 Sep 29;16(9):e0257474. doi: 10.1371/journal.pone.0257474

Highly sensitive scent-detection of COVID-19 patients in vivo by trained dogs

Omar Vesga 1,2,*, Maria Agudelo 1,2, Andrés F Valencia-Jaramillo 2,3, Alejandro Mira-Montoya 2,3, Felipe Ossa-Ospina 2,3,4, Esteban Ocampo 3, Karl Čiuoderis 5, Laura Pérez 5, Andrés Cardona 5, Yudy Aguilar 2, Yuli Agudelo 1, Juan P Hernández-Ortiz 5,6, Jorge E Osorio 5,6
Editor: Etsuro Ito7
PMCID: PMC8480816  PMID: 34587181

Abstract

Timely and accurate diagnostics are essential to fight the COVID-19 pandemic, but no test satisfies both conditions. Dogs can scent-identify the unique odors of volatile organic compounds generated during infection by interrogating specimens or, ideally, the body of a patient. After training 6 dogs to detect SARS-CoV-2 by scent in human respiratory secretions (in vitro diagnosis), we retrained 5 of them to search and find the infection by scenting the patient directly (in vivo screening). Then, efficacy trials were designed to compare the diagnostic performance of the dogs against that of the rRT-PCR in 848 human subjects: 269 hospitalized patients (COVID-19 prevalence 30.1%), 259 hospital staff (prevalence 2.7%), and 320 government employees (prevalence 1.25%). The limit of detection in vitro was lower than 10−12 copies ssRNA/mL. During in vivo efficacy experiments, our 5 dogs detected 92 COVID-19 positive patients among the 848 study subjects. The alert (lying down) was immediate, with 95.2% accuracy and high sensitivity (95.9%; 95% C.I. 93.6–97.4), specificity (95.1%; 94.4–95.8), positive predictive value (69.7%; 65.9–73.2), and negative predictive value (99.5%; 99.2–99.7) in relation to rRT-PCR. Seventy-five days after finishing in vivo efficacy experiments, a real-life study (in vivo effectiveness) was executed among the riders of the Metro System of Medellin, deploying the human-canine teams without previous training or announcement. Three dogs were used to examine the scent of 550 volunteers who agreed to participate, both in test with canines and in rRT-PCR testing. Negative predictive value remained at 99.0% (95% C.I. 98.3–99.4), but positive predictive value dropped to 28.2% (95% C.I. 21.1–36.7). Canine scent-detection in vivo is a highly accurate screening test for COVID-19, and it detects more than 99% of infected individuals independent of key variables, such as disease prevalence, time post-exposure, or presence of symptoms. Additional training is required to teach the dogs to ignore odoriferous contamination under real-life conditions.

Introduction

At the time of this writing, almost 33% of the world population has received at least one dose of a COVID-19 vaccine, but less than 1.5% of people in low-income countries belong to this group [1]. Under the most optimistic scenario, universal vaccine coverage is unlikely before 2023 [2]. Until this happens, early and accurate identification of people infected with SARS-CoV-2 is essential to prevent contagion [3]. Ideally, diagnostic tests must detect the pathogen in asymptomatic, pre-symptomatic and symptomatic patients [4]. The reference standard is the real-time reverse transcriptase-polymerase chain reaction (rRT-PCR); it is highly specific (~100%), but lacks sensitivity during the first 5 days post-exposure (0% on day 1, 33% on day 4, 62% on day 5), and its availability is limited [5]. Lateral flow antigen tests are less expensive, instrument-free, provide results faster than rRT-PCR, are quite sensitive during the pre-symptomatic period (84%-98%), and have ~100% specificity as well, but sensitivity is lost 5–7 days after exposure; all these make antigen detection more suitable to complement the insensitive period of the rRT-PCR [6]. Antibody tests are useless to prevent the dissemination of the virus, as they peak after the infectious period [7]. It was clearly demonstrated in several countries that early and massive testing, followed by immediate isolation in designated areas away from home and rigorous contact-tracing, were the only measures that effectively stopped the pandemic even before the first vaccine was available [8]. Quarantines provide time for health authorities to respond, but the benefit is doubtful [9], and the cost is catastrophic [10]. Vaccines offer the solution [1], but the time needed to immunize the world’s population is more than enough for the virus to mutate and adapt [11]. Therefore, finding strategies to balance prevention against economic considerations is still an emergency.

Humans have been using dogs—Canis lupus familiaris—for scent-detection since the beginnings of domestication [12]. The great power of their sense of smell is exceedingly useful, and the first study of their olfactory capabilities was published more than 130 years ago by George J. Romanes [13], the research associate of Charles Darwin. Today, highly trained dogs are invaluable not only for their service [14], but also because their accuracy is superior to analytical instruments [15]. In the field of medical diagnosis, dogs are known to detect specific conditions [16], but most are anecdotal reports instead of formal protocols designed to validate a diagnostic test for clinical use [17]. However, at least one study demonstrated that appropriate training, coupled with strict adherence to the scientific method, lead to consistent diagnosis of Clostridioides difficile infection in humans [18], and, more recently, a comprehensive method was published validating canine diagnosis of two plant pathogens of international concern [19, 20]. Dogs detect and differentiate unique odors that result from the emission of volatile organic compounds (VOC) that constitute the “smell print” of the target [21]. In the case of SARS-CoV-2, several VOC have been found in the breath of COVID-19 patients [22, 23]. Since canines are inherently resistant to SARS-CoV-2 [24], and the virus cannot replicate in them or be transmitted from dogs to other mammals [25], it is not only safe but justifiable to study their efficacy and effectiveness in diagnosing COVID-19 by scent [2629].

Our main objective was to determine the performance of scent-detection dogs as a screening tool in vivo for immediate detection of COVID-19 patients under a variety of circumstances [30]. The study was designed to address five research questions: 1) if working dogs belonging to breeds destined for non-scenting tasks would succeed as medical detectors (a positive result would increase significantly the canine population from which dogs could be selected); 2), the minimal number of COVID-19 patients required to train the dogs in vitro (such number must be enough for the dogs to make the inference that any human being with the same smell-print is a positive); 3) key diagnostic metrics (e.g. sensitivity and specificity) of scent detection in vitro and in vivo under controlled experimental conditions, i.e., screening efficacy; 4) the canine limit of detection for SARS-CoV-2, quantified as number of copies of single stranded viral RNA per milliliter (ssRNA/mL); and 5) the real-life performance of scent detection dogs to detect SARS-CoV-2 in humans outside a hospital or office setting, i.e., screening effectiveness.

After proper training of six dogs, we compared canine diagnostic performance against the reference standard to determine relevant diagnostic metrics, including sensitivity (SEN), specificity (SPC), positive predictive value (PPV), negative predictive value (NPV), accuracy (ACC), and likelihood ratio (LR) of our dogs to detect by scent COVID-19 in vivo, i.e., by direct olfaction of the patient. The outcome was a very fast and cost-effective screening method for infection by SARS-CoV-2 in human patients.

Materials and methods

Media data were uploaded to Figshare and are available at:

https://doi.org/10.6084/m9.figshare.14815848.v1

Ethical statement

The study protocol was approved by the Ethical Committee for Human Research of Hospital Universitario San Vicente Fundación and the Animal Research Ethics Committee of Colina-K9. All human subjects read and signed their informed consent. We did not subject our dogs to any kind of pressure for training. We did not starve the dogs and did not need to make them obsessive for food, toys, games, or anything else. Since it is impossible to force a dog to do scent-work, our methods are exclusively positive, rewarding every correct response, and being indifferent to any mistake. Experiments with Syrian hamsters were carried out in strict accordance with the recommendations of the Guide for the Care and Use of Laboratory Animals of Universidad de Antioquia and the National Institutes of Health of the United States. No surgery was performed and animals were not subjected to any suffering or stress.

Design

Fig 1 shows the training program and experimental design.

Fig 1. Efficacy studies.

Fig 1

Flow chart depicting the order in which training phases and experimental design were conducted. The number of days after phases 1, 2, and 3 indicate the time employed training the dogs before running efficacy experiments; in the case of phase 4, the time without training before starting the effectiveness experiments. COVID-19 prevalence was set up as desired for in vitro experiments, introducing a more difficult scenario by minimizing prevalence during phase 2 (in vitro diagnosis). Prevalence during phases 3 and 4 (in vivo screening) was spontaneous, given by the pandemic epidemiology of the different human groups participating in the study.

Specimen collection for in vitro work

This study was designed to determine the diagnostic performance of canines to detect, by olfaction, patients infected by SARS-CoV-2 in vitro and in vivo. The first step was to request written informed consent to aliquot, ultra-freeze (-70°C), and thaw (for experimental use as needed) respiratory secretions from 12 COVID-19 patients admitted to three hospitals located in the metropolitan area of the Aburrá Valley, namely, Clínica CES, Hospital Manuel Uribe Angel, and Hospital Universitario San Vicente Fundación (Table 1).

Table 1. Human subjects who provided specimens for in vitro training and experimentation (phases 1 and 2).
Patient # Sex, Race Age (y) Specimen Days Sick SARS-CoV-2 rRT-PCR Viral Load (log10 copies ssRNA/mL)
1 F, Hispanic 74 NPS & saliva 12 Positive 5.42
2 M, Hispanic 55 NPS & saliva 10 Positive ND
3 F, Hispanic 57 NPS & saliva 5 Positive 6.90
4 M, Hispanic 80 TA 16 Positive 5.15
5 M, White 29 NPA 10 Positive ND
6 M, White 83 TA 10 Positive 5.09
7 M, Hispanic 34 NPS 3 Positive 10.2
8 F, Hispanic 27 Sputum 7 Positive ND
9 F, Hispanic 26 Sputum & saliva 5 Positive ND
10 F, Hispanic 61 Sputum & saliva 9 Positive 5.07
11 M, Hispanic 77 Saliva 8 Positive ND
12 M, Hispanic 59 TA 13 Positive ND
13–112 59F, 41M R: 18–84 Saliva 0 Negative NA

NPS: nasopharyngeal swab; TA: tracheal aspirate; NPA: nasopharyngeal aspirate; F: female; M: male; R: range; ND: not determined; NA: not applicable. The institutions in which the first 12 donors were hospitalized are not listed to prevent any risk of potential identification; the last 100 donors were ambulatory citizens.

Dog training

Using operant conditioning based on clicker-training and rewarding with food [31], we trained six canines to detect the odor print of SARS-CoV-2 in saliva and in the human body (Fig 2): four Belgian Shepherd Malinois (a herding breed), one first-generation cross Alaskan Malamute by Siberian Husky (a Nordic sled-dog), and one pit bull (a fighting breed).

Fig 2. Pictures and identification of the six dogs trained for the scent-detection of SARS-CoV-2.

Fig 2

(1) Andromeda, intact female, 6-mo, Belgian Malinois (BM). (2) Nina, intact female, 25-mo, BM, (3) Niño, castrated male, unknown age, American Pit Bull Terrier. (4) Timo, intact male, 31-mo, BM. (5) Vika, intact female, 36-mo, BM. (6) Vita, intact female, 36-mo, first generation Alaskan Malamute x Siberian Husky.

Dog training was planned in three phases, each followed by its corresponding experimental work. Phase 1 (in vitro recognition) lasted 28 days during which we trained the dogs to recognize in vitro the scent-print of SARS-CoV-2 under a wide variety of environmental modifications that included, but were not limited to, time of the day, weather, training field, altitude above the level of the sea, distraction and hiding devices, age and temperature of the target samples, noise level and origin, distracting smells, training time, rewards, dog collars and leashes, etc. The only aspects of training that remained constant during phase 1 were the positive specimens from the first three donors in Table 1 (one from each hospital), the use of 0.9% sterile saline solution as negative controls, the constant schedule of reinforcement, and the dog-trainer duos. We adhered to the errorless discrimination learning protocol developed by Terrace [32], which consists in presenting the animal a marked contrast between positive and negative stimuli during the foundations of training [33]. The “stimulus” is the problem presented to the dog, which was, for in vitro diagnosis, sterile saline solution (phase 1) or human saliva (phase 2), and for in vivo screening, the body of a person (phase 3). A stimulus can be positive if it leads to a reward for the dog (SARS-CoV-2), or negative, if it does not (controls). To recognize the scent-print of SARS-CoV-2, we trained the dogs to find their food (the reward) using their olfaction, always hiding with it a respiratory specimen from Patient 1 (the positive stimuli). The amount of food was diminished progressively until only the SARS-CoV-2 specimen was left in the hiding place, while the number of hides with saline increased in number. The director of training (AFVJ) pressed a clicker device to mark the correct behavior (i.e., lying down to alert on the identification of the SARS-CoV-2 specimens), and the trainer immediately rewarded the dog, who had been conditioned beforehand to regard the click as the constant precedent (secondary reinforcer) to reward (primary reinforcer). Training sessions varied from 1 to 60 minutes, always followed by a resting period at least twice longer than the working time. It took one day for all dogs to understand that finding the SARS-CoV-2 specimen meant a prize for them, and that saline conveyed no reward. The following 27 days of phase 1, dogs were trained with respiratory secretions from Patients 1, 2 and 3 under the above-mentioned variations. The other 9 positive specimens (Patients 4–12) were reserved exclusively for experimentation, which only took place after the dogs had acquired the error-free skills necessary to identify SARS-CoV-2 with Patients 1–3 (defined as zero errors in a 10-sample field during 10 repetitions varying the target prevalence).

Phase 2 (in vitro diagnosis) went on for 21 days, keeping constant the positive stimuli (specimens from Patients 1, 2 and 3), but changing the negative stimuli for human saliva specimens donated by 100 human volunteers (Patients 13–112, Table 1). We exposed the dogs to a maximum of 10 specimens per training session (<10% positive stimulus plus >90% negative stimulus), reserving the 100-sample field for experimentation only. The donors of control samples were 100 healthy citizens belonging to the general population of the Aburrá Valley and its surrounding mountains; their saliva specimens were negative for SARS-CoV-2 by rRT-PCR the same day that we aliquoted and froze them at -70°C. Sample collection took place in March 2020, when the pandemic was just starting in Colombia and it was quite simple to find non-infected people. To prevent replication of the microbiota within each saliva sample, working specimens were thawed as needed, kept at 4°C between uses, and heat-sterilized before appropriate disposal. An illustration of the experimental field and the scent-detection work in vitro can be seen in S1 Video in S1 File (https://doi.org/10.6084/m9.figshare.14815848.v1).

Phase 3 of training (in vivo screening) took 56 days during which the dogs learned to identify COVID-19 patients by scenting the human body. Each canine was trained with 400 subjects who did not participate in the previous (or future) experiments, 100 hospitalized patients (COVID-19 prevalence: 40%) and 300 HCW (COVID-19 prevalence: 7%). Most of phase 3 training time was dedicated to improving specificity and positive predictive value, because sensitivity and negative predictive value never represented an obstacle.

Sample size

Different sample sizes were used to train dogs in vitro and in vivo, because a much larger sample size is needed to validate in vitro scent detection [34]. Trial design is affected by prevalence and clinical severity of the disease to be diagnosed, because both variables have significant influence on SEN, SPC, PPV, NPV, LR, and ACC of the test under study [35].

In vitro, we set disease prevalence at random within specific ranges for each experiment, making it more difficult for the dogs as training progressed, 5% to 10% for phase 1, and 1% to 5% for phase 2. For a 5% prevalence rate and based on a target significance level of 0.05, at least 2140 samples were required to achieve a power greater than 80% in order to detect a change in sensitivity from 0.80 (null hypothesis, H0) to 0.90 (alternative hypothesis, Ha). To detect the same change in specificity, only 113 samples were required.

In vivo (phase 3), we recruited three populations based on their epidemiological risk for COVID-19: a high-risk group, consisting of patients admitted to Hospital Universitario San Vicente Fundación in Medellin, Colombia; an intermediate-risk group, consisting of health-care workers (HCW) at the same institution, and a low-risk group, consisting of officials working with the Governor of the Department of Antioquia. We anticipated a prevalence of 30%, 10% and 5% for each of these groups, requiring the participation of at least 63, 190, and 380 subjects to achieve the same targets as our in vitro testing, but with a change in sensitivity from 0.6 (H0) to 0.9 (Ha); to detect the same change in specificity, only 27, 21 and 20 participants were required from each group, respectively. The expected average prevalence of COVID-19 for the whole sample was 10%, requiring at least 310 participants to achieve the same targets with a change in sensitivity from 0.7 (H0) to 0.9 (Ha) and 34 participants to detect the same change in specificity [35]. To preserve power in case of lower prevalence, we aimed to include a larger number of subjects in all of the above-described scenarios. COVID-19 severity was expected to be proportional to risk, because prognosis for hospitalized patients and young people working in a government building should be at the extremes, while for HCW it should be in the middle. The sample sizes required (and attained) for sensitivity during experimental phases 1, 2 and 3 were 2140 (3200), 2140 (6000) and 310 (848), respectively; specificity requirements were much lower (S1 Table in S1 File).

Experimentation after scent-detection training

Blinding

The first 60% of the experiments in phase 1 were unblinded (i.e., the handlers knew the position and number of positive stimuli in the field) to observe the behavioral cues offered by each dog during alerts on the target odor; the last 40% were blinded, as well as all experiments in phases 2 and 3, and the final scent effectiveness experiment. Except for phase 4, where diagnosis was unknown to everyone involved (scientists and participants), the director of training was always unblinded, and activated the clicker to inform the dog-trainer duos about every correct alert. In phase 4 he had to interpret the behavior of each dog to decide if a reward was in order.

In vitro experiments

For every experiment in vitro, the position of the samples in the field (1 to 100) and disease prevalence (1% to 10%) were randomized with a mobile phone app, and the dogs went through an open field arrangement of 10 x 10 samples (100) distanced 2 m in all directions. An illustration of the experimental field and the scent-detection work in vitro can be seen in S1 Video in S1 File (https://doi.org/10.6084/m9.figshare.14815848.v1).

Three kinds of 2-mL specimens were prepared under a biosafety class III laminar flow cabinet using 209 sterile, scent-free flasks: the control specimens consisted of 100 flasks with 0.9% sterile saline solution for phase 1, or saliva from 100 rRT-PCR-negative individuals for phase 2, while the experimental specimens consisted of 9 flasks with respiratory secretions from COVID-19 Patients 4–12 for phases 1 and 2. The positive specimens were diluted (1:1 volume) in 0.9% sterile saline solution to preserve the virus [36]. During in vitro experiments, each dog had to interrogate by scent a field with 100 flasks, the vast majority (90%-99%) containing negative stimuli; the rest would have the positive stimuli. After finishing a 100-flask field, the dog was offered abundant water and placed to rest in its individual kennel. Before the next search, each dog was scheduled to have an unrestricted play session and to take a long walk with its trainer. Once ready, the field was rearranged for a new experiment, changing at random the position of the specimens and the prevalence of COVID-19. Phase 2 differed from phase 1 only in that the negative stimulus was saliva from 100 healthy human volunteers and all experiments were blinded, i.e., the dog handlers did not know the position and number of positive specimens.

In vivo experiments: Efficacy study

For phase 3 in vivo screening, dogs could scent any part of the anatomy and were allowed to touch with their noses the body of the subjects. Because dogs usually sniffed the hands first, we instructed each participant to present the hands opened with palms facing the dog, as illustrated in S2 Video in S1 File (https://doi.org/10.6084/m9.figshare.14815848.v1). Phase 3 experiments took place at HUSVF (inpatients and HCW) and at the Governor’s Building (government officials). Hospitalized patients were visited individually by the research team in their rooms or in the intensive care units. Government officials and HCW were screened in groups of up to 20 individuals in an open space in their respective institutions. After ending phase 3, we planned an additional experimental step without telling the training team about it, to determine real-life performance of the canine-human duos (phase 4).

Phase 4, in vivo effectiveness

These experiments were executed 75 days after the last experiment of phase 3 and entailed screening the general population riding the Metro System of Medellin (n = 550; 3 dogs of 3 breeds). Since we wanted to evaluate performance under real-life conditions, the human-canine teams were deployed to the field without previous announcement or environmental training, and participants were recruited on site without further delay. Since the research team had only three trainers, the effectiveness assay was limited to three dogs. As in phase 3, dogs were allowed to scent any part of the body, and each human volunteer provided a saliva sample for rRT-PCR once screened by the three dogs. As unique aspects of phase 4, dog performance was analyzed over time as they adapted to the Metro environment, and the director of training was informed of the experiments the day before, leaving rewards at his absolute discretion. In opposition to the method we had employed to train our dogs, it implied that some dog alerts might not be rewarded if interpreted as false positives by the training director.

Limit of canine scent-detection

Freshly collected saliva specimens from four COVID-19 patients (unknown to the dogs) with viral loads ranging from 47 to 475 copies ssRNA/mL were serially diluted in sterile physiologic saline solution in 10-fold dilutions down to 1x10-12 copies ssRNA/mL (i.e., 15 dilutions per specimen). Then, we randomized the dilutions from each patient by placing two COVID-19 dilutions along with 8 saline controls (10 flasks per row), and commanded every dog to search them until they finished the scent-interrogation of all 60 dilutions. The limit of detection (LOD) was the mean of the most diluted specimens that each dog was able to identify without failing a single one of the more concentrated dilutions. Since we did not determine the nature and relative concentrations of the VOC of COVID-19 patients, the only method available to quantify the LOD was the RNA concentration of SARS-CoV-2 per mL of saliva, which is very precise. It does not mean that RNA has odor, but provides a quantitative approach to the actual acuity of canine olfactory system to detect the scent-print of COVID-19.

Dog-human teams biosafety: Evaluation of the SARS-CoV-2 containment devices

Besides strict adherence to the biosafety and patient isolation rules of HUSVF, we contrived two devices (D1 and D2) made with the fabric of a DupontTM Tychem 2000 Coverall to prevent transmission of SAS-CoV-2 from speciments used in the study to pariticipating canines and their human trainers. The D1 device was used for scent-detection in saliva or respiratory specimens; it was a 130-mL glass flask with a metallic lid in which we perforated a 1 cm hole in the middle. The lid allowed a hermetic closure that remained intact after placing a 10x10 cm piece of Tychem 2000 between the bottle and its lid. The D2 device was used for the same purpose but offered greater versatility than D1; it was a waterproof bag made of two 18x8 cm pieces of Tychem 2000, heat-sealed on all four sides (it contained inside a sterile gauze impregnated with the SARS-CoV-2 specimen).

In order to ascertain if any of the dog-trainer teams got infected by, or could have been at risk of exposure to SARS-CoV-2 during the project, we implemented two strategies. The first one was to run rRT-PCR tests on saliva specimens of dogs and trainers after ending phases 1 and 2. The second approach was to determine experimentally the efficiency of our containment devices in the Syrian hamster (Mesocricetus auratus) COVID-19 model (S1 Fig in S1 File). After an acclimatization period of 4 weeks, we exposed 15 animals (6 females and 9 males, 8 weeks old, outbred, immunocompetent) to SARS-CoV-2 over 4 days in groups of 3 hamsters of the same sex (2 experimental and 3 control groups), each contained in a HEPA filtered One System cage. Each of the two experimental groups had, inside their respective cage, one of the containment devices (D1 in group 1, D2 in group 2) protected by a metallic welded wire mesh enclosure that allowed hamsters to smell the device without touching it. Each of the three control groups had free access to an unprotected D1 flask (group A), a sterile gauze impregnated with a fresh specimen from a different COVID-19 patient (group B), or an unprotected D2 bag (group C). D1, D2 and the virus-impregnated gauze were replaced with fresh SARS-CoV-2 specimens every 12 hours in the 5 groups. All hamsters were sampled for rRT-PCR by saliva swabs before and after SARS-CoV-2 exposure.

rRT-PCR assay and RNA quantification, RNA transcript standard generation, assay efficiency, and analytical sensitivity

The SARS-CoV-2 molecular diagnosis was conducted at the Genomic One Health Laboratory, Universidad Nacional de Colombia. Viral RNA was extracted from canine nasal and oropharyngeal swabs and human nasopharyngeal aspirates using the ZR viral extraction kit (Zymo Research) from a 140-μL volume of the specimens. Instructions provided by the manufacturer were followed and the sample was eluted into 20 μL. The CDC 2019-Novel Coronavirus Real-Time RT-PCR Diagnostic Panel (Integrated DNA Technologies) [37] and Berlin-Charité E gene protocol for SARS-CoV-2 [38] were used to detect virus nucleocapsid (N1 and N2) and Envelope genes, respectively. All rRT-PCR testing was done using Superscript III One-Step RT-PCR System with Platinum Taq Polymerase (Thermo Fisher Scientific). Each 25-μL reaction contained 12.5 μL of the reaction mix, 1 μL of enzyme mix, 0.5 μL of 5 μmol/L probe, 0.5 μL each of 20 μmol/L forward and reverse primers, 3.5 μL of nuclease-free water, and 5 μL of RNA. The amplification was done on an Applied Biosystems 7500 Fast Real-Time PCR Instrument (Thermo Fisher Scientific). Thermocycling conditions consisted of 15 min at 50°C for reverse transcription, 2 min at 94°C for activation of the Taq polymerase, and 40 cycles of 3 s at 94°C and 30 s at 55°C (N gene) or 58°C (R gene), and 3 min at 68°C for the final extension. SARS-CoV-2 assays were run simultaneously along with internal control genes for canine (glyceraldehyde-3-phosphate dehydrogenase-GAPDH) and human specimens (Ribonuclease P-RP) [39] to monitor nucleic acid extraction, sample quality, and presence of PCR reaction inhibitors [40]. To monitor assay performance, positive template controls and no-template controls were also incorporated in all runs. Biosafety precautions were followed during the workflow to minimize PCR contamination. For rRT-PCR qualitative detection, a threshold was set in the middle of the exponential amplification phase of the amplification results; a specimen was determined as positive for SARS-CoV-2 when all controls exhibited expected performance and assay amplification fluorescent curves crossed the threshold within 40 cycles (CT <40). For rRT-PCR quantitative detection of SARS-CoV-2 in human specimens, an analysis of copy number and linear regression of the RNA standard was used.

Preparation of in vitro RNA Transcript as standard

An in vitro RNA transcript of the SARS-CoV-2 envelope gene was generated as a standard for rRT-PCR quantitative detection in human specimens. Viral RNA from a positive clinical sample was used as an initial template for in vitro RNA transcription. cDNA was synthetized using SuperScript™ III First-Strand Synthesis System and random hexamers primer (Thermo Fisher, USA). Double-stranded DNA containing the 5′-T7 RNA polymerase promoter sequence for the SARS-CoV-2 complete E gene sequence, was obtained using DreamTaq Hot Start PCR Master Mix (Thermo fisher, USA) and E-Std-T7-Fwd (TAA TAC GAC TCA CTA TAG GGG CGT GCC TTT GTA AGC ACA A), and the E-Std-Rev (GGC AGG TCC TTG ATG TCA CA) primers [41]. The DNA was finally transcribed using the MEGAscript T7 Transcription Kit (Thermo Fisher Scientific). The RNA transcripts were purified with Ampure XP beads (Belckman Counter, USA) and quantified with a Qubit fluorometer using a Qubit RNA HS Assay Kit (Thermo Fisher Scientific). All commercial reagents were used according to manufacturer instructions.

Assay efficiency and analytical sensitivity

The in vitro RNA transcript standard was used to assess LOD and assay efficiency using a standard curve. Serial 10-fold dilutions of quantified in vitro RNA transcript were prepared in triplicate per dilution. The LOD for each assay was defined as the highest dilution of the transcript at which all replicates were positive. The efficiency (E) was estimated by linear regression of the standard curve using the equation (E) = [10 (1/slope)]– 1 [42]. The LOD and E of the SARS-CoV-2 assay were determined to warrant consistency with what has been previously demonstrated [43]. The intra- and inter-assay variability were also calculated using the in vitro RNA standard. To assess intra-assay variation, the RNA standard was used at 2 and 6 log10 copies/reaction by triplicate in a single assay. To assess inter-assay variation, the RNA standard was tested at 2 and 6 log10 copies/reaction by triplicate in two separate PCR assays. Mean, standard deviation, the coefficient of variation of the CT and copy numbers were also determined.

Statistical analysis

Data input into 2x2 contingency tables generated the metrics SEN, SPC, PPV, NPV (mean and 95% confidence interval), ACC, and LR. Since pooling results from experiments with 100 specimens violates the independence assumption of the Fisher’s exact test, we performed latent class analysis for in vitro data. Since no assumptions were violated in vivo, we applied the two-tailed Fisher’s Exact Test to challenge the null hypothesis that the dogs detected COVID-19 by chance.

Results

Phase 1: In vitro recognition of SARS-CoV-2

The mean prevalence of SARS-CoV-2 for these experiments was 7.56% (range, 5.0%-8.6%), and the magnitude of all diagnostic metrics was very high (S2 Table in S1 File). The number of experiments varied for each dog because the required sample size (3200) was reached early and all of them recognized COVID-19 specimens with an accuracy >95.0%. To determine if diagnostic performance would improve by increasing prevalence to 20% (expected at the time of deployment), we set up an experiment with 40 flasks in a 10 x 4 field, allocating randomly 8 positive samples within 32 saline distractors. All six dogs identified correctly every sample without a single mistake, as expected from errorless training theory. With these results, dogs were ready for phase 2 training, designed for lower prevalence of positive samples (1%-5%) and greater difficulty in discriminating the positive from the negative stimulus (saliva from 100 non-COVID subjects instead of saline).

Phase 2: In vitro diagnosis of SARS-CoV-2

Mean prevalence was 2.2%. Compared with phase 1, there was a significant improvement in the magnitude of all metrics for every dog (S3 Table in S1 File). As a group, the 6 dogs achieved SEN 95.5% (95% C.I. 90.4–97.9), SPC 99.6% (99.5–99.8), PPV 85.7% (79.2–90.5), NPV 99.9% (99.8–100), ACC 99.6%, and LR 267. The PPV improved 12 percentile points, while the NPV was close to perfection, thereby suggesting a very low probability that any of our dogs would miss a positive case in vitro (S2 Fig in S1 File).

Phase 3: In vivo diagnosis of SARS-CoV-2 by direct body-scenting (efficacy trial)

One of the dogs (Vika) was excluded due to advanced pregnancy. Five dogs interrogated by scent 848 human subjects from three risk-groups: 269 hospitalized patients (high risk), 259 HCW (intermediate risk), and 320 government employees (low risk group). Demographics of the human participants are described in Table 2.

Table 2. Phase 3: In vivo screening, efficacy trial.

Demographic and clinical characteristics of 848 participants in the scent-detection experiments.

Variable n (%)
Sex All Participants 848 (100)
Females 514 (60.6)
Males 334 (39.4)
Age (years-old) Median 56
Mean 53
Youngest 15
Oldest 92
COVID-19 Prevalence All Participants 92 of 848 (10.85)
Government Employees 4 of 320 (1.25)
Health-Care Workers, HUSVF 7 of 259 (2.70)
Hospitalized Patients, HUSVF 81 of 269 (30.1)
COVID-19 Status SARS-CoV-2 positive 92 (10.85)
SARS-CoV-2 negative 753 (88.8)
SARS-CoV-2 indeterminate 3 (0.35)
Result by Reference Standard rRT-PCR positive 41 (4.83)
rRT-PCR negative 753 (88.8)
rRT-PCR indeterminate 3 (0.35)
Antigen positive 51 (6.01)
Antigen negative 0 (0)
Clinical Status at K9 Test COVID-19, asymptomatic 18 (2.12)
COVID-19, pre-symptomatic 0 (0)
COVID-19, symptomatic 74 (8.73)
Not COVID-19, but sick 188 (22.2)
Not COVID-19, healthy 565 (66.6)
Indeterminate, asymptomatic 2 (0.24)
Indeterminate, symptomatic 1 (0.12)

HUSVF: Hospital Universitario San Vicente Fundación.

Before the canine scent-screening, we sampled the 848 participants to determine their COVID-19 status by molecular and antigen testing (Fig 3). SARS-CoV-2 infection was confirmed in 92 patients (10.8%) and 753 (88.8%) tested negative. The other 3 (0.4%) were asymptomatic subjects with “indeterminate” rRT-PCR results after repeated testing; they were excluded from the analysis (Fig 4). The average cycle threshold (CT) of the rRT-PCR positive patients was 32.0 (range, 20.6–38.8). COVID-19 was diagnosed by antigen test (Standard Q COVID-19 Ag Test, SD Biosensor) in 51 patients that had been admitted to the ER with acute respiratory distress, fever, sinus pain, cough, anosmia, or dysgeusia. Of 753 COVID-19 negative patients, 188 were hospitalized for other diseases that included respiratory conditions (23%, half had bacterial infections), malignancy (19%), autoimmunity (8%), coronary or peripheral atherosclerosis (7%), diabetes mellitus (6%), or chronic osteomyelitis (6%), and the rest had traumatic injuries, peritonitis, HIV, or cholangitis, among other pathologies. Of note, the dogs did not alert on any of the patients with respiratory diseases other than COVID-19. COVID-19 prevalence was 10.85% for the study population (92 of 848 subjects), distributed this way based on pre-test risk: 30.1% (81 of 269), 2.70% (7 of 259), and 1.25% (4 of 320) for the high, intermediate, and low-risk groups, respectively. As a group, the five dogs achieved SEN 95.9% (95% CI 93.6–97.4), SPC 95.1% (94.4–95.8), PPV 69.7% (65.9–73.2), NPV 99.5% (99.2–99.7), ACC 95.2%, and LR 19.6 (S4 Table in S1 File). Individual performance mirrored closely the group metrics (Fig 5). Four of 320 participants in the low-risk group had positive rRT-PCR tests, but these individuals declined canine scent-detection, producing zero values in two cells of the 2x2 contingency tables and precluding the computation of diagnostic metrics (Fig 4).

Fig 3. Phase 3: In vivo screening (efficacy trial).

Fig 3

Diagram illustrating the flow of human participants in the third phase of the study.

Fig 4. Phase 3: In vivo screening (efficacy trial).

Fig 4

Data analysis by risk group of all participants in experiments designed to determine performance metrics of the dogs during in vivo screening. Green, yellow, orange and purple cells contain true positives, false positives, false negatives, and true negatives, respectively. Cells not enhanced contain the number of participants with “indeterminate” rRT-PCR (3), subjects who declined K9 olfaction (4), and those rare occasions where the dogs refused to scent an individual, which happened 7 times with Andromeda and Nina and 2 times with Niño. Sensitivity could not be computed in the low risk group (NAN: not a number) because all 4 COVID-19 patients declined K9 scent-detection, resulting in 0 in two cells of the 2x2 contingency table and not significant P values in the two-tailed Fisher’s Exact Test (enhanced in salmon color).

Fig 5. Phase 3: In vivo screening (efficacy trial).

Fig 5

Performance metrics of 5 dogs screening for COVID-19 the patients and staff of Hospital Universitario San Vicente Fundación and the personnel working in the Office of the Governor of Antioquia; n = 848, global prevalence = 10.5%. Each symbol has a different color to ease visualization of the dogs. The vertical lines above and below the symbols represent the 95% confidence interval for each metric, which is contained within the symbol for SPC, NPV and ACC. Additional numeric data in S4 Table in S1 File.

Phase 4: In vivo screening of citizens riding the Metro System of Medellin (effectiveness assay)

The mass transit service of Medellin transports 1.5 million passengers every day. Without prior notification to the San Antonio station users or to trainers, three canines screened, over two days, 550 individuals who also volunteered to provide saliva specimens for rRT-PCR testing. S2 Video in S1 File illustrates the level of difficulty of these crowded conditions for scent-detection work (https://doi.org/10.6084/m9.figshare.14815848.v1). Despite the environmental impact on the dog’s concentration, they detected 17 COVID-19 cases with high SPC and NPV, 15 of them asymptomatic or pre-symptomatic. During the first 200 subjects, SEN and PPV dropped significantly in comparison with the efficacy trial (S5 Table in S1 File and Fig 6), but the dogs adjusted within 3 hours to the new environment and improved their performance until reaching a plateau (Fig 7). Table 3 shows the value for every diagnostic metric in each phase of the study.

Fig 6. Phase 4: In vivo screening (effectiveness assay).

Fig 6

Performance metrics of 3 dogs screening for COVID-19 the citizens riding the Metro System of Medellin; n 550, prevalence 3.1%. Each symbol has a different color to ease visualization of the dogs. The vertical lines above and below the symbols represent the 95% confidence interval for each metric, which is contained within the symbol for NPV and ACC. Additional numeric data in S5 Table in S1 File.

Fig 7. Phase 4: In vivo screening (effectiveness assay).

Fig 7

Canine adjustment to a real-life situation. Accuracy started much lower under real-life conditions, but improved with time as the dogs adjusted to the new environment. Numbers labeling the abscissa represent the order in which subjects were screened by the dogs, divided in groups of 110 individuals. Screening each group took approximately one hour of work for the dogs.

Table 3. Summary of the results attained with six dogs trained to detect COVID-19 by scenting saliva and the body of human participants.

Diagnostic Metric Phase 1: in vitro Recognition Phase 2: in vitro Diagnosis Phase 3: in vivo Screening (Efficacy Trial) Effectiveness Assay (Metro System)
Value 95% C.I. Value 95% C.I. Value 95% C.I. Value 95% C.I.
SEN (%) 88.8 84.3 92.2 95.5 90.4 97.9 95.9 93.6 97.4 68.6 55.0 79.7
SPC (%) 97.4 96.8 97.9 99.6 99.5 99.8 95.1 94.4 95.8 94.4 93.2 95.5
PPV (%) 73.9 68.6 78.6 85.7 79.2 90.5 69.7 65.9 73.2 28.2 21.1 36.7
NPV (%) 99.1 98.7 99.4 99.9 99.8 100 99.5 99.2 99.7 99.0 98.3 99.4
ACC (%) 96.8   99.6   95.2   93.6  
LR 34.6   266.7   19.6   12.3  
P <0.0001 <0.0001 <0.0001 <0.0001
Method Latent Class Analysis Latent Class Analysis Fisher’s Exact Test Fisher’s Exact Test

Limit of canine scent-detection

The LOD was determined in vitro using freshly collected saliva specimens from four COVID-19 patients new to the dogs. The moment of this assay coincided with the estrus cycle of several females, which caused the exclusion of the males from this experiment because both refused to work. The LOD for Andromeda, Nina, Vika, and Vita was lower than 2.61 x 10−12 copies ssRNA/mL (S6 Table in S1 File), the equivalent of detecting a drop (0.05 mL) of any odorous substance dissolved in a volume of water greater than the capacity of 10.5 Olympic swimming pools (2.6x1010 mL).

Biosafety of the canine and human team handling the virus

None of the dogs, their trainers, or the physician-scientists in charge of sampling and taking care of the patients contracted COVID-19 during this study. The rRT-PCR tests for SARS-CoV-2 from canines and humans resulted negative twice, once after ending phase 2, and again after finishing phase 3 (S7 Table in S1 File). Experimental testing of the devices to contain SARS-CoV-2 showed that both worked as intended, allowing the scent to evaporate while holding the virus secured inside (Fig 8). The six hamsters in the experimental groups climbed and smelled the mesh-protected devices D1 (group 1) and D2 (group 2), but none acquired SARS-CoV-2. Hamsters in the control groups did climb on D1 but could not damage the Tychem 2000 fabric covering the flask, and none got infected (group A); did bite the Tychem of D2, and 1 animal was infected (group C); and played, licked, bit, nested, and slept in the gauze impregnated with SARS-CoV-2, and all three contracted SARS-CoV-2 (group B) (S8 Table in S1 File).

Fig 8. Biosafety data.

Fig 8

Experimental evaluation of the devices used to contain SARS-CoV-2 specimens. After testing negative for SARS-CoV-2 in saliva, 5 groups of 3 golden Syrian hamsters each (Mesocrisetus auratus) were exposed during 4 days to SARS-CoV-2 directly (Group B, virus control) or enclosed in devices 1 and 2. Animals in Test groups 1 and 2 (blue circles) were allowed to sniff their devices but could not touch them, while those allocated to control groups A, B, and C (red triangles) had direct access to the containment fabric. The ordinate represents the viral load in saliva of each hamster after exposure to SARS-CoV-2 in 5 experimental groups.

Discussion

This study shows that canine scent-detection of COVID-19 is immediate, accurate, applicable anytime, and deployable anywhere as a diagnostic test in saliva or respiratory secretions, or as a screening tool in the patient directly. In any of those two roles, the dogs missed very few infected individuals, as demonstrated by NPV >99% in vitro and in vivo, and independently of the experimental design (in vivo efficacy and effectiveness trials). COVID-19 severity, ranging from asymptomatic to pre-symptomatic, sick and very sick patients, had no impact on performance. Prevalence from three populations of diverse levels of risk showed, as expected, that PPV went down when the presence of the disease in the population is very low, but NPV remained close to 100% across low and high prevalence. The errorless learning approach to training allowed generalization from only three specimens and prepared the dogs for in vitro diagnosis in improvised, open fields, making sophisticated and expensive equipment superfluous.

In vivo screening generated very encouraging results in both, efficacy and effectiveness trials, as the dogs detected more than 99% of the infected individuals spending <5 seconds per subject. Canine scenting of people has the potential risks of injury to patients (zero in this study), human refusal (four subjects), or dog refusal (16 cases among almost 6000 scent-screenings), but the advantages are overwhelming considering that this is the only screening test providing immediate identification and isolation of almost all infected subjects. It does not mean that in vivo screening is free of drawbacks: under real-life conditions, odor contamination caused a substantial increase in the false positive rate that drove the PPV down to 28%, which implies that most (72%) dog-positive subjects would have a negative rRT-PCR result, or that dogs produced 2.5 false positives for each true positive in the effectiveness assay. Although such error rate could still be acceptable for any screening test offering a very high NPV [44, 45], determining its cause could provide a method to solve the problem. One explanation was handlers rewarding some but not all alerts during the effectiveness assay but, in fact, rRT-PCR results showed that many correct alerts passed unrewarded; it confused the dogs and caused even more false negative alerts. The other reason is rather speculative, but based on experimental observations. The ultra-sensitive limit of detection suggests that at least a fraction of the false positives are actually pre-symptomatic COVID-19 patients. During training, three nurses whose rRT-PCR was negative were scored positive by the dogs, but 4–7 days later all three nurses had symptomatic COVID-19 with positive rRT-PCR. The dogs were accurate detecting those cases ahead of the molecular test. We also observed the dogs alerting spontaneously on the scientists that had touched any COVID-19 patient, or on the cell phones of nurses and physicians in care of COVID-19 patients. It suggests that our canines were making false alerts when detecting the scent-print of SARS-CoV-2 in contaminated individuals or in their belongings [46, 47].

Our data also provided answers to three other research questions. First, the limit of scent detection in vitro was lower than 10−12 copies ssRNA/mL, close to previous concentration thresholds determined with pure chemicals [48]. Second, all six dogs were successful as medical detectors, despite belonging to breeds not intended specifically for scent-detection. It supports recent data showing that, more than the breed, the best predictors of suitability for medical detection dogs are the levels of motivation, stamina, determination, resilience, and concentration ability of the individual dog [49, 50]. And third, only three COVID-19 patients sufficed for our dogs to recognize the scent-print of this particular disease in fresh saliva specimens and in vivo. This process, called generalization, applies to learning theory, and, in reference to scent-detection, means that the canine ignores variations of the positive stimulus and indicates its source regardless of distracting odors [51]. Generalization after exposure to just three specimens is to be expected if errorless learning principles are the foundations of training [32]. It was demonstrated with pigeons 60 years ago [33], confirmed and expanded recently [52], and then proven with wildlife detection dogs [53]. Some experts believe that canines cannot generalize an odor when trained with specimens coming from a few patients, arguing that the dog memorizes the scent-print of the individual (the source) instead of the particular disease (the target odor) [54]. However, experimental evidence in favor of this hypothesis is scarce, and most citations refer to a work in which urine was employed as positive and negative stimulus during training [55]. Beyond the many variables specific to certain diseases and specimens that might be responsible for a greater level of difficulty for the dog, using the same type of secretion when the dogs are first trained for scent-detection does not favor errorless discrimination learning [32, 33, 52].

The magnitudes of the different diagnostic metrics in vitro and in vivo show that the dogs were looking for the scent-print of SARS-CoV-2 (their target) and not accessory odors memorized from Patients 1–3 or from the hospital environment. SEN and SPC measure the proficiency of the dogs to correctly discriminate between patients infected (SEN) or not (SPC) by SARS-CoV-2. In vitro (phase 2) and in vivo (phases 3 and 4), SEN was >95%, indicating that the dogs identified correctly almost all cases of COVID-19. Had they been alerting to odors other than their target odor, SPC would have been very low, and it was also >95%. While SEN and SPC refer to the index test (canine scent-detection), predictive values quantify the probability that the participants truly had COVID-19 or not, taking the reference standard as the truth (in reality, false positives and negatives also occur with rRT-PCR). In vitro and in vivo, NPV was >99%, while PPV was 85.7% in vitro (phase 2), 69.7% in vivo (phase 3), and 28.3% in the effectiveness assay (phase 4). Memory can be eliminated as an explanation for the low PPV under real-life conditions because, after 75 days without being exposed to a single person with COVID-19, the NPV remained above 99%. A better understanding of this and many other exceptional capabilities of our canines is provided by the abundant scientific data on dog behavior and cognition [5659].

This study has some limitation that deserve attention. First, the lack of human coronavirus in our sample prevents their discrimination from SARS-CoV-2 or any non-human coronavirus. However, dogs did not alert on 43 hospitalized patients with respiratory conditions other than COVID-19, despite the fact that half of them had pneumonia caused by bacterial or viral pathogens like influenza virus. Second, the four COVID-19 subjects from the low-risk group opted out of the canine test. This precluded statistical calculations necessary to determine the different performance metrics under very low prevalence (1.25%). Data with HCW and Metro riders provide an approximation because prevalence was close (2.7% and 3.1%, respectively), suggesting that canine performance declined with prevalence. Although low prevalence is obviously not a problem with COVID-19, validation of this method might not be as successful with less frequent pathogens. Nonetheless, the excellent diagnostic performance in vitro under low prevalence (2.2%) indicates that improving the training method in vivo might overcome this particular barrier too. Finally, it should be noted that the advantages of using saliva instead of nasopharyngeal swabs are substantial and well supported [60, 61].

After our first preprint [62], at least four studies on canine scent-detection of SARS-CoV-2 in vitro have been formally published [6367]. Despite substantial methodological differences with our work, results are reproducible. The main difference with those studies is that we chose to scent-interrogate the human body because of the many obvious advantages that such approach brings: results are immediate, can be obtained anywhere, do not require equipment, and allow in situ separation of contagious individuals. The use of trained dogs as medical detectors was safe for the human participants during training and experimentation and effective regardless of the breed, a point of major importance considering that deployment would require the participation of many canines [68, 69], and the possibility of training dogs for real-time diagnosis of many other infectious diseases may help humanity be better prepared to confront the next pandemic [70]. These data suggest that well-trained dogs can be extremely helpful to guide societies through a safe re-opening of the economy and educational systems, while offering an efficient way to stop transmission. With improved training methods, canines could, in the near future, provide a sensitive and effective method to detect infectious diseases in a matter of seconds.

Supporting information

S1 File. Supporting information containing S1 and S2 Figs and Videos, as well as S1 through S8 Tables.

(DOCX)

Acknowledgments

We thank sincerely the patients, staff, and the members of the Board of Directors of Hospital Universitario San Vicente Fundación, the officers of the Governor of Antioquia, and the administrators and users of Medellin’s Metro System for participating in the study. We are also indebted towards Dr. Hernán Alzate for allowing us to train his two dogs (Nina and Timo) for this project, and to Mr. Jesús González, Cosmovision journalist, for the excellent video footage. Our special gratitude goes to Dr. Tonie E. Rocke for her insightful review of the final version of the manuscript.

Data Availability

All relevant data are within the manuscript and its Supporting Information files.

Funding Statement

OV received donations from Mauricio Palacio, Flor Saldarriaga, and LAS Sucesores SAS. JPHO received a donation from Grupo ISA. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Ritchie H, Mathieu E, Rodés-Guirao L, Appel C, Giattino C, Ortiz-Ospina E, et al (2020)—"Coronavirus Pandemic (COVID-19)". Published online at OurWorldInData.org. Available from: ’https://ourworldindata.org/coronavirus’ [Online Resource].
  • 2.Irwin A. What it will take to vaccinate the world against COVID-19. Nature. 2021;592: 176–178. doi: 10.1038/d41586-021-00727-3 [DOI] [PubMed] [Google Scholar]
  • 3.Botti-Lodovico Y, Rosenberg E, Sabeti PC. Testing in a Pandemic—Improving Access, Coordination, and Prioritization. N Engl J Med. 2021;384: 197–199. doi: 10.1056/NEJMp2025173 [DOI] [PubMed] [Google Scholar]
  • 4.Gandhi M, Yokoe DS, Havlir DV. Asymptomatic transmission, the Achilles’ heel of current strategies to control Covid-19. N Engl J Med. 2020;382: 2158–2160. doi: 10.1056/NEJMe2009758 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Kucirka LM, Lauer SA, Laeyendecker O, Boon D, Lessler J. Variation in false-negative rate of reverse transcriptase polymerase chain reaction–based SARS-CoV-2 tests by time since exposure. Ann Intern Med. 2020;173: 262–267. doi: 10.7326/M20-1495 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Centers for Disease Control and Prevention. Interim guidelines for rapid antigen testing for SARS-CoV-2. Available from: https://www.cdc.gov/coronavirus/2019-ncov/lab/resources/antigen-tests-guidelines.html.
  • 7.Centers for Disease Control and Prevention. Interim guidelines for COVID-19 antibody testing. Available from: https://www.cdc.gov/coronavirus/2019-ncov/lab/resources/antibody-tests-guidelines.html.
  • 8.EndCoronavirus. Countries beating COVID-19. Available from: https://www.endcoronavirus.org/countries
  • 9.Allen DW. Covid Lockdown Cost/Benefits: A Critical Assessment of the Literature. April2021. Available from: http://www.sfu.ca/~allen/LockdownReport.pdf. [Google Scholar]
  • 10.Yu KDS, Aviso KB, Santos JR, Tan RR. The Economic Impact of Lockdowns: A Persistent Inoperability Input-Output Approach. Economies. 2020;8: 109. doi: 10.3390/economies8040109 [DOI] [Google Scholar]
  • 11.Wadman M. Israel’s grim warning: Delta can overwhelm shots. Science, 2021;373: n838–839. doi: 10.1126/science.373.6557.838 [DOI] [PubMed] [Google Scholar]
  • 12.Shipman P. The invaders: how humans and their dogs drove Neanderthals to extinction. Cambridge: The Belknap Press; 2015. [Google Scholar]
  • 13.Romanes GJ. Experiments on the sense of smell in dogs. Nature. 1887; 36:273–274. [Google Scholar]
  • 14.Jezierski T, Ensminger J, Papet LE, editors. Canine olfaction science and law: advances in forensic science, medicine, conservation, and environmental remediation. New York: CRC Press; 2016. [Google Scholar]
  • 15.Helton WS. Canine ergonomics: the science of working dogs. New York: CRC Press; 2009. [Google Scholar]
  • 16.Bijland LR, Bomers MK, Smulders YM. Smelling the diagnosis. A review on the use of scent in diagnosing disease. Neth J Med. 2013;71: 300–307. [PubMed] [Google Scholar]
  • 17.Cambau E, Poljak M. Sniffing animals as a diagnostic tool in infectious diseases. Clin Microbiol Infect. 2020;26: 431‐435. doi: 10.1016/j.cmi.2019.10.036 [DOI] [PubMed] [Google Scholar]
  • 18.Bomers MK, van Agtmael MA, Luik H, Vandenbroucke-Grauls CMJE, Smulders YM. A detection dog to identify patients with Clostridium difficile infection during a hospital outbreak. J Infect. 2014;69: 456–461. [DOI] [PubMed] [Google Scholar]
  • 19.Gottwald T, Poole G, McCollum T, Hall D, Hartung J, Bai J, et al. Canine olfactory detection of a vectored phytobacterial pathogen, Liberibacter asiaticus, and integration with disease control. Proc Natl Acad Sci. 2020;117: 3492‐3501. doi: 10.1073/pnas.1914296117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Gottwald T, Poole G, Taylor E, Luo W, Posny D, Adkins S, et al. Canine olfactory detection of a non-systemic phytobacterial citrus pathogen of international quarantine significance. Entropy (Basel). 2020;22: 1269. doi: 10.3390/e22111269 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Angle C, Angle C, Waggoner LP, Ferrando A, Haney P, Passler T. Canine detection of the volatilome: a review of implications for pathogen and disease detection. Front Vet Sci. 2016;3: 47. doi: 10.3389/fvets.2016.00047 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Chen H, Qi X, Ma J, Zhang C, Feng H, Yao M. Breath-borne VOC Biomarkers for COVID-19. MedRxiv [Preprint]. 2020 medRxiv 2020.06.21.20136523 [posted 2020 June 24; cited 2021 Apr 17]: [11 p]. Available from: 10.1101/2020.06.21.20136523. [DOI]
  • 23.Grassin-Delyle S, Roquencourt C, Moine P, Saffroy G, Carn S, Heming N, et al. Metabolomics of exhaled breath in critically ill COVID-19 patients: A pilot study. EBioMedicine. 2021;63: 103154. doi: 10.1016/j.ebiom.2020.103154 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Damas J, Hughes GM, Keough KC, Painter CA, Persky NS, Corbo M, et al. Broad host range of SARS-CoV-2 predicted by comparative and structural analysis of ACE2 in vertebrates. Proc Natl Acad Sci. 2020;117: 22311–22322. doi: 10.1073/pnas.2010146117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Bosco-Lauth AM, Hartwig AE, Porter SM, Gordy PW, Nehring M, Byas AD, et al. Experimental infection of domestic dogs and cats with SARS-CoV-2: Pathogenesis, transmission, and response to reexposure in cats. Proc Natl Acad Sci. 2020;117: 26382–26388. doi: 10.1073/pnas.2013102117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Porritt F, Shapiro M, Waggoner P, Mitchell E, Thomson T, Nicklin S, et al. Performance decline by search dogs in repetitive tasks, and mitigation strategies. Appl Anim Behav Sci. 2015;166: 112–122. [Google Scholar]
  • 27.Cobb M, Branson N, McGreevy P, Lill A, Bennett P. The advent of canine performance science: offering a sustainable future for working dogs. Behav Process. 2015;110: 96–104. [DOI] [PubMed] [Google Scholar]
  • 28.Gadbois S, Reeve C. Canine olfaction: scent, sign, and situation. In: Horowitz A, editor. Domestic dog cognition and behavior: the scientific study of Canis familiaris. Berlin: Springer-Verlag; 2014. pp. 2–30. [Google Scholar]
  • 29.Hare B, Ferrans M. Is cognition the secret to working dog success? Anim Cogn. 2021Mar9. doi: 10.1007/s10071-021-01491-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP, Irwig L, et al. STARD 2015: an updated list of essential items for reporting diagnostic accuracy studies. BMJ (Clinical research ed.). 2015;351: h5527. doi: 10.1136/bmj.h5527 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Spector M. Clicker Training for Obedience. Waltham, MA: Sunshine Books; 2005. [Google Scholar]
  • 32.Terrace HS. Discrimination learning with and without “errors”. J Exp Anal Behav. 1963;6: 1–27. doi: 10.1901/jeab.1963.6-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Terrace HS. Wavelength generalization after discrimination learning with and without errors. Science. 1964;144: 78–80. doi: 10.1126/science.144.3614.78 [DOI] [PubMed] [Google Scholar]
  • 34.Bujang MA, Adnan TH. Requirements for minimum sample size for sensitivity and specificity analysis. J Clin Diagn Res. 2016;10: YE01–YE06. doi: 10.7860/JCDR/2016/18129.8744 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Cohen J. Statistical power analysis for the behavioral sciences. Mahwah, NJ: Lawrence Erlbaun Associates; 1988. [Google Scholar]
  • 36.Radbel J, Jagpal S, Roy J, Brooks A, Tischfield J, Sheldon M, et al. Detection of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) is comparable in clinical samples preserved in saline or viral transport medium. J Mol Diagn. 2020;22: 871–875. doi: 10.1016/j.jmoldx.2020.04.209 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Lu X, Wang L, Sakthivel SK, Whitaker B, Murray J, Kamili S, et al. US CDC real-time reverse transcription PCR panel for detection of severe acute respiratory syndrome coronavirus 2. Emerg Infect Dis 2020;26: 1654–1665. doi: 10.3201/eid2608.201246 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Corman V, Bleicker T, Brünink S, Drosten C, Landt O, Koopmans M, et al. Diagnostic detection of 2019-nCoV by real-time RT-PCR. Available from: https://www.who.int/docs/default-source/coronaviruse/protocol-v2-1.pdf?sfvrsn=a9ef618c_2 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Piewbang C, Rungsipipat A, Poovorawan Y, Techangamsuwan S. Development and application of multiplex PCR assays for detection of virus-induced respiratory disease complex in dogs. J Vet Med Sci 2017;78: 1847‐1854. doi: 10.1292/jvms.16-0342 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Bustin SA, Beaulieu J-F, Huggett J, Jaggi R, Kibenge FSB, Olsvik PA, et al. MIQE précis: Practical implementation of minimum standard guidelines for fluorescence-based quantitative real-time PCR experiments. BMC Mol Biol 2010;11: 74. doi: 10.1186/1471-2199-11-74 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Vogels C, Fauver J, Ott I, Grubaugh N. Generation of SARS-COV-2 RNA transcript standards for qRT-PCR detection assays. Protocols.io. Available from: 10.17504/protocols.io.bdv6i69e (2020). [DOI] [Google Scholar]
  • 42.Stordeur P, Poulin LF, Craciun L, Zhou L, Schandené L, de Lavareille A, et al. Cytokine mRNA quantification by real-time PCR. J Immunol Methods 2002;259: 55–64. doi: 10.1016/s0022-1759(01)00489-6 [DOI] [PubMed] [Google Scholar]
  • 43.Centers for Disease Control and Prevention. CDC 2019-novel coronavirus (2019-nCoV) real-time RT-PCR diagnostic panel. Available from: https://www.fda.gov/media/134922/download (2020).
  • 44.National Academies of Sciences, Engineering, and Medicine 2020. Rapid Expert Consultation on Critical Issues in Diagnostic Testing for the COVID-19 Pandemic (November 9, 2020). Washington, DC: The National Academies Press. Available from:: 10.17226/25984. [DOI] [Google Scholar]
  • 45.Mina MJ, Parker R, Larremore DB. Rethinking Covid-19 test sensitivity—A strategy for containment. N Engl J Med. 2020;383: e120. doi: 10.1056/NEJMp2025631 [DOI] [PubMed] [Google Scholar]
  • 46.Behzadinasab S, Chin AWH, Hossein M, Poon LLM, Ducker WA. SARS-CoV-2 virus transfers to skin through contact with contaminated solids. MedRxiv [Preprint]. 2021. medRxiv 2021.04.24.21256044 [posted 2021 April 28; cited 2021 May 2]: [18 p]. Available from: 10.1101/2021.04.24.21256044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Woloshin S, Patel N, Kesselheim AS. False negative tests for SARS-CoV-2 infection—challenges and implications. N Engl J Med. 2020. doi: 10.1056/NEJMp2015897 [DOI] [PubMed] [Google Scholar]
  • 48.Walker DB, Walker JC, Cavnar PJ, Taylor JL, Pickel DH, Hall SB, et al. Naturalistic quantification of canine olfactory sensitivity. Appl Anim Behav Sci. 2006;97: 241–254. [Google Scholar]
  • 49.Turcsán B, Kubinyi E, Miklósi A. Trainability and boldness traits differ between dog breed clusters based on conventional breed categories and genetic relatedness. Appl Anim Behav Sci. 2011;132: 61–70. doi: 10.1016/j.applanim.2011.03.006 [DOI] [Google Scholar]
  • 50.Leighton EA, Hare E, Thomas S, Waggoner LP, Otto CM. A solution for the shortage of detection dogs: a detector dog center of excellence and a cooperative breeding program. Front Vet Sci. 2018;5: 284. doi: 10.3389/fvets.2018.00284 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Moser AY, Bizo L, Brown WY. Olfactory generalization in detector dogs. Animals. 2019;9: 702. doi: 10.3390/ani9090702 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Arantes J, Machado A. Errorless learning of a conditional temporal discrimination. J Exp Anal Behav. 2011;95: 1–20. doi: 10.1901/jeab.2011.95-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Oldenburg C Jr, Schoon A, Heitkönig IMA. Wildlife detection dog training: A case study on achieving generalization between target odor variations while retaining specificity. J Vet Behav. 2016;13: 34e38. doi: 10.1016/j.jveb.2016.03.008 [DOI] [Google Scholar]
  • 54.Lazarowski L, Krichbaum S, DeGreeff LE, Simon A, Singletary M, Angle C, et al. Methodological considerations in canine olfactory detection research. Front Vet Sci. 2020;7: 408. doi: 10.3389/fvets.2020.00408 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Elliker KR, Sommerville BA, Broom DM, Neal DE, Armstrong S, Williams HC. Key considerations for the experimental training and evaluation of cancer odour detection dogs: lessons learnt from a double-blind, controlled trial of prostate cancer detection. BMC Urol. 2014;14: 22. doi: 10.1186/1471-2490-14-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Pilley JW, Reid AK. Border collie comprehends object names as verbal referents. Behav Process. 2011;86: 184–195. doi: 10.1016/j.beproc.2010.11.007 [DOI] [PubMed] [Google Scholar]
  • 57.Fugazza C, Miklosi A. Depths and limits of spontaneous categorization in a family dog. Sci Rep. 2020;10: 3082. doi: 10.1038/s41598-020-59965-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Pilley JW. Border collie comprehends sentences containing a prepositional object, verb, and direct object. Learn Motiv. 2013;44: 229–240. [Google Scholar]
  • 59.Hare B, Woods V. The genius of dogs–How dogs are smarter than you think. New York: Dutton; 2013. [Google Scholar]
  • 60.Wyllie AL, Fournier J, Casanovas-Massana A, Campbell M, Tokuyama M, Vijayakumar P, et al. Saliva or nasopharyngeal swab specimens for detection of SARS-CoV-2. N Engl J Med. 2020;383: 1283–1286. doi: 10.1056/NEJMc2016359 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Silva J, Lucas C, Sundaram M, Israelow B, Wong P, Klein J, et al. Saliva viral load is a dynamic unifying correlate of COVID-19 severity and mortality. medRxiv [Preprint]. 2021. Jan 10: 2021.01.04.21249236. doi: 10.1101/2021.01.04.21249236 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Vesga O, Valencia AF, Mira A, Ossa F, Ocampo E, Agudelo M, et al. Dog savior: immediate scent-detection of SARS-COV-2 by trained dogs. BioRxiv [Preprint]. 2020. bioRxiv 2020.06.17.158105 [posted 2020 June 19; revised 2021 Apr 16; cited 2021 Apr 17]: [17 p]. Available from: https://www.biorxiv.org/content/10.1101/2020.06.17.158105v1.full. [Google Scholar]
  • 63.Jendrny P, Schulz C, Twele F, Meller S, von Köckritz-Blickwede M, Osterhaus ADME, et al. Scent dog identification of samples from COVID-19 patients—a pilot study. BMC Infect Dis. 2020;20: 536. doi: 10.1186/s12879-020-05281-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Grandjean D, Sarkis R, Lecoq-Julien C, Benard A, Roger V, Levesque E, et al. (2020) Can the detection dog alert on COVID-19 positive persons by sniffing axillary sweat samples? A proof-of-concept study. PLoS ONE. 2020;15: e0243122. doi: 10.1371/journal.pone.0243122 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Essler JL, Kane SA, Nolan P, Akaho EH, Berna AZ, DeAngelo A, et al. Discrimination of SARS-CoV-2 infected patient samples by detection dogs: A proof of concept study. PLoS One. 2021;16: e0250158. doi: 10.1371/journal.pone.0250158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Jones RT, Guest C, Lindsay SW, Kleinschmidt I, Bradley J, Dewhirst S, et al. Could bio-detection dogs be used to limit the spread of COVID-19 by travellers? Journal of Travel Medicine. 2020;27: taaa131. doi: 10.1093/jtm/taaa131 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Dickey T, Junqueira H. Toward the use of medical scent detection dogs for COVID-19 screening. J Osteopath Med. 2021;121: 141–148. doi: 10.1515/jom-2020-0222 [DOI] [PubMed] [Google Scholar]
  • 68.Duffy DL, Hsub Y, Serpell JA. Breed differences in canine aggression. Appl Anim Behav Sci. 2008;114: 441–460. [Google Scholar]
  • 69.Mikkola S, Salonen M, Puurunen J, Hakanen E, Sulkama S, Araujo C, et al. Aggressive behaviour is affected by demographic, environmental and behavioural factors in purebred dogs. Sci Rep. 2021;11: 9433. doi: 10.1038/s41598-021-88793-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Horton R. Offline: The case for No-COVID. Lancet. 2021;397: 359. doi: 10.1016/S0140-6736(21)00186-0 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Etsuro Ito

23 Jun 2021

PONE-D-21-16873

Highly sensitive scent-detection of COVID-19 patients in vivo by trained dogs

PLOS ONE

Dear Dr. Vesga,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

I could obtain the comments from only one reviewer. He/she suggested that the manuscript was too long and difficult to follow. Thus, please rearrange the manuscript.

Please submit your revised manuscript by Aug 07 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Etsuro Ito

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found athttps://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2.  Your ethics statement should only appear in the Methods section of your manuscript. If your ethics statement is written in any section besides the Methods, please delete it from any other section.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Since the outbreak of COVID-19 pandemic several peer-reviewed papers have been published, showing that dogs, even after a relatively short operant conditioning training, are able to discriminate odor samples collected from COVID-19 positive donors, from those collected from healthy controls. The studies demonstrated suprisingly high sensitivity and specificity >90% of COVID-19 detection by trained canines.

The general aim of these studies is to find a simple, inexpensive and high throughoutput screening method for detection of both symptomatic and asymptomatic humans infected with SARS-CoV-2 virus, to isolate such people to prevent further dissemination of the virus, even without being aware.

The reviewed paper generally supports previous findings, however, it includes some interesting issues that have not been addressed in previously published papers, as well as some novel methodological approaches. Therefore this paper has some merits that makes it worth publicating.

In my review I will focus chiefly on canine aspects of the study. Not being an epidemiologists, I feel not competent enough to evaluate the bio-safety and preventive measures to avoid contagion with SARS-CoV-2 during these experiments and to evaluate methods like rRT-PCR assay and RNA quantification to assess contagion hazard during collecting and handling of samples, as well as during in vitro and in vivo tests, as presented in the attached videos. However, in view of the latest news on new mutations of the virus, which are supposed to be more infectious that the previous variants, the procedure demonstrated on video S2 seems to neglect one of the main preventive measures that are recommended at least during the first phase of the pandemic, and namely distancing, avoiding direct contact e.g. by shaking hands, disinfection etc. Although the authors showed that the dogs and the persons involved in the experiments did not contracted the virus, demonstrating that nothing had happened at disobeying the preventive recommendations, does not mean that these recommendation can be generally disobeyed.

In my opinion, before accepting this manuscript for publication the structure of the submission should be improved, since it is chaotic, making the paper too long and difficult to follow. The authors divided the description of the methodology and results into main body of the paper and supplementary information. It is not clear to me why the authors placed a part of the description of the material and methodology in the main body of the submission and another part in the supporting information. The authors in the supporting info reiterate some details that were given in the main body of the submission. As supplementary info I would rather expect detailed raw data, video files (which are in fact attached, however, some short explanations to the details shown on the videos would be helpful), instead of a longer text with methodology, results, acknowledgements and references.

The authors listed 5 aims of their study

Aim #1 - Can dog breeds other than working breeds commonly used for olfactory detection, in this case pit-bull and nordic mix sled-dog (siberian husky x alaskan malamute), be trained and employed as medical detectors. However, using only one single dog of a breed does not tell much about ability of the breed to be efficiently trained for a specific olfactory task. In almost every dog breed there are individuals that can be successfully trained to perform any task with reasonable training workload and within reasonable time-frame, but it does not mean that a breed is generally recommended for the task. Therefore I suggest to discuss some limitation of this aim in the Discussion section.

Aim #2, assessing how is the lowest number of pattern odor donors for the training to achieve generalization of odor signature characteristic for COVID-19 in order to detect target odor in human population. It is generally recommended that dogs trained to detect diseases in humans should be trained on a number of pattern odor samples (high number of donors with a disease diagnosed). It is supposed that dogs could memorize individual components of odor samples (individual human odor) during training, if samples from too few donors are used, and then may have problems, when it comes to detection of odor samples collected from other donors. This issue is particularly important in COVID-19 detection due to high contagiousness if the SARS-CoV-2 virus and still not knowing exactly how people can contract the virus, especially with regards to new virus mutations. This would mean that the less COVID-19 positive donors from whom the samples have to be taken for the dog training, the lesser hazard to contract the virus. The authors domonstrated that pattern samples taken from only three donors, are sufficient to generalize the COVID-19 odor to the other nine people. The main question is, however, to what odor the dogs really alert. Previous studies showed that the dogs can be trained to alert to sweat samples (Grandjean et al. 2020, Angeletti et al. 2021) or to saliva or tracheobronchial secretions (Jendrny et al. 2020) collected from COVID-19 positive donors. As to the human sweat, it is commonly acknowledged that it contains individual odor component, genetically determined as well as influenced by bacterial decomposition. To my knowledge, no exact data are available as to the individual component of the odor of the saliva or tracheobronchial secretion. If the dogs almost perfectly generalize the sweat or saliva/trachoebronchial secretion odors of only three COVID-positive donors, this would mean that there is a defined, dominating odor of one or a single combination of few volatile organic compounds that masks the individual human odor. If ethyl butanoate was found as the most abundant VOC in the breath COVID-positive patients, (Chen et al. 2020), the question is if ethyl butanoate coud be considered a main candidate for a COVID-19 odor marker. Usually, however, it is a variety of combinations of VOCs in human odor samples, and as it was shown in studies on cancer odor markers, no single odor marker of the disease exist. The issue of the origin of the hypothetical COVID odor should be addressed in the Discussion section.

Aim #3, assessing the effects of sample numbers on the diagnostic metrics in vitro and in vivo under controlled experimental conditions, i.e., efficacy. Assessing such parameters as detection sensitivity, specificity and accuracy is a typical aim in such type of studies, allowing comparison and/or confirmation or throwing into question the results of other studies. The authors of the reviewed article include additional detection parameters like PPV, NPV and prevalence, which were not considered in any of the previous papers.

Aim #4 - Quantitative assessment of the detection threshold in terms of copies of single stranded viral RNA per milliliter (ssRNA/mL). This data would be helpful in collecting and preparing samples for canine trainig and testing. However, a question arises if the dogs alert to viral RNA as such, which is probably odorless, or to products of the changed cell metabolisms caused by the virus. Further question is if a given number of copies of ssRNA/ml is directly related to the amout of a single VOC or VOCs produced, to which the dogs probably alert.

Aim #5 - assessing real-life performance of the dogs in in vivo screening COVID positive passengers was an original and to my knowledge not addressed aim in previous studies on medical detection dogs trained to detect diseases in humans.

The in vitro part of the study uses experimental setup that differs from previously published studies both on cancer markers detection and COVID-19 detection by trained canines. The authors conducted the trials with dogs not indoors as it was the case in all other studies known to me, but outdoors, arranging 100 odor samples in 10 rows of 10 samples 2 m apart. This setup requires an area of 400 square meters, which may be not always availabe for experimentations. Secondly, conducting trials outdoors involves such hardly controllable confounding factors as distraction of dogs, weather conditions, uncontrollable migration of odor plume etc.

The videos attached to the suplementary info, show apparently a training phase under single-blind protocol meaning that the handler was blind to the position of the target sample in the lineup or to the COVID-positive person in the queue, but the experimenter who activated the clicker to give a signal to the handler that the alert was correct, was aware of the position of the target. While on the video S1 all alerts of the dog were rewarded, and the behavior of the dog, the handler and partly the experimenter can be seen, the video S2 was recorded probably for the purpose of the TV, incuding some shots from different positions that are impressive for the TV audience but does not document well the behavior of the dog, the handler and the experimenter . In the video S2 the dogs are sometimes rewarded after alert and sometimes not. Thus, it is not clear whether the video S2 shows a single blind training phase and alerts that are not rewarded were false, or it shows a real screening and the not rewarded alerts were true double blind trials. It would be useful to have some comments of the authors in the suplemenary info on particular alerts and rewarding of dogs recorded on the videos.

There is a basic difference to a real screening which is a true double-blind procedure, meaning that nobody knows if the alert was correct or false, and the dog should not be rewarded for a false alert. The dogs in this study are working on leash, which could be justified by a possibility to urge the dogs to work systematically i.e. sniffing all 100 in vitro samples and to sniff systematically people on the subway station. Although it cannot be seen that the handler gives overt cue to the dogs, and the dogs are well trained to work systematically, however, the leash is considered by many trainers a wired communication between handler and dog and is questionable in this type of detection. On the video S1 the experimenter with the clicker follows closely the dog in the scent lineup. Even if the dog does not seem to pay attention to the experimenter at this stage of deployment, it could be only a matter of time for the dog to learn to observe the experimenter to decipher when it should alert or not (the „Clever Hans effect”). Although theoretically there could be a variable reinforcement ratio, meaning that animals can be rewarded not after each correct behavioral response, but only after some, in practice the dogs, especially after a longer deployment, when not being rewarded frequently may try to earn a reward by using a trial-and-error strategy, which causes making more false alerts. The problem with detection dogs is that they work not to detect odors which have no biological relevance or rewarding values for canines, but the dogs rather work to find an opportunity to earn a reward, due to association between trained odor and a reward, that was created in the process of operant conditioning. Therefore the detection sensitivity and specificity may vary during the deployment period and should be systematically checked. At each of the training or deployment period an appropriate „success rate” should be settled to maintain dogs’ interest for work. While during the real screening test in the lineup of 100 samples it can be controlled how many samples are of known status (positive and negative) and how many samples are of unknown status, when testing people in vivo in a queue, it is hardly to controll who of them are true positive, true negative to reward/not reward the dog for a correct response, unless such persons diagnosed previously are available for the trials. This issues should be adressed in the Discussion section.

If the dogs in this study did not mark as positive any of the patients with respiratory diseases other than COVID-19 this would mean that a very specific VOC or combination of VOCs are characteristic for COVID-19 and should be identified chemically. The other question is if the same VOC or VOCs are characteristic for any new mutations of the virus. This issue should be addressed in the Discussion. Secondly, if the dogs, independently of the experimental design (in vitro and in vivo) and independently of COVID-19 severity, ranging from asymptomatic to pre-symptomatic, sick and very sick patients, alerted with a very high accuracy, the question should be discussed to what they actually alert and what is the origin of the odor and how the odor will be produced.

Minor remarks

L.36 and elsewhere „interrogating” is not a good wording, I suggest „sniffing”

L.36 there are some doubts if sniffing directly the body of patients would be an ideal method because there are some drawbacks e.g. interactions between subjects and dog, fear of dogs, refusal to be sniffed etc .

L.56 it could be expected that dogs that are trained on saliva samples would make more false alerts when sniffing hand palms.

L.103 Here the study objectives and aims should be placed.

L.115 „For maximal output” - this wordng is not clear: working dogs have to be rewarded not for a maximal output but to produce an association between an odor and the reward (operant conditioning)

L.115 „….dogs must be rewarded for each positive finding….” – not quite precise statement: there is also a variable reinforcement ratio – rewording not for every bet only for some of the correct positive findings. This issue concerns real screening under „true” double blind conditions, where the dogs should not be rewarded for each positive finding because it is not known if the alert was correct or false

L.121 Rather „olfactory ability” than „scent power”

L.147 Do the do the dogs identify the virus or alert to the VOCs produced by changed cell metabolisms during infection ?

Tab.1. were some SARS-positive donors asymptomatic ?

L.155 which environmental modifications are meant ?

L.166 – what was the correct behavior (alert), who activated the clicker ?

L.169 how many trials per day ?

L.173 – usually dogs make some false alerts of missess from time to time. How many error-free consecutive trials have to be made?

L.194 what was a criterion for passing to the next training stage (% of correct alerts?)

L.198 The training method for "in vivo” screening was not sufficiently described. How many donors were used for the training? Here it could be understood that 3 donors were used for the training, however in supplementary info page 5 third line from the bottom 400 subjects used for the training are mentioned. What was the first stage of the in vivo training ?

L.201-202 – what kind of samples were collected for the in vivo training (sweat or saliva or secretions) ? were those samples taken only from 51 COVID positive patients, meaning that >100 samples per donor ? How were the samples handled for biosecurity ?

L.207 - dog training was partly described in the paragraph Design and Sample Size.

There is a confusion in describing particular sections: first the animals should be described, then the collecting of odor samples, dog training and statistical methods.

L.226-227 „..only after obtaining very high diagnostic metrics in vitro…”. – how high were these metrics ? >90% ?

L.229 – siffing hand palms without prior washing may involve such issues as confounding odors of food, when a person ate something shortly before being sniffed by the dog, attractive or aversive individual odor of some persons, etc. Also dog-human interaction, fear of dogs may play a role. These issues should be discussed in the revision.

L.320-322 Does the RNA smell? – probably dogs alert to some VOCs produced by cells during infection with SARS-Cov-2. Is there a direct relation between the number of copies of ssRNA/mL and the amount of those putative VOCs ?

L.337 Detailed methodology should be described in the M&M section of the manuscript and not in the Results.

L.341 and 351. In practice it is hardly possible to achieve all trials without any error. Only a series of error-free trials is usually possible. The question is how low was such series – one or 5 or 10 or more trials on 100 samples ?

L.352 – were „zero” trials conducted ? (with no target odor among the 100 negative samples)

L.369-370 „…When the dogs interrogated the first positive patient, all six recognized the scent-print of SARS-CoV-2 and went down without hesitation….” - on videos S1 and S2 played at slow motion some hesitations of dogs can be seen. The first positive patient could be actually alerted without any hesitation but the question is if the further (many) patients would be indicated without any hestitations as well.

L.370-373 „….it proved to be a difficult endeavor…..”. It seems that the training of dogs to alert to SARS-Cov-2 on any belongings of the patients would be too challenging. It would be better to standardize the method and to improve canine proficiency at working on unified odor samples or on people.

L.377 It seems that the confounding effect of individual odor of positive patients tested in vivo plays no role ?

L.397-398 some parts of the description of material and methods are doubled and scattered throughout the main text and the supplementary info. This makes the whole submission too long and difficult to follow.

Figs 5 and 6 seem to be redundant if the same results were given in tables 3 and 4 respectively ?

L.537-542 „…We also observed several times during training that the dogs spontaneously marked as positive the scientists that had touched any COVID-19 patient, or the cell phones of nurses and physicians in care of COVID-19 patients. It means that trained canines detect the scent-print of SARS-CoV-2 in contaminated individuals or in their belongings and, since contamination could lead to infection [39], the dogs actually identify potential COVID-19 cases before infection takes place…” – this statement should be critically revised and clear limitation should be indicated. If the dogs „several times” alerted positively to the experimenters or to the belongings of the hospital staff, there is no proof that the dogs identify COVID-19 before infection takes place. This could be simply false alerts and there are no proofs how many misses (false negative = not detecting cases before infection) would be found.

L.569-573 „…..expert detection dogs remembered a new scent-print 98% of the time as long as it was located first in a line-up with five distractors, but performance went down when the positive stimulus was located farther, dropping to 11.5% at the sixth location [28]. Therefore, it seems impossible for a dog to remember the odor of up to 12 different individuals randomly allocated among 100 distractors….” - the other studies do not support this finding, as the dogs are able to memorize much more than 12 individual scents and the percentage of alerts to memorized scent does not depend of the location in the lineup.

Fig 7. Testing 550 individuals in vivo within 5 hours seems to be definitely too strenuous for dogs. Even testing 110 persons within one hour of work, depending on weather conditions would be very strenuous. It should precised how long was a work bout without break.

In summary, this paper has some merits and would be acceptable for publication in PloS after major revision, considering all critical points and indicating clearly limitations of the metod.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Sep 29;16(9):e0257474. doi: 10.1371/journal.pone.0257474.r002

Author response to Decision Letter 0


29 Jul 2021

Following the instructions of the Academic Editor, we uploaded a file labeled "Response_to_Reviewers" containing a point-by-point answer to comments from the editor and reviewer.

Attachment

Submitted filename: Response_to_Reviewers.docx

Decision Letter 1

Etsuro Ito

4 Aug 2021

PONE-D-21-16873R1

Highly sensitive scent-detection of COVID-19 patients in vivo by trained dogs

PLOS ONE

Dear Dr. Vesga,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

English should be edited as the reviewer suggested.

Please submit your revised manuscript by Sep 18 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Etsuro Ito

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors addressed sufficiently my comments in their rebuttal, but have only partly incorporated amendments into the revision. First of all, the structure of the submission should be further improved, since it remains not concise enough and chaotic. Eventually, it is up to the Editor of PloS to decide if the overall structure of the revision is acceptable or should be changed. In my opinion, however, the main body of a paper should contain all essential information and data of the study. I have no comments to the revised Introduction section including hypotheses, except for the lines 141-145 that fit rather to the M&M section than to the Introduction. I think in the Material and Methods the reader should be able to find all essential information on animals, collecting and handling of odor samples, training procedure and statistical methods used in the study, without searching for supporting information. The supporting information is useful for somebody who wants to learn all additional details of the study. In the present form of the revision, the reviewer (and the readers), have to jump to the supporting info and back to the main text, which is inconvenient and makes following of the paper not easy.

Being not an English native speaker, I will not evaluate the language of the paper, however, to me the revision requires extensive editing of English by a native speaker, who is familiar with specific wording concerning both COVID-19 epidemiology and canine detection. Nevertheless, I try to give some suggestions:

L.48-49, I suggest "...A real-life (in vivo) performance was determined 75 days after in vitro effectiveness assay...”

L.51 Here and elsewhere: is the word "interrogation" appropriate in this context ? I would suggest: " ...Three dogs were used to examine the scent of 350 volunteers, who agreed to participate both in test with canines and in rRT PCR testing...”

L.55-58 This statement is imprecise. The task of the dogs is not to discriminate odoriferous contamination from infection since probably all odor samples collected in reality, are to some extent contaminated, and no pure "infection odor" exists. The dog should be rather trained to ignore contamination and to indicate "infection odor" regardless its contamination.

L.76 The nations did not demonstrate. I suggest ".. It was clearly demonstrated in several countries that...."

L.111 I suggest „ … the dog must be reinforced……with a reward……”

L. 115 I suggest „…selection process of dog candidates for the olfactory training e.g. for detection of explosives is needed….”

L.117 „exceeds” instead of „excels”

L.118 „of working in the scent lineup”

L.119 „human odor” instead of „human subject”

L.123. „….learning ability, trainability and ability to cooperate with humans…” instead of „minds”.

L.141-144 – this passage fits rather to the M&M section than to the Introduction.

L.145 „Ultimate goal” instead of „product” (?)

L.155-160 this passage is redundant since it reiterates information given in the Introduction.

L.160-164 (up to „…from each other…”) this passage should be included into Material section (collecting odor samples)

L.164-178 this passage should be shifted to line 211 (Dog training)

L.166. Fig.1. showing the training phases should be cited here.

L.196-200 Should be moved to the Material section - collecting odor samples

L.202-208 A separate paragraph on Ethical permission, should be moved to the beginning of the M&M section. The passage in lines 202-208 has nothing to do with Sample size.

L.235 My suggestion: „ ……could not bite, lick or touch ….”

L.265-270 Statistical analyses should be the last separate paragraph of the M&M section.

L.447-449 Rewarding some but not all correct alerts during the effectiveness assay cannot be considered as human error. In real screening scenario neither the experimenter nor the handler does know if the dog's response was correct or wrong (true double blind procedure). Therefore during real screening scenario, basically no rewarding have to be applied. Increasing the false negative alerts, or false positive alerts as a consequence of not rewarding the dog for EVERY correct alert on real people, is one of the critical points of the canine screening. It remains an open question, if the dogs that are subjected to a sustaining/improving training in order to reduce false negative and false positive alerts, using odor samples and lineup, would equally well alert on live people (in long term). There is something like context dependent olfactory learning that has to be taken into consideration.

L.462 „bred” instead of „created”

L.463 „suitability” instead of „excellence”

L.464 add „concentration ability”

L.465 „individual dogs” instead of „individual prospect”

L.468 „…the canine recognizes variations….” (?) – rather ignores variations and indicates regardless of distracting odors

L.469 „source” is redundant here. I suggest: „indicates the target odor regardless....”

L.470 but is surprising in view of other medical detection dog training e.g. for cancer detection

L.475-478 This sentence should be rewritten since it is confusing. It is not a problem of using urine as both positive (cancer) and negative (healthy stimulus), but the problem of using the same samples (donors) for the training and testing. Also, Ellier et al. 2014 was not the only study that recommended using odor samples from many donors and conducting the training and testing using different samples (donors).

L.480-481 „ …..using the same type of secretion during the foundational training does not favor errorless discrimination learning…..” (1) - what is the foundational training? - perhaps initial training ?, (2) – finally the dogs have to discriminate odor samples from sick vs healthy humans and not sick humans vs sterile saline solution. The dogs may be perfect at what the authors label as „errorless dicrimination training”, but may show poor performance in real screening scenario.

L.483 what stands for Effect size ? perhaps simply "The different diagnostic metrics"?

L.488-490 „…Had they been scenting in search of odors other than their target…” please rewrite to be more clear. Perhaps: „..Had they been alerting to odors other than the target odor…”?

L.490-492 – please rewrite to be more clear

L.500-508 I suggest to delete the passage in lines 500-508 because canine learning as such goes beyond the scope of this study

L.560 while infectious diseases could be detected in seconds, could they also be controlled in seconds?

Fig 1. „…..training phases and experimental design were planned…”? – or were conducted ?

Table 3 The term Effect size may be confusing. I suggest to delete (%) in the first left side column and insert (%) instead of Effect Size in column captions

In my opinion this is an interesting study that is worth publishing, but would definitely benefit from a better preparation of the second revision, including restructuring and extensive editing of English. Some parts of the Discussion are still not clear or difficult to follow. Therefore I recommend minor revision before final acceptance.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Sep 29;16(9):e0257474. doi: 10.1371/journal.pone.0257474.r004

Author response to Decision Letter 1


27 Aug 2021

Medellin, 27 August 2021

Dear Dr. Ito,

This document contains my point-by-point response to you and to Reviewer 1. We are indebted forever, because thanks to the peer review and editorial process, the manuscript is now readable and understandable.

In blue font, I left your words and those of R1. In red font, my answers. When citing the new text, I used quotes, but left the new text in black font to ease reading. The last step was to shorten the manuscript, and then it was reviewed by an English-native scientist familiar with the methodology (Tonie E. Rocke, PhD. USGS National Wildlife Health Center). Although all suggestions from R1 were not only welcomed but incorporated in the final version of the manuscript, a few might have been let out (unintentionally) or modified slightly along this process.

Sincerely,

Omar Vesga, MD.¬¬

¬¬¬¬¬¬¬-________________________________________________________________________________

PONE-D-21-16873R1

Highly sensitive scent-detection of COVID-19 patients in vivo by trained dogs

PLOS ONE

Dear Dr. Vesga,

English should be edited as the reviewer suggested.

Done. After introducing the changes described below, the manuscript was reviewed and edited (in style) by a US native scientist who is familiar with the methods employed in this study.

Please submit your revised manuscript by Sep 18 2021 11:59PM.

Please include the following items when submitting your revised manuscript:

• A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

Done (this document).

• A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

Done (uploaded along with this document).

• An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

Done (uploaded along with this document).

Etsuro Ito

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

References checked for completeness and correctness; additional references replaced some cited in the previous version. We found no retracted papers.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #1: The authors addressed sufficiently my comments in their rebuttal, but have only partly incorporated amendments into the revision. First of all, the structure of the submission should be further improved, since it remains not concise enough and chaotic. Eventually, it is up to the Editor of PloS to decide if the overall structure of the revision is acceptable or should be changed. In my opinion, however, the main body of a paper should contain all essential information and data of the study. I have no comments to the revised Introduction section including hypotheses, except for the lines 141-145 that fit rather to the M&M section than to the Introduction. I think in the Material and Methods the reader should be able to find all essential information on animals, collecting and handling of odor samples, training procedure and statistical methods used in the study, without searching for supporting information. The supporting information is useful for somebody who wants to learn all additional details of the study. In the present form of the revision, the reviewer (and the readers), have to jump to the supporting info and back to the main text, which is inconvenient and makes following of the paper not easy.

We tried carefully to incorporate all the amendments suggested by R1. When in disagreement (once or twice), we explained our reasons (in this response).

Being not an English native speaker, I will not evaluate the language of the paper, however, to me the revision requires extensive editing of English by a native speaker, who is familiar with specific wording concerning both COVID-19 epidemiology and canine detection. Nevertheless, I try to give some suggestions.

The structure was edited by myself first, and then by a native English-speaking scientist, as suggested by R1, trying to make the manuscript concise and clear without sacrificing the substance. It moved the line numbers and I had to write here the text so R1 can check what we did with every one of her/his observations. The Results section was incorporated back to the manuscript. Now, there are no Methods in the Supporting information file.

L.48-49, I suggest "...A real-life (in vivo) performance was determined 75 days after in vitro effectiveness assay...”.

It was done as suggested, but correcting the expression underlined above. Now, the sentence reads this way: "Seventy-five days after finishing in vivo efficacy experiments, a real-life study (in vivo effectiveness) was executed among the riders of the Metro System of Medellin, deploying the human-canine teams without previous training or announcement.”.

L.51 Here and elsewhere: is the word "interrogation" appropriate in this context ? I would suggest: " ...Three dogs were used to examine the scent of 350 volunteers, who agreed to participate both in test with canines and in rRT PCR testing...”

It was done as suggested, correcting the expression underlined above. Now, the sentence reads this way: “Three dogs were used to examine the scent of 550 volunteers who agreed to participate both in test with canines and in rRT-PCR testing.”.

L.55-58 This statement is imprecise. The task of the dogs is not to discriminate odoriferous contamination from infection since probably all odor samples collected in reality, are to some extent contaminated, and no pure "infection odor" exists. The dog should be rather trained to ignore contamination and to indicate "infection odor" regardless its contamination.

The imprecise statement was eliminated from the Abstract. Now it reads this way: “Canine scent-detection in vivo is a highly accurate screening test for COVID-19, and it detects more than 99% of infected individuals independent of key variables, such as disease prevalence, time post-exposure, or presence of symptoms.”. Those variables had been mentioned before, so we just move them down without altering the meaning of the Abstract.

L.76 The nations did not demonstrate. I suggest ".. It was clearly demonstrated in several countries that....".

The new line reads this way: “It was clearly demonstrated in several countries that early and massive testing, followed by immediate isolation in designated areas away from home and rigorous contact-tracing, were the only measures that effectively stopped the pandemic even before the first vaccine was available [6].”.

L.111 I suggest „ … the dog must be reinforced……with a reward……”

The new line reads this way: “Working dogs must be reinforced for each positive finding with a reward that conveys an extremely high value for them, and performance depends heavily on the intensity of the expectations that such reward generates in their brains during foundational training [26].”

L. 115 I suggest „…selection process of dog candidates for the olfactory training e.g. for detection of explosives is needed….”

The new line reads this way: “Even within optimal training conditions, not all canine individuals will give their best to gain a reward, and a rigorous selection process of dog candidates for the olfactory training is needed [27].”.

L.117 „exceeds” instead of „excels”.

The new line reads this way: “Bloodhound is a working breed with exceptional olfactory ability,”.

L.118 „of working in the scent lineup”.

The new line reads this way: “…but is not suitable for medical detection because most individuals do not enjoy working in the scent lineup.”.

L.119 „human odor” instead of „human subject”.

Line 119 was eliminated.

L.123. „….learning ability, trainability and ability to cooperate with humans…” instead of „minds”.

The new line reads this way: “but cognition research is providing solid evidence that dogs indeed have unusual learning ability compared with other nonhuman animals…”.

L.141-144 – this passage fits rather to the M&M section than to the Introduction.

It is the last paragraph of the introduction, and as such it has the purpose of preparing the reader for the methods and results. It is the usual style in most journals focused on human medical sciences, including PLoS ONE, which states this in their Submission Guidelines under the topic “Introduction”:

The introduction should:

• Provide background that puts the manuscript into context and allows readers outside the field to understand the purpose and significance of the study

• Define the problem addressed and why it is important

• Include a brief review of the key literature

• Note any relevant controversies or disagreements in the field

• Conclude with a brief statement of the overall aim of the work and a comment about whether that aim was achieved

L.145 „Ultimate goal” instead of „product” (?).

The intention of the sentence is to confirme that the aim was achieved. Therefore, there is no place to add an additional goal when all five objectives had been mentioned in the previous paragraph of the introduction. Since the noun “product” (a thing produced by labor) is what R1 objects, we changed it for “outcome” (a final product or end result; a conclusion reached through a process of logical thinking). The fact that the PPV dropped during the effectiveness assay does not invalidate efficacy data, therefore it is not misleading to say that we ended up with a “very fast and cost-effective screening method”.

L.155-160 this passage is redundant since it reiterates information given in the Introduction.

The sentence was replaced by this: “Fig 1 shows the training program and experimental design.”.

L.160-164 (up to „…from each other…”) this passage should be included into Material section (collecting odor samples).

A new sub section called “Specimen collection for in vitro work” was created in Methods, between Sample Size and Dog Training. The paragraph in lines 160-164 was moved there.

L.164-178 this passage should be shifted to line 211 (Dog training).

Done.

L.166. Fig.1. showing the training phases should be cited here.

Done.

L.196-200 Should be moved to the Material section - collecting odor samples.

Done.

L.202-208 A separate paragraph on Ethical permission, should be moved to the beginning of the M&M section. The passage in lines 202-208 has nothing to do with Sample size.

Done.

L.235 My suggestion: „ ……could not bite, lick or touch ….”.

The new line reads this way: “…and the difference between both was that animals allocated to experimental groups could not bite, lick, or touch D1 or D2,…”.

L.265-270 Statistical analyses should be the last separate paragraph of the M&M section.

Done.

L.447-449 Rewarding some but not all correct alerts during the effectiveness assay cannot be considered as human error. In real screening scenario neither the experimenter nor the handler does know if the dog's response was correct or wrong (true double blind procedure). Therefore during real screening scenario, basically no rewarding have to be applied. Increasing the false negative alerts, or false positive alerts as a consequence of not rewarding the dog for EVERY correct alert on real people, is one of the critical points of the canine screening. It remains an open question, if the dogs that are subjected to a sustaining/improving training in order to reduce false negative and false positive alerts, using odor samples and lineup, would equally well alert on live people (in long term). There is something like context dependent olfactory learning that has to be taken into consideration.

The expression “human error” was taken off the sentence, which now reads this way: “. One explanation was handlers rewarding some but not all alerts during the effectiveness assay but, in fact, rRT-PCR results showed that many correct alerts passed unrewarded; it confused the dogs and caused even more false negative alerts.”.

L.462 „bred” instead of „created”.

The new line reads this way: “Second, all six dogs were successful as medical detectors despite belonging to breeds not intended specifically for scent-detection.”:

L.463 „suitability” instead of „excellence”.

See answer to L.465 below.

L.464 add „concentration ability”.

See answer to L.465 below.

L.465 „individual dogs” instead of „individual prospect”.

The new line reads this way: “It supports recent data showing that, more than the breed, the best predictors of suitability for medical detection dogs are the levels of motivation, stamina, determination, resilience, and concentration ability of the individual dog [42,43].”:

L.468 „…the canine recognizes variations….” (?) – rather ignores variations and indicates regardless of distracting odors.

The new line reads this way: “This process, called generalization, applies to learning theory, and in reference to scent-detection means that the canine ignores variations of the positive stimulus and indicates its source regardless of distracting odors [44].”.

L.469 „source” is redundant here. I suggest: „indicates the target odor regardless....”.

Actually, we trained our dogs to alert on the SOURCE of the target scent instead of the odor itself. It is useful for outdoors scent-work, where a moving odor causes the dog could alert before arriving to the source. On the scent-line with humans, it could cause the dog to alert on the wrong individual when wind is blowing towards the dog.

L.470 but is surprising in view of other medical detection dog training e.g. for cancer detection.

The word “surprising” was eliminated. The new sentence reads this way: “Generalization after exposure to just three specimens is to be expected if errorless learning principles are the foundations of training [31].”.

L.475-478 This sentence should be rewritten since it is confusing. It is not a problem of using urine as both positive (cancer) and negative (healthy stimulus), but the problem of using the same samples (donors) for the training and testing. Also, Ellier et al. 2014 was not the only study that recommended using odor samples from many donors and conducting the training and testing using different samples (donors).

The new sentence reads this way: “However, experimental evidence in favor of such hypothesis is scarce, and most citations refer to a work in which urine was employed as positive and negative stimulus during training [48].”.

L.480-481 „ …..using the same type of secretion during the foundational training does not favor errorless discrimination learning…..” (1) - what is the foundational training? - perhaps initial training ?, (2) – finally the dogs have to discriminate odor samples from sick vs healthy humans and not sick humans vs sterile saline solution. The dogs may be perfect at what the authors label as „errorless dicrimination training”, but may show poor performance in real screening scenario.

True, but in vivo experiments (efficacy trial, phase 3) demonstrated a very high effect size for every diagnostic metrics, including PPV. We did not invent errorless discrimination learning, it is actually the product of a very solid scientific work, as cited in the manuscript. The fact the dogs failed during the effectiveness assay does not affect the truth of errorless learning because the reward system was changed. To eliminate the term “foundational training” the new sentence reads this way: “Beyond the many variables specific to certain diseases and specimens that might be responsible for a greater level of difficulty for the dog, using the same type of secretion when the dogs are first trained for scent-detection does not favor errorless discrimination learning [31,32,45].”.

L.483 what stands for Effect size ? perhaps simply "The different diagnostic metrics"?

Just to answer the question of R1: effect size is a mathematical term that refers to the magnitude of a difference between two groups or, simply put, the most important result of most experiments. In this study, the effect size is the difference in diagnostic performance between the index test (K9 scent-detection) and the gold standard (rRT-PCR). To validate our dx test, we used these metrics, or effect sizes: SEN, SPC, PPV, NPV, ACC, and LR. Reviewer 1 is pointing correctly that we have created confusion by calling the values of these metrics “effect sizes”, because one thing is the name of a parameter or a metric or an effect size, and other is its numerical value or magnitude. In the new version, all references to the numeric result of any metric (aka effect size) are called “value”, “magnitude”, or “mean”.

L.488-490 „…Had they been scenting in search of odors other than their target…” please rewrite to be more clear. Perhaps: „..Had they been alerting to odors other than the target odor…”?

The new sentence reads this way: “Had they been alerting to odors other than their target odor, SPC would have been very low, and it was also >95%.”.

L.490-492 – please rewrite to be more clear.

The new sentence reads this way: “While SEN and SPC refer to the index test (canine scent-detection), predictive values quantify the probability that the participants truly had or not COVID-19, taking the reference standard (rRT-PCR) as the truth.”.

L.500-508 I suggest to delete the passage in lines 500-508 because canine learning as such goes beyond the scope of this study.

Done. The reader is directed to references 49-53 in lines 497-499: “Previous knowledge of dog behavior and cognition also suggest that our dogs were not relying in memory when they detected SARS-CoV-2 and discriminated it among so many other odoriferous cues [44,49-54].”.

L.560 while infectious diseases could be detected in seconds, could they also be controlled in seconds?

It is if, as in South Korea and New Zealand, positive patients are isolated immediately. The chain of transmission stops right there. That said, we understand that R1 is pointing to the fact that the data had nothing to do with controlling the transmission of COVID-19. Therefore, the words “and control” were eliminated. The new sentence reads this way: “With improved training methods, canines could, in the near future, provide a sensitive and effective method to detect infectious diseases in a matter of seconds.”.

Fig 1. „…..training phases and experimental design were planned…”? – or were conducted ?.

The new title of Fig 1 reads this way: Fig 1. Efficacy studies. Flow chart depicting the order in which training phases and experimental design were conducted.

Table 3 The term Effect size may be confusing. I suggest to delete (%) in the first left side column and insert (%) instead of Effect Size in column captions.

Done. The title of all columns was modified to reflect the difference between the effect size (metric) and its magnitude (numerical value).

In my opinion this is an interesting study that is worth publishing, but would definitely benefit from a better preparation of the second revision, including restructuring and extensive editing of English. Some parts of the Discussion are still not clear or difficult to follow. Therefore I recommend minor revision before final acceptance.

Done. After introducing the changes described above, the manuscript was reviewed and edited (in style) by a US native scientist who is familiar with the methods employed in this study.

Attachment

Submitted filename: Response_to_Reviewers.docx

Decision Letter 2

Etsuro Ito

2 Sep 2021

Highly sensitive scent-detection of COVID-19 patients in vivo by trained dogs

PONE-D-21-16873R2

Dear Dr. Vesga,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Etsuro Ito

Academic Editor

PLOS ONE

Acceptance letter

Etsuro Ito

14 Sep 2021

PONE-D-21-16873R2

Highly sensitive scent-detection of COVID-19 patients in vivo by trained dogs

Dear Dr. Vesga:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Prof. Etsuro Ito

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File. Supporting information containing S1 and S2 Figs and Videos, as well as S1 through S8 Tables.

    (DOCX)

    Attachment

    Submitted filename: Response_to_Reviewers.docx

    Attachment

    Submitted filename: Response_to_Reviewers.docx

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting Information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES