Amplicon-Based Detection and Sequencing of SARS-CoV-2 in Nasopharyngeal Swabs from Patients With COVID-19 and Identification of Deletions in the Viral Genome That Encode Proteins Involved in Interferon Antagonism

Shona C Moore; Rebekah Penrice-Randal; Muhannad Alruwaili; Nadine Randle; Stuart Armstrong; Catherine Hartley; Sam Haldenby; Xiaofeng Dong; Abdulrahman Alrezaihi; Mai Almsaud; Eleanor Bentley; Jordan Clark; Isabel García-Dorival; Paul Gilmore; Ximeng Han; Benjamin Jones; Lisa Luu; Parul Sharma; Ghada Shawli; Yani Sun; Qin Zhao; Steven T Pullan; Daniel P Carter; Kevin Bewley; Jake Dunning; En-min Zhou; Tom Solomon; Michael Beadsworth; James Cruise; Derrick W Crook; David A Matthews; Andrew D Davidson; Zana Mahmood; Waleed Aljabr; Julian Druce; Richard Vipond; Lisa Ng; Laurent Renia; Peter J M Openshaw; J Kenneth Baillie; Miles W Carroll; James Stewart; Alistair Darby; Malcolm Semple; Lance Turtle; Julian A Hiscox

doi:10.3390/v12101164

. 2020 Oct 14;12(10):1164. doi: 10.3390/v12101164

Amplicon-Based Detection and Sequencing of SARS-CoV-2 in Nasopharyngeal Swabs from Patients With COVID-19 and Identification of Deletions in the Viral Genome That Encode Proteins Involved in Interferon Antagonism

Shona C Moore ^1,^†, Rebekah Penrice-Randal ^1,^†, Muhannad Alruwaili ^1,^†, Nadine Randle ^1,^†, Stuart Armstrong ^1,^†, Catherine Hartley ^1,^†, Sam Haldenby ¹, Xiaofeng Dong ¹, Abdulrahman Alrezaihi ¹, Mai Almsaud ¹, Eleanor Bentley ¹, Jordan Clark ¹, Isabel García-Dorival ¹, Paul Gilmore ¹, Ximeng Han ¹, Benjamin Jones ¹, Lisa Luu ¹, Parul Sharma ¹, Ghada Shawli ¹, Yani Sun ^1,², Qin Zhao ^1,², Steven T Pullan ³, Daniel P Carter ³, Kevin Bewley ³, Jake Dunning ^3,⁴, En-min Zhou ², Tom Solomon ^1,^4,⁵, Michael Beadsworth ⁶, James Cruise ⁶, Derrick W Crook ⁷, David A Matthews ⁸, Andrew D Davidson ⁸, Zana Mahmood ^1,⁹, Waleed Aljabr ^1,¹⁰, Julian Druce ¹¹, Richard Vipond ^3,⁴, Lisa Ng ^1,^4,^5,¹², Laurent Renia ¹², Peter J M Openshaw ¹³, J Kenneth Baillie ¹⁴, Miles W Carroll ^3,^4,⁷, James Stewart ^1,⁵, Alistair Darby ^1,⁵, Malcolm Semple ^1,^4,⁵, Lance Turtle ^1,^4,^5,^*, Julian A Hiscox ^1,^4,^5,^12,^*

¹Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool L3 5RF, UK; Shona.Moore@liverpool.ac.uk (S.C.M.); hlrpenri@liverpool.ac.uk (R.P.-R.); Muhannad.Alruwaili@liverpool.ac.uk (M.A); nprandle@liverpool.ac.uk (N.R.); sarmstro@liverpool.ac.uk (S.A.); csguy@liverpool.ac.uk (C.H.); haldenby@liverpool.ac.uk (S.H.); Xiaofeng.Dong2@liverpool.ac.uk (X.D.); A.Alrezaihi@liverpool.ac.uk (A.A.); Mai.Almsaud@liverpool.ac.uk (M.A.); hlebentl@student.liverpool.ac.uk (E.B.); Jordan.Clark@liverpool.ac.uk (J.C.); isagardo@liverpool.ac.uk (I.G.-D.); gilmore@liverpool.ac.uk (P.G.); Ximeng.Han@liverpool.ac.uk (X.H.); Benjamin.Jones3@liverpool.ac.uk (B.J.); lisaluu@liverpool.ac.uk (L.L.); parul13@liverpool.ac.uk (P.S.); G.Shawli@liverpool.ac.uk (G.S.); Yani.Sun@liverpool.ac.uk (Y.S.); Qin.Zhao@liverpool.ac.uk (Q.Z.); tsolomon@liverpool.ac.uk (T.S.); Z.Mahmood2@liverpool.ac.uk (Z.M.); waljabr@kfmc.med.sa (W.A.); lisa_ng@immunol.a-star.edu.sg (L.N.); jpstewar@liverpool.ac.uk (J.S.); acdarby@liverpool.ac.uk (A.D.); castle@liverpool.ac.uk (M.S.)

²College of Veterinary Medicine, Northwest A&F University, Yangling 712100, China; zhouem@nwsuaf.edu.cn

³National Infection Service, Public Health England, Salisbury SP4 0JG, UK; Steven.Pullan@phe.gov.uk (S.T.P.); daniel.carter@phe.gov.uk (D.P.C.); kevin.bewley@phe.gov.uk (K.B.); jake.dunning@phe.gov.uk (J.D.); richard.vipond@phe.gov.uk (R.V.); Miles.Carroll@phe.gov.uk (M.W.C.)

⁴Health Protection Research Unit in Emerging and Zoonotic Infections, Liverpool L69 7BE, UK

⁵Liverpool Health Partners, Liverpool L3 5TF, UK

⁶Tropical & Infectious Disease Unit, Liverpool University Hospitals NHS Foundation Trust (a member of Liverpool Health Partners), Liverpool L7 8XP, UK; mike.beadsworth@rlbuht.nhs.uk (M.B.); james.cruise@liverpoolft.nhs.uk (J.C.)

⁷Nuffield Department of Medicine, University of Oxford, Oxford OX3 7BN, UK; derrick.crook@ndcls.ox.ac.uk

⁸School of Cellular and Molecular Medicine, University of Bristol, Bristol BS8 1TD, UK; d.a.matthews@bristol.ac.uk (D.A.M.); Andrew.Davidson@bristol.ac.uk (A.D.D.)

⁹Laboratory Department, Directorate of Veterinary in Sulaimany, Rizgari 402, Sulaimani, Kurdistan Region 46001, Iraq

¹⁰Research Center, King Fahad Medical City, Riyadh 11525, Saudi Arabia

¹¹The Peter Doherty Institute for Infection and Immunity, Melbourne 3000, Australia; Julian.Druce@mh.org.au

¹²Infectious Diseases Horizontal Technology Centre (ID HTC), A*STAR, Singapore 138648, Singapore; renia_laurent@immunol.a-star.edu.sg

¹³Faculty of Medicine, Imperial College London, London SW7 2AZ, UK; p.openshaw@imperial.ac.uk

¹⁴Roslin Institute, University of Edinburgh, Edinburgh EH25 9RG, UK; j.k.baillie@ed.ac.uk

Correspondence: lance.turtle@liverpool.ac.uk (L.T.); julian.hiscox@liverpool.ac.uk (J.A.H.)

^†

These authors Contributed equally to this work.

PMCID: PMC7602519 PMID: 33066701

Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the causative agent of coronavirus disease 2019 (COVID-19). Sequencing the viral genome as the outbreak progresses is important, particularly in the identification of emerging isolates with different pathogenic potential and to identify whether nucleotide changes in the genome will impair clinical diagnostic tools such as real-time PCR assays. Although single nucleotide polymorphisms and point mutations occur during the replication of coronaviruses, one of the biggest drivers in genetic change is recombination. This can manifest itself in insertions and/or deletions in the viral genome. Therefore, sequencing strategies that underpin molecular epidemiology and inform virus biology in patients should take these factors into account. A long amplicon/read length-based RT-PCR sequencing approach focused on the Oxford Nanopore MinION/GridION platforms was developed to identify and sequence the SARS-CoV-2 genome in samples from patients with or suspected of COVID-19. The protocol, termed Rapid Sequencing Long Amplicons (RSLAs) used random primers to generate cDNA from RNA purified from a sample from a patient, followed by single or multiplex PCRs to generate longer amplicons of the viral genome. The base protocol was used to identify SARS-CoV-2 in a variety of clinical samples and proved sensitive in identifying viral RNA in samples from patients that had been declared negative using other nucleic acid-based assays (false negative). Sequencing the amplicons revealed that a number of patients had a proportion of viral genomes with deletions.

Keywords: SARS-CoV-2, next-generation sequencing, amplicon, MinION

1. Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in China in 2019 and has sequence similarity to SARS-CoV and certain bat coronaviruses [1]. After an incubation period that averages 5 days but may be between two days and two weeks [2], patients with the disease, termed coronavirus disease 2019 (COVID-19), typically present with fever, myalgia, cough, sore throat, and difficulty breathing. A range of other symptoms are also possible, in particular gastrointestinal symptoms. Approximately 80% of infections are mild, but of those that require hospitalisation, mortality is around 30% [3,4]. SARS-CoV-2 is highly transmissible with a reproductive number (R₀) of around 3 if unmanaged.

SARS-CoV-2 has a positive-sense RNA genome of approximately 30 kb, and viral RNA synthesis occurs in the cytoplasm of an infected cell. During RNA synthesis, several processes occur including replication of the genome and transcription of a nested set of subgenomic messenger RNAs, which encode for the suite of viral proteins, as well as the genome for ORF1AB. During viral RNA synthesis, errors can occur including point mutation of single nucleic acids and recombination [5]. Recombination events manifest themselves in insertions and/or deletions in the viral genome and can result in major changes in viral tropism. One of the unusual features of the SARS-CoV-2 genome sequence, compared to the most closely related coronaviruses, such as SARS-CoV, is the insertion of a furin cleavage site in the spike glycoprotein (S protein) sequence [6]. The origin of the furin cleavage site is unknown but may be either viral or cellular in origin. The presence of the furin cleavage site reduces the dependence on cellular proteases to process the S protein into two functional subunits, S1 and S2. Thus, recombination can result in the insertion of a non-coronavirus sequence. The furin cleavage site is potentially unstable [7].

SARS-CoV-2 has caused a worldwide pandemic and has severe economic and health implications for all affected countries. Given that mechanisms of both point mutation and recombination operate in coronaviruses, monitoring for these changes in the SARS-CoV-2 genome is essential not only for potential contact tracing but also for confidence in vaccine efficacy, which are generally based around the S protein. Experience from developing vaccines against the coronavirus infectious bronchitis virus (IBV), which arguably have been the most successful vaccines developed against coronaviruses so far, has suggested that differences of as little as 5% between the S1 sequences can result in poor cross-protection against different variants of IBV [8]. Differences in S1 of 2% to 3% (10 to 15 amino acids) can change serotype, suggesting that a small number of epitopes are immunodominant with respect to neutralizing antibody [8].

Anecdotal data from diagnostic laboratories (at least based in the UK) early in the outbreak suggested that nucleic acid-based diagnostics to SARS-CoV-2 and complications around the sampling processes can lead to unreliable results and false negatives. The advantage of both laboratory [9] and field-based sequencing [10] approaches in characterising viral infection was illustrated in the 2013–2016 West African Ebola virus outbreak, and allowed the origin of clusters of infection to be rapidly identified [11]. Therefore, the accurate identification and sequencing of SARS-CoV-2 in samples from patients provide multiple information from reducing false negatives, contact tracing, to assessing the suitability of diagnostic assays (especially nucleic acid-based) and investigating whether vaccines are likely to be or remain efficacious in the background of nucleotide substitution and recombination during virus replication. The International Severe Acute Respiratory and emerging Infection Consortium (ISARIC) 4C UK study provided a large and detailed collection of sequentially collected virological samples from well-characterised hospitalised patients; here we assess the utility of the Rapid Sequencing Long Amplicon (RSLA) protocol in monitoring viral RNA from early participants.

2. Materials and Methods

2.1. Ethics and Clinical Information

Patients (n = 24) were recruited under the International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC) Clinical Characterisation Protocol (CCP) (https://isaric.net/ccp) by giving informed consent. ISARIC CCP was reviewed and approved by the national research ethics service, Oxford (13/SC/0149). Samples from clinical specimens were processed at CL3 in the Outbreak Response Laboratory at the University of Liverpool, Liverpool, UK.

2.2. RNA Extraction and Preparation

Nasopharyngeal swabs were collected into viral transport medium from patients with COVID-19. RNA was isolated using either a QIAamp Viral RNA Mini Kit (Qiagen, Mancheste, UK) by spin-column procedure or Trizol LS (Invitrogen, UK), according to the manufacturer’s instructions. Total RNA was purified from SARS-CoV-2 infected Vero cells following AVL inactivation using the Qiagen RNA Mini Kit. Infection of Vero cells was conducted at Public Health England, Porton Down at CL3. RNA samples were treated with Turbo DNase (Invitrogen).

2.3. Primer Design

Alignments generated from the NCBI reference sequence for SARS-CoV-2 NCBI (NC_045512.2) and published sequences on GISAID were used to identify conserved regions for primer design. Primers (Table 1) were chosen that sequentially amplified roughly 1000 bp with an ~200 bp overlapping region.

Table 1.

Primer pairs by sequence order.

Primer Name	Sequence 5′–3′	Location		Size (bp)	Multiplex Pool
Primer Name	Sequence 5′–3′	Start	End	Size (bp)	Multiplex Pool
SARS-CoV-2_1_F	GTGTGACCGAAAGGTAAGATGG	248	269	956	1
SARS-CoV-2_1_R	TTGCATTCATTTGGTGACGC	1203	1184	956	1
SARS-CoV-2_2_F	GGTGTATACTGCTGCCGTGA	944	963	1213	2
SARS-CoV-2_2_R	GCCAATCAAGGACGGGTTTG	2156	2137	1213	2
SARS-CoV-2_3_F	CCGCACTCTTGAAACTGCTC	1912	1931	1254	3
SARS-CoV-2_3_R	GCAGAAGTGGCACCAAATTC	3165	3146	1254	3
SARS-CoV-2_4_F	ACACCACTGGGCATTGATTTAG	2936	2957	1264	4
SARS-CoV-2_4_R	TTTCAGTAGTGCCACCAGCC	4199	4180	1264	4
SARS-CoV-2_5_F	CTTCATCCAGATTCTGCCAC	4052	4071	1296	5
SARS-CoV-2_5_R	AGCAGGTGGATTAAACTTCAACTC	5347	5324	1296	5
SARS-CoV-2_6_F	CAACATTAACCTCCACACGC	4990	5009	1189	6
SARS-CoV-2_6_R	ATCAATAGCCACCACATCACC	6178	6158	1189	6
SARS-CoV-2_7_F	AGAAACCTGCTTCAAGAGAGC	6108	6128	1373	1
SARS-CoV-2_7_R	ATTACAACCGTCTACAACATGCAC	7480	7457	1373	1
SARS-CoV-2_8_F	GTCACTATTGCAACCTACTGTAC	7091	7113	1093	2
SARS-CoV-2_8_R	CTTGCCGAGCTGCTGAAATA	8183	8164	1093	2
SARS-CoV-2_9_F	AATCAGCGTCTGTTTACTACAGTC	7929	7952	1192	3
SARS-CoV-2_9_R	GTGTCAGGGCGTAAACTTTC	9120	9101	1192	3
SARS-CoV-2_10_F	TTGTCGTGCCTGGTTTGC	8856	8873	1303	4
SARS-CoV-2_10_R	ACGTCATCAAGCCAAAGACC	10158	10139	1303	4
SARS-CoV-2_11_F	AGTGGAGCAATGGATACAAC	9917	9936	1239	5
SARS-CoV-2_11_R	AGCTACAGTGGCAAGAGAAG	11209	11190	1239	5
SARS-CoV-2_12_F	AGGGTACACACCACTGGTTG	10995	11014	1185	6
SARS-CoV-2_12_R	CACCATTAGCAACAGCCTGC	12179	12160	1185	6
SARS-CoV-2_13_F	GTGAAGAAATGCTGGACAACAG	12057	12078	1180	1
SARS-CoV-2_13_R	GCACCACCAAAGGATTCTTG	13236	13217	1180	1
SARS-CoV-2_14_F	TAGTTTAGCTGCCACAGTACG	12997	13017	1200	2
SARS-CoV-2_14_R	AGTTAAAGCCCTGGTCAAGG	14196	14177	1200	2
SARS-CoV-2_15_F	ATACGCCAACTTAGGTGAACG	13962	13982	1284	3
SARS-CoV-2_15_R	AACATGTTGTGCCAACCACC	15245	15226	1284	3
SARS-CoV-2_16_F	TGAGTTATGAGGATCAAGATGCAC	14996	15019	1243	4
SARS-CoV-2_16_R	GCCTGTAAGACTGTATGCGG	16238	16219	1243	4
SARS-CoV-2_17_F	CCCAGATCCATCAAGAATCCTAG	15933	15955	1214	5
SARS-CoV-2_17_R	TGCGAGCAGAAGGGTAGTAG	17146	17127	1214	5
SARS-CoV-2_18_F	AAGGTGACTATGGTGATGCTG	16841	16861	1336	6
SARS-CoV-2_18_R	GGTATGCCAGGTATGTCAACAC	18176	18155	1336	6
SARS-CoV-2_19_F	ACTCAAACCACTGAAACAGCTC	17875	17896	1239	1
SARS-CoV-2_19_R	GTCACTACAAGGCTGTGCATC	19113	19093	1239	1
SARS-CoV-2_20_F	AGCTAGTTGTGATGCAATCATGAC	18846	18869	1235	2
SARS-CoV-2_20_R	CTTGTTTGGGACCTACAGATGG	20098	20077	1235	2
SARS-CoV-2_21_F	TTTGGGTGTGGACATTGCTG	19842	19861	1323	3
SARS-CoV-2_21_R	ATAGCCACGGAACCTCCAAG	21164	21145	1323	3
SARS-CoV-2_22_F	TAAGACAGTGGTTGCCTACG	20912	20931	1125	4
SARS-CoV-2_22_R	TCTGAACTCACTTTCCATCCAAC	22036	22014	1125	4
SARS-CoV-2_23_F	TTCGAAGACCCAGTCCCTAC	21895	21914	1405	5
SARS-CoV-2_23_R	TGGATCACGGACAGCATCAG	23299	23280	1405	5
SARS-CoV-2_24_F	TTGAACTTCTACATGCACCAGC	23106	23127	1111	6
SARS-CoV-2_24_R	CCAGAAGTGATTGTACCCGC	24216	24197	1111	6
SARS-CoV-2_25_F	TTGCTGCTAGAGACCTCATTTG	24093	24114	1190	1
SARS-CoV-2_25_R	GCAACTGGTCATACAGCAAAG	25282	25262	1190	1
SARS-CoV-2_26_F	GGTGACATCTCTGGCATTAATGC	25061	25083	1163	2
SARS-CoV-2_26_R	TGCTTACAAAGGCACGCTAG	26223	26204	1163	2
SARS-CoV-2_27_F	ACCAGCTGTACTCAACTCAATTG	26027	26049	1137	3
SARS-CoV-2_27_R	CTGCTACTGGAATGGTCTGTG	27163	27143	1137	3
SARS-CoV-2_28_F	TGACCAGACCGCTTCTAGAAAG	26908	26929	1180	4
SARS-CoV-2_28_R	GCCTCATCCACGCACAATTC	28087	28068	1180	4
SARS-CoV-2_29_F	TGTCACGCCTAAACGAACATG	27876	27896	1147	5
SARS-CoV-2_29_R	GATTTCTTAGTGACAGTTTGGCC	29022	29000	1147	5
SARS-CoV-2_30_F	CGAATTCGTGGTGGTGACG	28550	28568	1173	6
SARS-CoV-2_30_R	GGTGGCTCTTTCAAGTCCTC	29722	29703	1173	6

Open in a new tab

2.4. RT and PCR

SuperScript IV (Invitrogen) was used to generate single-strand cDNA using random primer mix (NEB, Hitchin, UK). The primer sets were used to generate 30 amplicons from the cDNA. Subsequently, primers were pooled to generate six sets each containing five primer pairs. Reactions were performed using 2 × Q5-High Fidelity master mix (NEB) with 0.5 μM of each primer for the individual reactions and 0.1 μM of each primer for the pooled primers. The reaction conditions were as follows: denaturation at 98 °C for 30 sec followed by 35 to 40 cycles of 10 sec denaturation at 98 °C, 30 sec annealing at 66 °C and then 50 sec of extension at 72 °C. A final extension step was done for 2 min at 72 °C.

2.5. Library Preparation for MinION Sequencing

Following amplification, PCR products were purified at a 1:1 ratio with AMPure XP beads (Beckman Coulter, High Wycombe, UK). The library was prepared as per the sequencing by ligation protocol with native barcodes for multiplexing (Oxford Nanopore, Oxford, UK).

2.6. Informatics Analysis

For assembly of the SARS-CoV-2 genomes, Minimap2 (v. 2.17-r941) was used to align fastq sequences to the coronavirus 2 isolate Wuhan-Hu-1 reference genome (NC_045512.2) using the -ax map-ont parameters. Samtools (v.1.10) was used to sort and index alignment files, and Picard (v.2.23.4) was used to mark duplicates. A custom script written in perl (v.5.26.2) was used to determine viral genome coverage, the viral consensus sequence and the presence of indels. Viral genome coverage was visualised in RStudio (v. 4.0.2). For the identification of deletions within the SARS-CoV-2 genome, ONT RSLA library reads were aligned to the coronavirus 2 isolate Wuhan-Hu-1 reference genome using Minimap2, as part of the Nextflow ARTIC analysis pipeline (https://github.com/connor-lab/ncov2019-artic-nf). Raw aligned BAM files were used as input to SVIM, to detect putative deletions. Resulting candidate deletions were filtered to accept any with at least 5 supporting reads and intersected with reference gene annotations to determine which genes intersected with the deletions.

3. Results

3.1. Primer and Amplicon Design

To sequence SARS-CoV-2 with a view to identify both nucleotide polymorphisms and recombination events on the viral genome, a longer read length amplicon-based system was developed. A series of primers were designed (Table 1), allowing overlapping sections of the SARS-CoV-2 genome to be amplified sequentially in 1000 base-paired fragments, with an approximately 200 base pair overlap to facilitate sequence assembly from the amplicon data (Figure 1A). The primers were selected on the basis of conserved regions in the SARS-CoV-2 genome based upon an initial deposition of 17 genomes available on GISAID.

(A) Schematic diagram of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome showing the position of major open reading frames and the position of the amplicons along the genome. (B) Agarose gel electrophoresis analysis of the amplicon products resulting from RT-PCR using the designated forward and reverse primers to amplify the SARS-CoV-2 genome from RNA purified from Vero cells infected with the virus. (C) The amplicon products were purified and sequenced on a single flow cell using an Oxford Nanopore MinION. Shown are the number of reads that map (y-axis) to each amplicon across the SARS-CoV-2 genome from 5′ to 3′ (x-axis).

3.2. Validation of Amplicon Generation and Oxford Nanopore Sequencing Using RNA from Cells Infected with SARS-CoV-2

Purified RNA from SARS-CoV-2 (MT007544.1 GenBank)-infected Vero cells was used to confirm that the primers could generate amplicons. This RNA was used as a template for cDNA synthesis followed by PCR using the conserved primers. This generated 30 separate amplicons covering the SARS-CoV-2 genome, and the synthesis and expected size of these amplicons were confirmed using agarose gel electrophoresis (Figure 1B). These amplicons were sequenced on an Oxford Nanopore (UK) flow cell using MinION and reads were mapped to the SARS-CoV-2 genome confirming that the amplicons represented viral sequence (Figure 1C). This approach was termed Rapid Sequencing Long Amplicons (RSLAs).

3.3. Initial Evaluation of the Amplicon Approach to Detect SARS-CoV-2 on RNA Purified from Nasopharyngeal Swabs Collected from Patients with COVID-19 Correctly Identified a Positive but also a False Negative Sample

The ability of the amplicon-based approach to detect SARS-CoV-2 was initially evaluated on nasopharyngeal swabs collected from patients with COVID-19. The first patient (REMRQ0001) was admitted to the Royal Liverpool University Hospital on 23 February 2020. The patient was a member of staff from the Diamond Princes cruise ship that had an outbreak of SARS-CoV-2 leading to passengers developing COVID-19 [12,13]. The patient was asymptomatic but was screened for SARS-CoV-2 on 20 February 2020, left the ship on 21 February and was repatriated from Japan on 22 February 2020. During the flight, a positive result for SARS-CoV-2 using a nucleic acid-based system was received. A repeat nasopharyngeal swab was taken on admission and again tested positive with a viral load of Ct = 31.3. Two repeat swabs were collected on 24 February, one was positive (Ct = 32.7) and the second was negative. On 25 February, a fourth diagnostic swab was again negative, but a separately collected swab yielded PCR products from RNA isolated to evaluate the amplicon system. Here, 30 separate amplicons were synthesized and then visualised by agarose gel electrophoresis (Figure 2A). The data indicated that all amplicons (F1&R1 to F30&R30) could be identified and spanned the SARS-CoV-2 genome. These amplicons were sequenced on an Oxford Nanopore flow cell using MinION and sequence reads were mapped to the SARS-CoV-2 genome (Figure 2B), confirming that the amplicons represented viral sequence.

(A) Agarose gel electrophoresis analysis of amplicons generated by RT-PCR from RNA isolated from a nasopharyngeal swab taken from patient REMRQ0001, who had coronavirus disease 2019 (COVID-19), and diagnosed positive for SARS-CoV-2 by a laboratory-based test. Primer pairs are indicated above each amplicon. (B) The amplicon products were purified and sequenced on a single flow cell using an Oxford Nanopore MinION. Shown are the number of reads that map (y-axis) to each amplicon across the SARS-CoV-2 genome from 5′ to 3′ (x-axis). (C) Agarose gel electrophoresis analysis of amplicons generated by RT-PCR from RNA isolated from a nasopharyngeal swab taken from patient REMRQ0001, who had COVID-19, and subsequently found negative for SARS-CoV-2 by a laboratory-based test. Note that the brightness of the image has been adjusted post-image capture to more clearly show amplicon products.

A final nasopharyngeal swab was taken from patient REMRQ0001 on 27 February. This latter swab was not used for a diagnostic assay but was used to evaluate the sensitivity of the amplicon-based system using RNA isolated from the sample. In this case, several amplicon products were identified, though fewer than on the previous sample (note that the brightness on the gel image was adjusted post-exposure to show these products more clearly) (Figure 2C). The products most clearly visible included amplicons from the polymerase (orf1ab) region (F1&R1, F3&R3, F7&R7, F12&R12, F15&R15, F16&R16, F20&R20) and the membrane (M) region (F26&R26). Other amplicon products were less visible including F28&R28, spanning the orf8/nucleoprotein gene sequence. Therefore, the data suggested that the viral genome or viral sequence was present in a patient that had been evaluated as negative.

The second patient (REMRQ0002) had onset of symptoms of COVID-19 on 22 February 2020 and was admitted to the Royal Liverpool University Hospital on 27 February 2020 after a diagnostic nasopharyngeal swab taken on 25 February tested positive for SARS-CoV-2. A repeat swab taken on the day of admission against tested positive for SARS-CoV-2. A further swab was taken for research the same day, and RNA isolated from this swab was used to evaluate the amplicon system. The amplicons were visualised by agarose gel electrophoresis (Figure 3A,B; same gel as shown in 3A but with the brightness enhanced post-image capture). These amplicons were sequenced on an Oxford Nanopore flow cell using MinION and sequence reads were mapped to the SARS-CoV-2 genome (Figure 3C), confirming that the amplicons represented viral sequence. There were some regions of the genome that had lower sequence read depth.

(A,B) Agarose gel electrophoresis analysis of amplicons generated by RT-PCR from RNA isolated from a nasopharyngeal swab taken from patient REMRQ0002, who had COVID-19, and diagnosed positive for SARS-CoV-2 by a laboratory-based test. Primer pairs are indicated above each amplicon. Note that the image in (B) is the same image as (A) but the brightness has been enhanced post-image capture in order to more clearly show amplicon products. (C) The amplicon products were purified and sequenced on a single flow cell using an Oxford Nanopore MinION. Shown are the number of reads that map (y-axis) to each amplicon across the SARS-CoV-2 genome from 5′ to 3′ (x-axis).

3.4. Development of a Multiplex Method to Generate Amplicons

In order to increase capacity and reduce complexity (the need to conduct individual PCRs for each amplicon primer set), primer sets were pooled into six groups (Table 1). These were arranged so that adjacent regions were not in the same pool and that there was coverage across the genome in each pool. This way, if a reaction failed, there would still be genome coverage in each amplicon group. The ability of this multiplex method to identify SARS-CoV-2 RNA in clinical material was evaluated by isolating RNA from nasopharyngeal swabs taken from patients. Total RNA from SARS-CoV-2 infected cells was also used as a positive control, and negative controls were also included. The six groups of primer sets were used to generate a mixture of amplicons in each pool for each patient and the positive control. These amplicons were visualised by agarose gel electrophoresis (exemplar data shown in Figure 4).

(A) Agarose gel electrophoresis analysis of amplicons generated by multiplex RT-PCR from RNA isolated from a nasopharyngeal swab taken from patients who had COVID-19 and diagnosed positive for SARS-CoV-2 by a laboratory-based test. Primer pairs are indicated above each amplicon and exemplar data from two patients (numbers 36 and 37) are shown. Note that amplicons from multiplex pool 1, for patient 36, is shown to the left as these were run on a separate gel. Also shown are negative controls and a positive control using RNA isolated from SARS-CoV-2 infected cells. (B,C) The amplicon products were purified, barcoded and sequenced on a single flow cell using an Oxford Nanopore MinION. Shown are the number of reads that map (y-axis) to each amplicon across the SARS-CoV-2 genome from 5′ to 3′ (x-axis).

3.5. Identification of Deletions in SARS-CoV-2 from Patients with COVID-19

Recombination is a major mechanism of genetic change in coronaviruses, and this will manifest in insertions and deletions being present in the SARS-CoV-2 genome [7]. For coronaviruses in general, this process is well characterised in cell culture and has been demonstrated in vivo for both animal coronaviruses [5] and also in humans for Middle East respiratory syndrome coronavirus (MERS-CoV) [14,15]. To assess whether and where these were occurring during SARS-CoV-2 replication in patients, RNA was extracted from nasopharyngeal swabs taken from 24 patients and amplicons generated using the RSLA multiplex approach and sequenced on Oxford Nanopore flow cells using either MinION and/or GridION. Sequence reads were mapped to the SARS-CoV-2 genome and putative deletions identified (Table 2). Interestingly, several patients had deletions, at the minor variant level, in the SARS-CoV-2 genome scored with high confidence, mainly in orf3a and orf7a, which are associated with interferon antagonism and would result in potential defective genomes being present.

Table 2.

Analysis of deletions in the SARS-CoV-2 genome in patients with COVID-19. The columns from left to right are as follows: sample barcode, deletion start position (bp), deletion end position (bp), number of reads supporting this deletion, quality score (similar to number of reads supporting, but also takes into account read mapping quality scores with a score greater than 10 having higher confidence), standard deviation (SD) of deletion span (bp) from supporting reads, SD of deletion position (bp) from supporting reads. If the deletion interrupts a gene, these are the coordinates of the gene, the gene name, and the bp overlap with the deletion. In cases where the deletion overlaps >1 gene, the information of the second gene is provided.

Deletion Information							Affected Gene Information
Patient Number	Start (bp)	End (bp)	Supporting Reads	Quality Score	SD Span	SD Pos	Gene Start	Gene End	Gene Name	Overlap (bp)	Gene2 Start	Gene2 End	Gene2 Name	Overlap2 (bp)
22	19325	19380	9	10	4.42	58.42	266	21555	Orf1ab	55	−	−	−	−
22	20294	20429	8	10	0	0	266	21555	Orf1ab	135	−	−	−	−
22	25417	25796	10	12	0.79	0.86	25393	26220	Orf3a	379	−	−	−	−
22	27578	27624	16	19	2.49	2.45	27394	27759	Orf7a	46	−	−	−	−
22	28756	28884	7	8	2.27	3.4	28274	29533	N	128	−	−	−	−
23	2143	2198	5	5	4.36	73.69	266	21555	Orf1ab	55	−	−	−	−
23	2375	2421	8	8	5.54	82.85	266	21555	Orf1ab	46	−	−	−	−
23	2589	2642	6	6	3.2	41.78	266	21555	Orf1ab	53	−	−	−	−
23	2654	2741	5	5	8.73	51.88	266	21555	Orf1ab	87	−	−	−	−
23	2859	2904	9	10	2.42	88.48	266	21555	Orf1ab	45	−	−	−	−
25	20274	20383	7	8	6.2	7.5	266	21555	Orf1ab	109	−	−	−	−
25	20279	20340	10	12	3.83	2.57	266	21555	Orf1ab	61	−	−	−	−
25	27594	27640	10	11	2.37	31.77	27394	27759	Orf7a	46	−	−	−	−
26	27386	27699	91	99	0.86	1.66	27394	27759	Orf7a	305	27202	27387	ORF6	1
26	20274	20338	9	11	1.5	1.02	266	21555	Orf1ab	64	−	−	−	−
27	27020	27073	11	13	0.6	0.3	26523	27191	M	53	−	−	−	−
27	27522	27761	24	29	1.09	2.04	27394	27759	Orf7a	237	−	−	−	−
27	27689	27763	23	27	3.58	29.9	27394	27759	Orf7a	70
28	2025	2088	5	5	7.16	80.94	266	21555	Orf1ab	63
28	25508	25568	6	6	8.31	61.45	25393	26220	Orf3a	60
28	27884	27934	6	6	7.19	27.43	27894	28259	Orf8	40
29	27546	27650	7	8	14.61	64.8	27394	27759	Orf7a	104
29	28454	28731	6	7	26.89	58.79	28274	29533	N	277
30	25398	25776	12	14	1.56	69.87	25393	26220	Orf3a	378
30	27555	27625	31	38	1.94	5.71	27394	27759	Orf7a	70
30	8676	8723	11	12	1.79	38.61	266	21555	Orf1ab	47
31	28444	28775	62	77	1.29	1.31	28274	29533	N	331
32	25480	25551	97	99	1.62	2.04	25393	26220	Orf3a	71
36	20274	20339	11	13	1.43	0.87	266	21555	Orf1ab	65
37	25429	25641	11	13	17	7.89	25393	26220	Orf3a	212
37	25432	25808	10	12	5.24	44.62	25393	26220	Orf3a	376
37	28444	28776	8	9	0.71	0.65	28274	29533	N	332
43	27426	27559	5	5	19.77	50.25	27394	27759	Orf7a	133
43	27690	27732	6	6	2.25	49.85	27394	27759	Orf7a	42
43	28011	28062	13	14	5.36	77.82	27894	28259	Orf8	51
43	28196	28238	5	5	2	96.73	27894	28259	Orf8	42
43	28481	28536	5	5	3	64.73	28274	29533	N	55
	28601	28718	5	5	4.82	43.76	28274	29533	N	117

Open in a new tab

4. Discussion

This work demonstrates that both amplicon-based detection and subsequent sequencing are feasible for identifying the SARS-CoV-2 genome or nucleic acid in samples from patients with COVID-19. The rationale is based on amplifying and sequencing longer fragments than comparative techniques, and our approach was named Rapid Sequencing Long Amplicons (RSLAs) to reflect this and its origins based on identifying MERS-CoV from viral subgenomic messenger RNAs present in clinical samples in Saudi Arabia. The amplicon system was evaluated first on RNA purified from cells infected with SARS-CoV-2 as a positive control and then on RNA purified from nasopharyngeal swabs from patients with COVID-19. We would note that this system best lends itself to identifying and sequencing SARS-CoV-2 from RNA that is of good quality. In general, the Ct value of these diagnostic leftover samples was not known. Different platforms are used in different settings and diagnostic laboratories in the UK, and there is no common uniformity. Interestingly, the RSLA system could identify nucleic acid corresponding to parts of the SARS-CoV-2 genome in a sample from a patient that had tested SARS-CoV-2 negative by an alternative diagnostic nucleic acid-based test performed on a separate sample. Provided that the primer binding sites remain conserved in the pathogen being tested, RT-qPCR is generally more sensitive for diagnostic purposes. In this case, diagnostic reagents for RT-qPCR can be reassessed based upon using sequencing as sentinel for these events. We would note that, based upon our comparison, multiple sites should be targeted within the genome to increase diagnostic sensitivity. Not all regions of the viral genome had equal sequence coverage, and this may be reflective of differential RNA stability in a clinical sample and the gradient of subgenomic messenger RNAs. Nevertheless, there was sufficient coverage to build consensus genomes.

The non-multiplexed RSLA approach has been compared to other viral genome sequencing protocols such as ARTIC and viral bait capture, using the Illumina MiSeq platform, further demonstrating the feasibility and flexibility of this method. Although RSLA generates less depth in comparison to other methods, the breadth of coverage tends to be more even [16]. RNA quality and viral load (Ct values) can influence sequence read depth and obtaining full genome coverage regardless of method. Previous studies on sequencing viral genomes and investigating depth of coverage and accuracy of base calling have shown that MinION sequencing is comparable to short-read sequencing platforms such as Illumina [17,18]. Long-read sequencing data has advantages in detecting insertions and deletions and allows linkage of potentially different sequence variations. These can be missed depending on the type of bioinformatic analysis in short-read data. Much needed information can be gained from sequencing of viral genomes in terms of viral adaptation [9], thus informing molecular epidemiological studies during outbreaks [18]. Current diagnostics may not remain fit for purpose due to recombination in coronaviruses (e.g., [5,19]). Interestingly, analysis of deletions that occurred in the SARS-CoV-2 genome in infected patients from this study identified several that mapped to genes encoding proteins involved in interferon and host antagonism. Deletions with high confidence scores were particularly situated in Orf3a but particularly in Orf7a. These genes are conserved with SARS-CoV [20]. ORF3A in SARS-CoV has been shown to activate the NLRP3 inflammasome [21], induce NF-κB activation and chemokine production [22] and may act as a viroporin. Data from SARS-CoV suggested that ORF7A acts as an RNA silencing suppressor (RSS) and therefore is a suppressor of siRNA activity in mammalian cells [23]. The presence of deletion variants of the genome may also act as defective interfering genomes within patients. Such deletion variants may account for the transient emergence of isolates of SARS-CoV-2 that are associated with the milder disease in patients [24].

While most analysis of genome variation in viruses with RNA genomes is focused on nucleotide substitution, it is recombination in coronaviruses that is one of the big drivers of adaptation and change. A novel genotype of human coronavirus OC43 has been shown to emerge through recombination [25]. Indeed, recombination in coronaviruses can occur between highly divergent coronaviruses of the same strain [26] and even between coronaviruses that infect different species [27], as has been proposed for the origin of these severe coronaviruses in humans. Therefore, we would caution that any surveillance and sequencing pipeline is able to identify recombination and monitor for recombination between SARS-CoV-2 and other coronaviruses—particularly human ones.

Acknowledgments

We would like to acknowledge the research teams at the hospital sites who gave their time and effort for this study. We would particularly like to thank the patients who consented to having several additional samples collected over the course of this research. We would like to thank the laboratory staff for their continuous work during the COVID-19 pandemic and the University of Liverpool for maintaining these facilities.

Author Contributions

Conceptualization: M.W.C., M.G.S., J.K.B., P.J.M.O., J.D., L.T. and J.A.H. Methodology: S.C.M., R.P.-R., L.T., M.A. (Muhannad Alruwaili), X.D., D.W.C., N.R., S.H., Z.M., D.A.M., R.V., W.A., S.T.P. and A.D.D. Software: R.P.-R., X.D., S.H. and D.A.M. Validation: S.C.M., R.P.-R., S.A. and M.A. (Muhannad Alruwaili) Formal analysis and investigation: K.B., S.C.M., R.P.-R., M.A. (Mai Almsaud), X.D., S.T.P., A.D., D.B., D.W.C., D.P.C., J.K.B., Q.Z., Y.S., C.H., N.R., S.A., C.H., J.D., A.A., M.A. (Muhannad Alruwaili), E.B., J.C. (Jordan Clark), I.G.-D., P.G., X.H., B.J., L.L., P.S., G.S., J.D. and J.A.H. Resources: S.T.P., D.P.C., J.K.B., J.S., J.D., M.B., J.C. (James Cruise), L.T., L.N., L.R., D.A.M., A.D.D. and J.D. Data Curation: R.P.-R., S.H., and X.D. Writing—Original Draft: S.C.M., R.P.-R., M.A. (Muhannad Alruwaili), L.T. and J.A.H. Writing—Review & Editing: All authors. Visualization: R.P.-R., X.D. and M.A. (Muhannad Alruwaili) Supervision: J.A.H., L.T., A.D., L.F.P.N., M.W.C., J.P.S., E.-m.Z. and W.A. Funding acquisition: J.A.H., M.W.C., W.A., T.S., L.T., J.K.B., P.J.M.O. and M.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the United States Food and Drug Administration grant number HHSF223201510104C ‘Ebola Virus Disease: correlates of protection, determinants of outcome and clinical management’ amended to incorporate urgent COVID-19 studies. Awarded to J.A.H., A.D., D.A.M. and M.W.C. Recruitment, sample acquisition, transport, laboratory and management costs were supported by the Medical Research Council “Protocol for Severe Emerging Infection” grant number MC_PC_19025 awarded to J.K.B., M.G.S., and P.J.M.O. This work was supported by Research Center, King Fahad Medical City, Saudi Arabia grant number 019-003 ‘Elucidating the viral biology of MERS-CoV and the host response using high resolution sequencing’. Awarded to W.A. This work was supported by the Medical Research Council Discovery Medicine North Doctoral Training Partnership and directly funded the studentship of R.P.-R.This work was supported by the National Institute for Health Research Health Protection Research Unit (NIHR HPRU) in Emerging and Zoonotic Infections at University of Liverpool in partnership with Public Health England (PHE), in collaboration with Liverpool School of Tropical Medicine. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR, the Department of Health or Public Health England. Awarded to T.S. This work is also supported by the Centre of Excellence in Infectious Diseases Research (CEIDR) and the Alder Hey Charity awarded to J.A.H. and L.T. We also acknowledge support of Liverpool Health Partners and the Liverpool-Malawi-Covid-19 Consortium. L.T. is a Wellcome Trust clinical career development fellow, supported by grant number 205228/Z/16/Z.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Andersen K.G., Rambaut A., Lipkin W.I., Holmes E.C., Garry R.F. The proximal origin of SARS-CoV-2. Nat. Med. 2020;26:450–452. doi: 10.1038/s41591-020-0820-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Chen N., Zhou M., Dong X., Qu J., Gong F., Han Y., Qiu Y., Wang J., Liu Y., Wei Y., et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in SARS-CoV-2, China: A descriptive study. Lancet. 2020;395:507–513. doi: 10.1016/S0140-6736(20)30211-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Docherty A.B., Harrison E.M., Green C.A., Hardwick H.E., Pius R., Norman L., Holden K.A., Read J.M., Dondelinger F., Carson G., et al. Features of 20 133 UK patients in hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: Prospective observational cohort study. BMJ. 2020;369:m1985. doi: 10.1136/bmj.m1985. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Knight S.R., Ho A., Pius R., Buchan I., Carson G., Drake T.M., Dunning J., Fairfield C.J., Gamble C., Green C.A., et al. Risk stratification of patients admitted to hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: Development and validation of the 4C Mortality Score. BMJ. 2020;370:m3339. doi: 10.1136/bmj.m3339. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Kottier S.A., Cavanagh D., Britton P. Experimental evidence of recombination in coronavirus infectious bronchitis virus. Virology. 1995;213:569–580. doi: 10.1006/viro.1995.0029. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Coutard B., Valle C., de Lamballerie X., Canard B., Seidah N.G., Decroly E. The spike glycoprotein of the new coronavirus 2019-nCoV contains a furin-like cleavage site absent in CoV of the same clade. Antivir. Res. 2020;176:104742. doi: 10.1016/j.antiviral.2020.104742. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Davidson A.D., Williamson M.K., Lewis S., Shoemark D., Carroll M.W., Heesom K.J., Zambon M., Ellis J., Lewis P.A., Hiscox J.A., et al. Characterisation of the transcriptome and proteome of SARS-CoV-2 reveals a cell passage induced in-frame deletion of the furin-like cleavage site from the spike glycoprotein. Genome Med. 2020;12:68. doi: 10.1186/s13073-020-00763-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Cavanagh D. Severe acute respiratory syndrome vaccine development: Experiences of vaccination against avian infectious bronchitis coronavirus. Avian Pathol. 2003;32:567–582. doi: 10.1080/03079450310001621198. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Carroll M.W., Matthews D.A., Hiscox J.A., Elmore M.J., Pollakis G., Rambaut A., Hewson R., Garcia-Dorival I., Bore J.A., Koundouno R., et al. Temporal and spatial analysis of the 2014–2015 Ebola virus outbreak in West Africa. Nature. 2015;524:97–101. doi: 10.1038/nature14594. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Quick J., Loman N.J., Duraffour S., Simpson J.T., Severi E., Cowley L., Bore J.A., Koundouno R., Dudas G., Mikhail A., et al. Real-time, portable genome sequencing for Ebola surveillance. Nature. 2016;530:228–232. doi: 10.1038/nature16996. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Pini A., Zomahoun D., Duraffour S., Derrough T., Charles M., Quick J., Loman N., Cowley L., Leno M., Ouedraogo N., et al. Field investigation with real-time virus genetic characterisation support of a cluster of Ebola virus disease cases in Dubreka, Guinea, April to June 2015. Eurosurveillance. 2018;23:17-00140. doi: 10.2807/1560-7917.ES.2018.23.12.17-00140. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Kato H., Shimizu H., Shibue Y., Hosoda T., Iwabuchi K., Nagamine K., Saito H., Sawada R., Oishi T., Tsukiji J., et al. Clinical course of 2019 novel coronavirus disease (COVID-19) in individuals present during the outbreak on the Diamond Princess cruise ship. J. Infect. Chemother. 2020;26:865–869. doi: 10.1016/j.jiac.2020.05.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Russell T.W., Hellewell J., Jarvis C.I., van Zandvoort K., Abbott S., Ratnayake R., CMMID COVID-19 working group. Flasche S., Eggo R.M., Edmunds W.J., et al. Estimating the infection and case fatality ratio for coronavirus disease (COVID-19) using age-adjusted data from the outbreak on the Diamond Princess cruise ship, February 2020. Eurosurveillance. 2020;25:2000256. doi: 10.2807/1560-7917.ES.2020.25.12.2000256. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Sabir J.S., Lam T.T., Ahmed M.M., Li L., Shen Y., Abo-Aba S.E., Qureshi M.I., Abu-Zeid M., Zhang Y., Khiyami M.A., et al. Co-circulation of three camel coronavirus species and recombination of MERS-CoVs in Saudi Arabia. Science. 2016;351:81–84. doi: 10.1126/science.aac8608. [DOI] [PubMed] [Google Scholar]
15.Kim J.I., Kim Y.J., Lemey P., Lee I., Park S., Bae J.Y., Kim D., Kim H., Jang S.I., Yang J.S., et al. The recent ancestry of Middle East respiratory syndrome coronavirus in Korea has been shaped by recombination. Sci. Rep. 2016;6:18825. doi: 10.1038/srep18825. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Nasir J.A., Kozak R.A., Aftanas P., Raphenya A.R., Smith K.M., Maguire F., Maan H., Alruwaili M., Banerjee A., Mbareche H., et al. A Comparison of Whole Genome Sequencing of SARS-CoV-2 Using Amplicon-Based Sequencing, Random Hexamers, and Bait Capture. Viruses. 2020;12:895. doi: 10.3390/v12080895. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Kafetzopoulou L.E., Efthymiadis K., Lewandowski K., Crook A., Carter D., Osborne J., Aarons E., Hewson R., Hiscox J.A., Carroll M.W., et al. Assessment of metagenomic Nanopore and Illumina sequencing for recovering whole genome sequences of chikungunya and dengue viruses directly from clinical samples. Eurosurveillance. 2018;23:1800228. doi: 10.2807/1560-7917.ES.2018.23.50.1800228. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Dudas G., Carvalho L.M., Bedford T., Tatem A.J., Baele G., Faria N.R., Park D.J., Ladner J.T., Arias A., Asogun D., et al. Virus genomes reveal factors that spread and sustained the Ebola epidemic. Nature. 2017;544:309–315. doi: 10.1038/nature22040. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.So R.T.Y., Chu D.K.W., Miguel E., Perera R., Oladipo J.O., Fassi-Fihri O., Aylet G., Ko R.L.W., Zhou Z., Cheng M.S., et al. Diversity of Dromedary Camel Coronavirus HKU23 in African Camels Revealed Multiple Recombination Events among Closely Related Betacoronaviruses of the Subgenus Embecovirus. J. Virol. 2019;93 doi: 10.1128/JVI.01236-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Wu A., Peng Y., Huang B., Ding X., Wang X., Niu P., Meng J., Zhu Z., Zhang Z., Wang J., et al. Genome Composition and Divergence of the Novel Coronavirus (2019-nCoV) Originating in China. Cell Host Microbe. 2020;27:325–328. doi: 10.1016/j.chom.2020.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Siu K.L., Yuen K.S., Castano-Rodriguez C., Ye Z.W., Yeung M.L., Fung S.Y., Yuan S., Chan C.P., Yuen K.Y., Enjuanes L., et al. Severe acute respiratory syndrome coronavirus ORF3a protein activates the NLRP3 inflammasome by promoting TRAF3-dependent ubiquitination of ASC. FASEB J. 2019;33:8865–8877. doi: 10.1096/fj.201802418R. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Kanzawa N., Nishigaki K., Hayashi T., Ishii Y., Furukawa S., Niiro A., Yasui F., Kohara M., Morita K., Matsushima K., et al. Augmentation of chemokine production by severe acute respiratory syndrome coronavirus 3a/X1 and 7a/X4 proteins through NF-kappaB activation. FEBS Lett. 2006;580:6807–6812. doi: 10.1016/j.febslet.2006.11.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Karjee S., Minhas A., Sood V., Ponia S.S., Banerjea A.C., Chow V.T., Mukherjee S.K., Lal S.K. The 7a accessory protein of severe acute respiratory syndrome coronavirus acts as an RNA silencing suppressor. J. Virol. 2010;84:10395–10401. doi: 10.1128/JVI.00748-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Young B.E., Fong S.W., Chan Y.H., Mak T.M., Ang L.W., Anderson D.E., Lee C.Y., Amrun S.N., Lee B., Goh Y.S., et al. Effects of a major deletion in the SARS-CoV-2 genome on the severity of infection and the inflammatory response: An observational cohort study. Lancet. 2020;396:603–611. doi: 10.1016/S0140-6736(20)31757-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Zhang Y., Li J., Xiao Y., Zhang J., Wang Y., Chen L., Paranhos-Baccala G., Ren L., Wang J. Genotype shift in human coronavirus OC43 and emergence of a novel genotype by natural recombination. J. Infect. 2015;70:641–650. doi: 10.1016/j.jinf.2014.12.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Hewson K.A., Noormohammadi A.H., Devlin J.M., Browning G.F., Schultz B.K., Ignjatovic J. Evaluation of a novel strain of infectious bronchitis virus emerged as a result of spike gene recombination between two highly diverged parent strains. Avian Pathol. 2014;43:249–257. doi: 10.1080/03079457.2014.914624. [DOI] [PubMed] [Google Scholar]
27.Terada Y., Matsui N., Noguchi K., Kuwata R., Shimoda H., Soma T., Mochizuki M., Maeda K. Emergence of pathogenic coronaviruses in cats by homologous recombination between feline and canine coronaviruses. PLoS ONE. 2014;9:e106534. doi: 10.1371/journal.pone.0106534. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B1-viruses-12-01164] 1.Andersen K.G., Rambaut A., Lipkin W.I., Holmes E.C., Garry R.F. The proximal origin of SARS-CoV-2. Nat. Med. 2020;26:450–452. doi: 10.1038/s41591-020-0820-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B2-viruses-12-01164] 2.Chen N., Zhou M., Dong X., Qu J., Gong F., Han Y., Qiu Y., Wang J., Liu Y., Wei Y., et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in SARS-CoV-2, China: A descriptive study. Lancet. 2020;395:507–513. doi: 10.1016/S0140-6736(20)30211-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B3-viruses-12-01164] 3.Docherty A.B., Harrison E.M., Green C.A., Hardwick H.E., Pius R., Norman L., Holden K.A., Read J.M., Dondelinger F., Carson G., et al. Features of 20 133 UK patients in hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: Prospective observational cohort study. BMJ. 2020;369:m1985. doi: 10.1136/bmj.m1985. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B4-viruses-12-01164] 4.Knight S.R., Ho A., Pius R., Buchan I., Carson G., Drake T.M., Dunning J., Fairfield C.J., Gamble C., Green C.A., et al. Risk stratification of patients admitted to hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: Development and validation of the 4C Mortality Score. BMJ. 2020;370:m3339. doi: 10.1136/bmj.m3339. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B5-viruses-12-01164] 5.Kottier S.A., Cavanagh D., Britton P. Experimental evidence of recombination in coronavirus infectious bronchitis virus. Virology. 1995;213:569–580. doi: 10.1006/viro.1995.0029. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B6-viruses-12-01164] 6.Coutard B., Valle C., de Lamballerie X., Canard B., Seidah N.G., Decroly E. The spike glycoprotein of the new coronavirus 2019-nCoV contains a furin-like cleavage site absent in CoV of the same clade. Antivir. Res. 2020;176:104742. doi: 10.1016/j.antiviral.2020.104742. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B7-viruses-12-01164] 7.Davidson A.D., Williamson M.K., Lewis S., Shoemark D., Carroll M.W., Heesom K.J., Zambon M., Ellis J., Lewis P.A., Hiscox J.A., et al. Characterisation of the transcriptome and proteome of SARS-CoV-2 reveals a cell passage induced in-frame deletion of the furin-like cleavage site from the spike glycoprotein. Genome Med. 2020;12:68. doi: 10.1186/s13073-020-00763-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B8-viruses-12-01164] 8.Cavanagh D. Severe acute respiratory syndrome vaccine development: Experiences of vaccination against avian infectious bronchitis coronavirus. Avian Pathol. 2003;32:567–582. doi: 10.1080/03079450310001621198. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B9-viruses-12-01164] 9.Carroll M.W., Matthews D.A., Hiscox J.A., Elmore M.J., Pollakis G., Rambaut A., Hewson R., Garcia-Dorival I., Bore J.A., Koundouno R., et al. Temporal and spatial analysis of the 2014–2015 Ebola virus outbreak in West Africa. Nature. 2015;524:97–101. doi: 10.1038/nature14594. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B10-viruses-12-01164] 10.Quick J., Loman N.J., Duraffour S., Simpson J.T., Severi E., Cowley L., Bore J.A., Koundouno R., Dudas G., Mikhail A., et al. Real-time, portable genome sequencing for Ebola surveillance. Nature. 2016;530:228–232. doi: 10.1038/nature16996. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11-viruses-12-01164] 11.Pini A., Zomahoun D., Duraffour S., Derrough T., Charles M., Quick J., Loman N., Cowley L., Leno M., Ouedraogo N., et al. Field investigation with real-time virus genetic characterisation support of a cluster of Ebola virus disease cases in Dubreka, Guinea, April to June 2015. Eurosurveillance. 2018;23:17-00140. doi: 10.2807/1560-7917.ES.2018.23.12.17-00140. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B12-viruses-12-01164] 12.Kato H., Shimizu H., Shibue Y., Hosoda T., Iwabuchi K., Nagamine K., Saito H., Sawada R., Oishi T., Tsukiji J., et al. Clinical course of 2019 novel coronavirus disease (COVID-19) in individuals present during the outbreak on the Diamond Princess cruise ship. J. Infect. Chemother. 2020;26:865–869. doi: 10.1016/j.jiac.2020.05.005. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B13-viruses-12-01164] 13.Russell T.W., Hellewell J., Jarvis C.I., van Zandvoort K., Abbott S., Ratnayake R., CMMID COVID-19 working group. Flasche S., Eggo R.M., Edmunds W.J., et al. Estimating the infection and case fatality ratio for coronavirus disease (COVID-19) using age-adjusted data from the outbreak on the Diamond Princess cruise ship, February 2020. Eurosurveillance. 2020;25:2000256. doi: 10.2807/1560-7917.ES.2020.25.12.2000256. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B14-viruses-12-01164] 14.Sabir J.S., Lam T.T., Ahmed M.M., Li L., Shen Y., Abo-Aba S.E., Qureshi M.I., Abu-Zeid M., Zhang Y., Khiyami M.A., et al. Co-circulation of three camel coronavirus species and recombination of MERS-CoVs in Saudi Arabia. Science. 2016;351:81–84. doi: 10.1126/science.aac8608. [DOI] [PubMed] [Google Scholar]

[B15-viruses-12-01164] 15.Kim J.I., Kim Y.J., Lemey P., Lee I., Park S., Bae J.Y., Kim D., Kim H., Jang S.I., Yang J.S., et al. The recent ancestry of Middle East respiratory syndrome coronavirus in Korea has been shaped by recombination. Sci. Rep. 2016;6:18825. doi: 10.1038/srep18825. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B16-viruses-12-01164] 16.Nasir J.A., Kozak R.A., Aftanas P., Raphenya A.R., Smith K.M., Maguire F., Maan H., Alruwaili M., Banerjee A., Mbareche H., et al. A Comparison of Whole Genome Sequencing of SARS-CoV-2 Using Amplicon-Based Sequencing, Random Hexamers, and Bait Capture. Viruses. 2020;12:895. doi: 10.3390/v12080895. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B17-viruses-12-01164] 17.Kafetzopoulou L.E., Efthymiadis K., Lewandowski K., Crook A., Carter D., Osborne J., Aarons E., Hewson R., Hiscox J.A., Carroll M.W., et al. Assessment of metagenomic Nanopore and Illumina sequencing for recovering whole genome sequences of chikungunya and dengue viruses directly from clinical samples. Eurosurveillance. 2018;23:1800228. doi: 10.2807/1560-7917.ES.2018.23.50.1800228. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B18-viruses-12-01164] 18.Dudas G., Carvalho L.M., Bedford T., Tatem A.J., Baele G., Faria N.R., Park D.J., Ladner J.T., Arias A., Asogun D., et al. Virus genomes reveal factors that spread and sustained the Ebola epidemic. Nature. 2017;544:309–315. doi: 10.1038/nature22040. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B19-viruses-12-01164] 19.So R.T.Y., Chu D.K.W., Miguel E., Perera R., Oladipo J.O., Fassi-Fihri O., Aylet G., Ko R.L.W., Zhou Z., Cheng M.S., et al. Diversity of Dromedary Camel Coronavirus HKU23 in African Camels Revealed Multiple Recombination Events among Closely Related Betacoronaviruses of the Subgenus Embecovirus. J. Virol. 2019;93 doi: 10.1128/JVI.01236-19. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B20-viruses-12-01164] 20.Wu A., Peng Y., Huang B., Ding X., Wang X., Niu P., Meng J., Zhu Z., Zhang Z., Wang J., et al. Genome Composition and Divergence of the Novel Coronavirus (2019-nCoV) Originating in China. Cell Host Microbe. 2020;27:325–328. doi: 10.1016/j.chom.2020.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B21-viruses-12-01164] 21.Siu K.L., Yuen K.S., Castano-Rodriguez C., Ye Z.W., Yeung M.L., Fung S.Y., Yuan S., Chan C.P., Yuen K.Y., Enjuanes L., et al. Severe acute respiratory syndrome coronavirus ORF3a protein activates the NLRP3 inflammasome by promoting TRAF3-dependent ubiquitination of ASC. FASEB J. 2019;33:8865–8877. doi: 10.1096/fj.201802418R. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B22-viruses-12-01164] 22.Kanzawa N., Nishigaki K., Hayashi T., Ishii Y., Furukawa S., Niiro A., Yasui F., Kohara M., Morita K., Matsushima K., et al. Augmentation of chemokine production by severe acute respiratory syndrome coronavirus 3a/X1 and 7a/X4 proteins through NF-kappaB activation. FEBS Lett. 2006;580:6807–6812. doi: 10.1016/j.febslet.2006.11.046. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B23-viruses-12-01164] 23.Karjee S., Minhas A., Sood V., Ponia S.S., Banerjea A.C., Chow V.T., Mukherjee S.K., Lal S.K. The 7a accessory protein of severe acute respiratory syndrome coronavirus acts as an RNA silencing suppressor. J. Virol. 2010;84:10395–10401. doi: 10.1128/JVI.00748-10. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B24-viruses-12-01164] 24.Young B.E., Fong S.W., Chan Y.H., Mak T.M., Ang L.W., Anderson D.E., Lee C.Y., Amrun S.N., Lee B., Goh Y.S., et al. Effects of a major deletion in the SARS-CoV-2 genome on the severity of infection and the inflammatory response: An observational cohort study. Lancet. 2020;396:603–611. doi: 10.1016/S0140-6736(20)31757-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B25-viruses-12-01164] 25.Zhang Y., Li J., Xiao Y., Zhang J., Wang Y., Chen L., Paranhos-Baccala G., Ren L., Wang J. Genotype shift in human coronavirus OC43 and emergence of a novel genotype by natural recombination. J. Infect. 2015;70:641–650. doi: 10.1016/j.jinf.2014.12.005. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B26-viruses-12-01164] 26.Hewson K.A., Noormohammadi A.H., Devlin J.M., Browning G.F., Schultz B.K., Ignjatovic J. Evaluation of a novel strain of infectious bronchitis virus emerged as a result of spike gene recombination between two highly diverged parent strains. Avian Pathol. 2014;43:249–257. doi: 10.1080/03079457.2014.914624. [DOI] [PubMed] [Google Scholar]

[B27-viruses-12-01164] 27.Terada Y., Matsui N., Noguchi K., Kuwata R., Shimoda H., Soma T., Mochizuki M., Maeda K. Emergence of pathogenic coronaviruses in cats by homologous recombination between feline and canine coronaviruses. PLoS ONE. 2014;9:e106534. doi: 10.1371/journal.pone.0106534. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Amplicon-Based Detection and Sequencing of SARS-CoV-2 in Nasopharyngeal Swabs from Patients With COVID-19 and Identification of Deletions in the Viral Genome That Encode Proteins Involved in Interferon Antagonism

Shona C Moore

Rebekah Penrice-Randal

Muhannad Alruwaili

Nadine Randle

Stuart Armstrong

Catherine Hartley

Sam Haldenby

Xiaofeng Dong

Abdulrahman Alrezaihi

Mai Almsaud

Eleanor Bentley

Jordan Clark

Isabel García-Dorival

Paul Gilmore

Ximeng Han

Benjamin Jones

Lisa Luu

Parul Sharma

Ghada Shawli

Yani Sun

Qin Zhao

Steven T Pullan

Daniel P Carter

Kevin Bewley

Jake Dunning

En-min Zhou

Tom Solomon

Michael Beadsworth

James Cruise

Derrick W Crook

David A Matthews

Andrew D Davidson

Zana Mahmood

Waleed Aljabr

Julian Druce

Richard Vipond

Lisa Ng

Laurent Renia

Peter J M Openshaw

J Kenneth Baillie

Miles W Carroll

James Stewart

Alistair Darby

Malcolm Semple

Lance Turtle

Julian A Hiscox

Abstract

1. Introduction

2. Materials and Methods

2.1. Ethics and Clinical Information

2.2. RNA Extraction and Preparation

2.3. Primer Design

Table 1.

2.4. RT and PCR

2.5. Library Preparation for MinION Sequencing

2.6. Informatics Analysis

3. Results

3.1. Primer and Amplicon Design

Figure 1.

3.2. Validation of Amplicon Generation and Oxford Nanopore Sequencing Using RNA from Cells Infected with SARS-CoV-2

3.3. Initial Evaluation of the Amplicon Approach to Detect SARS-CoV-2 on RNA Purified from Nasopharyngeal Swabs Collected from Patients with COVID-19 Correctly Identified a Positive but also a False Negative Sample

Figure 2.

Figure 3.

3.4. Development of a Multiplex Method to Generate Amplicons

Figure 4.

3.5. Identification of Deletions in SARS-CoV-2 from Patients with COVID-19

Table 2.

4. Discussion

Acknowledgments

Author Contributions

Funding

Conflicts of Interest

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles