TABLE 1.
Protein genome position (nt) a | Trivial name construct expressed | Size (aa) | Boundaries | MW (kDa) | Homol. SCoV (%) b | Template PDB c | SCoV2 PDB d |
---|---|---|---|---|---|---|---|
nsp1 | Leader | 180 | 19.8 | 84 | |||
266–805 | |||||||
Full-length | 180 | 1–180 | 19.8 | 83 | |||
Globular domain (GD) | 116 | 13–127 | 12.7 | 85 | 2GDT | 7K7P | |
nsp2 | 638 | 70.5 | 68 | ||||
806–2,719 | |||||||
C-terminal IDR (CtDR) | 45 | 557–601 | 4.9 | 55 | |||
nsp3 | 1,945 | 217.3 | 76 | ||||
2,720–8,554 | |||||||
a | Ub-like (Ubl) domain | 111 | 1–111 | 12.4 | 79 | 2IDY | 7KAG |
a | Ub-like (Ubl) domain + IDR | 206 | 1–206 | 23.2 | 58 | ||
b | Macrodomain | 170 | 207–376 | 18.3 | 74 | 6VXS | 6VXS |
c | SUD-N | 140 | 409–548 | 15.5 | 69 | 2W2G | |
c | SUD-NM | 267 | 409–675 | 29.6 | 74 | 2W2G | |
c | SUD-M | 125 | 551–675 | 14.2 | 82 | 2W2G | |
c | SUD-MC | 195 | 551–743 | 21.9 | 79 | 2KQV | |
c | SUD-C | 64 | 680–743 | 7.4 | 73 | 2KAF | |
d | Papain-like protease PLpro | 318 | 743–1,060 | 36 | 83 | 6W9C | 6W9C |
e | NAB | 116 | 1,088–1,203 | 13.4 | 87 | 2K87 | |
Y | CoV-Y | 308 | 1,638–1,945 | 34 | 89 | ||
nsp5 | Main protease (Mpro) | 306 | 33.7 | 96 | |||
10,055–10,972 | |||||||
Full-length e | 306 | 1–306 | 33.7 | 96 | 6Y84 | 6Y84 | |
nsp7 | 83 | 9.2 | 99 | ||||
11,843–12,091 | |||||||
Full-length | 83 | 1–83 | 9.2 | 99 | 6WIQ | 6WIQ | |
nsp8 | 198 | 21.9 | 98 | ||||
12,092–12,685 | |||||||
Full-length | 198 | 1–198 | 21.9 | 97 | 6WIQ | 6WIQ | |
nsp9 | 113 | 12.4 | 97 | ||||
12,686–13,024 | |||||||
Full-length | 113 | 1–113 | 12.4 | 97 | 6W4B | 6W4B | |
nsp10 | 139 | 14.8 | 97 | ||||
13,025–13,441 | |||||||
Full-length | 139 | 1–139 | 14.8 | 97 | 6W4H | 6W4H | |
nsp13 | Helicase | 601 | 66.9 | 100 | |||
16,237–18,039 | |||||||
Full-length | 601 | 1–601 | 66.9 | 100 | 6ZSL | 6ZSL | |
nsp14 | Exonuclease/methyltransferase | 527 | 59.8 | 95 | |||
18,040–19,620 | |||||||
Full-length | 527 | 1–527 | 59.8 | 95 | 5NFY | ||
MTase domain | 240 | 288–527 | 27.5 | 95 | |||
nsp15 | Endonuclease | 346 | 38.8 | 89 | |||
19,621–20,658 | |||||||
Full-length | 346 | 1–346 | 38.8 | 89 | 6W01 | 6W01 | |
nsp16 | Methyltransferase | 298 | 33.3 | 93 | |||
20,659–21,552 | |||||||
Full-length | 298 | 1–298 | 33.3 | 93 | 6W4H | 6W4H | |
ORF3a | 275 | 31.3 | 72 | ||||
25,393–26,220 | |||||||
Full-length | 275 | 1–275 | 31.3 | 72 | 6XDC | 6XDC | |
ORF4 | Envelope (E) protein | 75 | 8.4 | 95 | |||
26,245–26,472 | |||||||
Full-length | 75 | 1–75 | 8.4 | 95 | 5X29 | 7K3G | |
ORF5 | Membrane glycoprotein (M) | 222 | 25.1 | 91 | |||
26,523–27,387 | |||||||
Full-length | 222 | 1–222 | 25.1 | 91 | |||
ORF6 | 61 | 7.3 | 69 | ||||
27,202–27,387 | |||||||
Full-length | 61 | 1–61 | 7.3 | 69 | |||
ORF7a | 121 | 13.7 | 85 | ||||
27,394–27,759 | |||||||
Ectodomain (ED) | 66 | 16–81 | 7.4 | 85 | 1XAK | 6W37 | |
ORF7b | 43 | 5.2 | 85 | ||||
27,756–27,887 | |||||||
Full-length | 43 | 1–43 | 5.2 | 85 | |||
ORF8 | 121 | 13.8 | 32 | ||||
27,894–28,259 | |||||||
ORF8 | Full-length | 121 | 1–121 | 13.8 | 32 | ||
ΔORF8 | w/o signal peptide | 106 | 16–121 | 12 | 41 | 7JTL | 7JTL |
ORF9a | Nucleocapsid (N) | 419 | 45.6 | 91 | |||
28,274–29,533 | |||||||
IDR1-NTD-IDR2 | 248 | 1–248 | 26.5 | 90 | |||
NTD-SR | 169 | 44–212 | 18.1 | 92 | |||
NTD | 136 | 44–180 | 14.9 | 93 | 6YI3 | 6YI3 | |
CTD | 118 | 247–364 | 13.3 | 96 | 2JW8 | 7C22 | |
ORF9b | 97 | 10.8 | 72 | ||||
28,284–28,574 | |||||||
Full-length | 97 | 1–97 | 10.8 | 72 | 6Z4U | 6Z4U | |
ORF14 | 73 | 8 | n.a | ||||
28,734–28,952 | |||||||
Full-length | 73 | 1–73 | 8 | n.a | |||
ORF10 | 38 | 4.4 | 29 | ||||
29,558–29,674 | |||||||
Full-length | 38 | 1–38 | 4.4 | 29 |
Genome position in nt corresponding to SCoV2 NCBI reference genome entry NC_045512.2, identical to GenBank entry MN908947.3.
Sequence identities to SCoV are calculated from an alignment with corresponding protein sequences based on the genome sequence of NCBI Reference NC_004718.3.
Representative PDB that was available at the beginning of construct design, either SCoV or SCoV2.
Representative PDB available for SCoV2 (as of December 2020).
Additional point mutations in fl-construct have been expressed.
n.a.: not applicable.