Table 4.
Sequence identity and similarity between SARS CoV-2 proteins and SARS CoV proteins, determined with LALIGN (17) (see Supporting Information).
| Entry | Protein | Amino acid overlap | Sequence identity | Sequence similarity |
|---|---|---|---|---|
| 1 | NSP1 | 180 | 84.4% | 93.4% |
| 2 | NSP2 | 638 | 68.3% | 90.0% |
| 3 | NSP3 | 1,952 | 76.0% | 91.8% |
| 4 | NSP4 | 500 | 80.0% | 95.0% |
| 5 | NSP5 | 306 | 96.1% | 99.7% |
| 6 | NSP6 | 287 | 88.2% | 98.3% |
| 7 | NSP7 | 83 | 98.8% | 100.0% |
| 8 | NSP8 | 198 | 97.5% | 100.0% |
| 9 | NSP9 | 113 | 97.3% | 99.1% |
| 10 | NSP10 | 139 | 97.1% | 99.3% |
| 11 | NSP11 | 13 | 84.6% | 100.0% |
| 12 | NSP12 | 932 | 96.4% | 99.4% |
| 13 | NSP13 | 601 | 99.8% | 100.0% |
| 14 | NSP14 | 527 | 95.1% | 99.1% |
| 15 | NSP15 | 346 | 88.7% | 97.7% |
| 16 | NSP16 | 298 | 93.3% | 99.0% |
| 17 | S protein | 1,277 | 76.0% | 91.5% |
| 18 | ORF3a | 1,381 | 72.4% | 90.2% |
| 19 | E Protein | 76 | 94.7% | 97.4% |
| 20 | M Protein | 222 | 90.5% | 98.2% |
| 21 | ORF6 | 61 | 68.9% | 93.4% |
| 22 | ORF7a | 122 | 85.2% | 95.9% |
| 23 | ORF7b | 41 | 85.4% | 92.7% |
| 24a | (ORF8 vs 8a)ᵃ | 41 | 31.7% | 70.7% |
| 24b | (ORF8 vs 8b)ᵃ | 42 | 40.5% | 66.7% |
| 25 | N Protein | 422 | 90.5% | 97.2% |
| 26 | (ORF10 vs 9b)ᵃ | 21 | 28.6% | 52.4% |

ᵃ SARS CoV-2 protein vs SARS CoV protein. Other reports give amino acid sequence identities obtained with different algorithms (3,67).
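
To make the three numeric columns concrete, the following is a minimal Python sketch of how an overlap length, a percent identity, and a percent similarity can be derived from a pair of already-aligned sequences. It is not LALIGN's implementation. The function name `alignment_stats`, the `SIMILAR_GROUPS` table, and the toy sequences are hypothetical and chosen only for illustration. Two assumptions are made: residues count as "similar" when they share one of a few hand-picked physicochemical groups (LALIGN scores similarity with a substitution matrix instead), and gap columns are left out of the overlap, which may differ from how LALIGN counts them.

```python
# Hypothetical conservative-substitution groups, for illustration only.
# Assumption: this is NOT the scoring scheme LALIGN uses.
SIMILAR_GROUPS = [
    set("ILVM"),  # aliphatic / hydrophobic
    set("FYW"),   # aromatic
    set("KRH"),   # basic
    set("DE"),    # acidic
    set("NQ"),    # amide
    set("ST"),    # hydroxyl
    set("AG"),    # small
]


def alignment_stats(aligned_a, aligned_b):
    """Return (overlap, percent_identity, percent_similarity).

    Both inputs are aligned sequences of equal length, with '-' marking gaps.
    Assumption: columns containing a gap are excluded from the overlap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    overlap = identical = similar = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gap columns
        overlap += 1
        if a == b:
            identical += 1
            similar += 1  # identical residues also count as similar
        elif any(a in group and b in group for group in SIMILAR_GROUPS):
            similar += 1

    if overlap == 0:
        raise ValueError("no aligned residue pairs to compare")

    return overlap, 100.0 * identical / overlap, 100.0 * similar / overlap


if __name__ == "__main__":
    # Toy aligned fragments, not taken from any SARS CoV or SARS CoV-2 protein.
    overlap, identity, similarity = alignment_stats("MKV-LDE", "MRVALEE")
    print(f"{overlap} aa overlap, {identity:.1f}% identity, {similarity:.1f}% similarity")
```

Under this definition similarity can never be lower than identity, which is the pattern seen in every row of the table.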