Table 6.
Viral versus human proteome overlap at the 6-mer level
| Virusa | 1 | 2 | 3b | 4b | 5 | 6 |
|---|---|---|---|---|---|---|
| HBV | 1,589 | 1,593 | 496 | 1,394 | 1,224 | 31.2 |
| JCV | 1,527 | 1,604 | 461 | 1,468 | 1,301 | 30.1 |
| Human parvovirus B19 | 1,442 | 1,991 | 431 | 1,341 | 1,150 | 29.8 |
| HRV-14 | 2,174 | 2,174 | 579 | 1,456 | 1,270 | 26.6 |
| HPV-1 | 2,204 | 2,204 | 619 | 1,483 | 1,288 | 28.0 |
| SAF-V | 2,285 | 2,285 | 622 | 1,392 | 1,280 | 27.2 |
| HPV16 | 2,412 | 2,412 | 739 | 2,010 | 1,703 | 30.6 |
| HTLV-I | 2,559 | 2,559 | 967 | 3,261 | 2,658 | 37.7 |
| HCV | 3,005 | 3,005 | 1,025 | 4,195 | 3,099 | 34.1 |
| Rubella virus | 3,169 | 3,169 | 1,133 | 3,918 | 3,119 | 35.7 |
| YFV | 3,406 | 3,406 | 1,005 | 2,680 | 2,322 | 29.5 |
| WNV | 3,428 | 3,428 | 996 | 2,744 | 2,365 | 29.0 |
| HIV-1 | 3,084 | 3,526 | 904 | 2,135 | 1,832 | 29.3 |
| Rabies virus | 3,575 | 3,575 | 1,162 | 2,635 | 2,195 | 32.5 |
| RRV | 3,613 | 3,613 | 1,069 | 2,708 | 2,301 | 29.5 |
| HIV-2 | 3,283 | 3,714 | 1,006 | 4,653 | 2,780 | 30.6 |
| hMPV | 4,117 | 4,118 | 1,245 | 3,263 | 2,720 | 30.2 |
| H5N1 | 4,408 | 4,417 | 1,191 | 2,674 | 2,332 | 27.0 |
| HRSV | 4,483 | 4,485 | 1,258 | 2,847 | 2,455 | 28.0 |
| HPIV3 | 4,812 | 4,812 | 1,350 | 3,194 | 2,721 | 28.0 |
| Lake Victoria marburgvirus | 4,811 | 4,811 | 1,497 | 5,631 | 3,695 | 31.1 |
| Mumps virus | 4,787 | 4,937 | 1,463 | 3,734 | 3,017 | 30.5 |
| Measles virus | 4,937 | 5,165 | 1,585 | 3,823 | 3,208 | 32.1 |
| Zaire virus | 4,868 | 5,448 | 1,448 | 3,488 | 2,929 | 29.7 |
| Hendra virus | 5,210 | 6,011 | 1,516 | 3,538 | 3,035 | 29.0 |
| SARS-CoV | 9,767 | 14,144 | 2,704 | 9,438 | 5,770 | 27.6 |
| HHV-4 | 32,634 | 34,566 | 12,358 | 50,013 | 18,445 | 37.8 |
| HHV-6 | 42,432 | 44,160 | 12,126 | 35,793 | 16,847 | 28.5 |
| Variola virus | 53,136 | 53,304 | 13,870 | 33,911 | 16,167 | 26.1 |
| HHV-5 | 63,115 | 64,330 | 21,873 | 87,931 | 23,629 | 34.6 |
| Allc | 282,932 | 298,966 | 87,224 | 259,655 | ||
Human proteome formed by 36,103 proteins and 15,734,725 occurrences of 8,247,275 unique 6-mers. Column number refers to: (1) unique 6-mers in the viral proteome; (2) total number of 6-mers in the viral proteome (including multiple occurrences); (3) unique viral 6-mers occurring in the human proteome; (4) viral overlap occurrences in the human proteome (including multiple occurrences); (5) number of human proteins involved in overlap; (6) % of unique viral 6-mers which occur in the human proteome (i.e. 100 × column 3/column 1).
Abbreviations as in Table 1.
The results of linear regression analysis between columns 1 and 3, and 1 and 4 are: column 3 = 0.31426 × column 1 − 42.237 (r = 0.98762). Column 4 = 1.0972 × column 1 − 877.68 (r = 0.93373).
Obtained by combining all 30 viral proteomes into one viral proteome, and then computing the overlap with the entire human proteome.