2020 Jul 18;21:316. doi: 10.1186/s12859-020-03650-y

Fig. 2.

Schematic diagram of the strain-based alignment approach. Influenza viral sequences are aligned by FluCS as follows: (a) The FluSeed Dataset is constructed from quality-checked and rearranged viral sequences. Blocks in different colors represent ten viral segments. The size of each block corresponds to the length of the viral sequence as originally retrieved. Blocks of any color tagged with the same Arabic numeral are identified as belonging to the same strain. (b) Rearranged viral sequences are sorted into 11 protein clusters based on gene segment and aligned within each cluster. Aligned sequences are translated in the correct ORF into PB2, PB1, PA, HA, NP, NA, M1, NS1, and PB1-F2. M2 and NS2 are alternatively spliced products of the M and NS mRNAs, respectively. (c) Delineated viral amino acid sequences are readily aligned based on standard influenza viral nomenclature. The analysis platform supports multi-layer subgrouping based on epidemiological significance.
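The sorting and strain-matching steps in panels (a) and (b) can be illustrated with a minimal Python sketch. It is not the FluCS implementation: the record layout `(strain, protein, sequence)`, the function names `cluster_by_protein` and `strain_table`, and the column order in `PROTEINS` are assumptions made for illustration, and the within-cluster alignment itself is not performed here.

```python
from collections import defaultdict

# Column order is an assumption; the names are the 11 proteins listed in the caption.
PROTEINS = ["PB2", "PB1", "PA", "HA", "NP", "NA", "M1", "NS1", "PB1-F2", "M2", "NS2"]


def cluster_by_protein(records):
    """Group (strain, protein, sequence) records into one cluster per protein."""
    clusters = defaultdict(dict)
    for strain, protein, seq in records:
        clusters[protein][strain] = seq
    return clusters


def strain_table(clusters, proteins=PROTEINS, missing=None):
    """Build one row per strain, with one column per protein cluster.

    Proteins that were not retrieved for a strain are filled with `missing`.
    """
    strains = sorted({s for cluster in clusters.values() for s in cluster})
    return {
        s: [clusters.get(p, {}).get(s, missing) for p in proteins]
        for s in strains
    }


# Toy input: strain "1" has HA and NA, strain "2" has HA only (hypothetical data).
records = [("1", "HA", "MKT"), ("1", "NA", "MNP"), ("2", "HA", "MKA")]
clusters = cluster_by_protein(records)
table = strain_table(clusters)
```

Keying every cluster by the same strain identifier is what lets rows from different protein clusters be matched up again after each cluster has been aligned separately.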