Eukaryote evolution timeline of branches where GH7 genes have been found and the percent identities (ID%) and similarities (Sim%) of the protein sequences within the GH7 domain to the D. discoideum Cel7A sequence. Branch names and time points of divergence are from reference 72. Only points of early divergence (>600 million years ago) are included; the later divergence among stramenopiles, amoebozoa, and metazoans is not resolved here. The multiple-sequence alignment of selected GH7 sequences was performed with the MUSCLE web service, and flanking regions (e.g., the signal peptide and CBM) were trimmed before pairwise sequence identities and similarities were calculated using the Gonnet substitution matrix. UniProt accession numbers are provided, except in the case of Schizochytrium aggregatum, for which the U.S. patent number is indicated (76). *, the Aureococcus anophagefferens GH7 sequence (UniProt accession number F0YSW7) appears to be a fragment containing only 135 residues of the C-terminal part of the GH7 domain in the alignment.
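
The pairwise calculation described in this legend can be illustrated with a minimal Python sketch. It is not the procedure used for the figure. It assumes that the two sequences are already aligned and trimmed to the GH7 domain, that columns with a gap in either sequence are excluded from the denominator, and that "similar" is decided by a caller-supplied function. The function names and the toy similarity rule are hypothetical. With the Gonnet matrix, one common convention is to count a pair as similar when its substitution score is positive, but the threshold used for the figure is not stated here.

```python
def aligned_pairs(seq_a, seq_b, gap="-"):
    """Return residue pairs from two aligned sequences, skipping gapped columns."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")
    return [(a, b) for a, b in zip(seq_a, seq_b) if a != gap and b != gap]


def percent_identity(seq_a, seq_b):
    """Percentage of ungapped aligned columns with identical residues."""
    pairs = aligned_pairs(seq_a, seq_b)
    if not pairs:
        return 0.0
    return 100.0 * sum(a == b for a, b in pairs) / len(pairs)


def percent_similarity(seq_a, seq_b, is_similar):
    """Percentage of ungapped aligned columns judged similar by is_similar(a, b)."""
    pairs = aligned_pairs(seq_a, seq_b)
    if not pairs:
        return 0.0
    return 100.0 * sum(is_similar(a, b) for a, b in pairs) / len(pairs)


# Toy similarity rule for illustration only (NOT the Gonnet matrix):
# identical residues, or both residues acidic (D/E), count as similar.
def toy_similar(a, b):
    return a == b or {a, b} <= {"D", "E"}


identity = percent_identity("ACDE-", "ACDD-")
similarity = percent_similarity("ACDE-", "ACDD-", toy_similar)
```

In this toy example, the final gapped column is ignored, three of the four remaining columns are identical, and the E/D column counts as similar but not identical. Other denominators (for example, the length of the shorter sequence) are also in use and give different percentages, which matters for fragments such as the A. anophagefferens sequence.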