Abstract
Myosins, a large family of actin-based motors, have one or two heavy chains with one or more light chains associated with each heavy chain. The heavy chains have a (generally) N-terminal head domain with an ATPase and actin-binding site, followed by a neck domain to which the light chains bind, and a C-terminal tail domain through which the heavy chains self-associate and/or bind the myosin to its cargo. Approximately 140 members of the myosin superfamily have been grouped into 17 classes based on the sequences of their head domains. I now show that a phylogenetic tree based on the sequences of the combined neck and tail domains groups 144 myosins, with a few exceptions, into the same 17 classes. For the nine myosin classes that have multiple members, phylogenetic trees based on the head domain or the combined neck/tail domains are either identical or very similar. For class II myosins, very similar phylogenetic trees are obtained for the head, neck, and tail domains of 47 heavy chains and for 29 essential light chains and 19 regulatory light chains. These data strongly suggest that the head, neck, and tail domains of all myosin heavy chains, and light chains at least of class II myosins, have coevolved and are likely to be functionally interdependent, consistent with biochemical evidence showing that regulated actin-dependent MgATPase activity of Dictyostelium myosin II requires isoform specific interactions between the heavy chain head and tail and light chains.
Myosins, a large family of actin-dependent motors involved in contractile and motile functions in higher and lower animals, plants, and fungi (1), contain one or two (identical) heavy chains of masses ranging from about 110,000 to 250,000 Da and one or more light chains of masses between about 15,000 and 20,000 Da per heavy chain. The heavy chains contain a head (catalytic, motor) domain, generally N-terminal, followed by a neck domain to which the light chains bind, and a C-terminal tail domain, which, in some myosins, dimerizes with an identical heavy chain by forming coiled-coil helical regions and through which some myosin dimers polymerize into filaments. The tail domain of other myosin heavy chains is thought to associate with membranes and organelles or other cargoes that the myosin molecules may transport along actin filaments.
The databases now contain complete DNA sequences of approximately 145 different myosin heavy chains from about 40 different species. Myosin heavy chains have been grouped into 17 different classes (J. Cope and T. Hodge, The Myosin Home Page, http://www.mrc-lmb.cam.ac.uk/myosin/myosin.html) based on the derived amino acid sequences of their head domains. [After completion of this study, a two-member class XVIII was identified (2).] Head domain sequences were used because myosins were defined by the actin-dependent MgATPase activity of the head (3). It was recognized, however, that the initial groupings of myosins by head domain sequences into three (4) and seven (3) classes were tightly correlated with gross structural differences of their tail domains. Although many complete heavy chain sequences are now available, phylogenetic trees of the entire myosin superfamily based on sequences of heavy chain domains other than the head domain have not been developed.
Traditionally, and largely as a result of extensive biochemical studies of the conventional (class II) muscle myosins, myosin catalytic activity and its regulation have been attributed solely to the heavy chain head domain and light chains associated with the neck domain (1). However, there is increasing evidence, especially for the class II myosins from the slime mold Dictyostelium discoideum and the soil amoeba Acanthamoeba castellanii, that contradicts this dogma. Specifically, phosphorylation of serine residues in the tail of Dictyostelium myosin II heavy chain down-regulates the actin-dependent MgATPase activity of the head by increasing the KATPase for F-actin (5) and phosphorylation of serine residues at the C-terminal end of the tail of the heavy chain of Acanthamoeba myosin II down-regulates the activity of its head domain by substantially reducing Vmax (6). In the accompanying paper (7), we show that the actin-dependent MgATPase activity of chimeras in which the head and neck (and associated light chains) domains of Dictyostelium myosin II are fused to the tail domain of either Acanthamoeba myosin II or chicken smooth muscle myosin II is substantially greater than the activity of wild-type Dictyostelium myosin II and essentially unregulated, i.e., in contrast to wild-type Dictyostelium and smooth muscle myosin II, actin-dependent MgATPase activity of the chimeras is only minimally inhibited by unphosphorylated regulatory light chain and, therefore, only minimally activated by phosphorylation of the Dictyostelium regulatory light chain and, in contrast to wild-type Acanthamoeba myosin II, the activity of the chimera with Acanthamoeba myosin II tail is not inhibited by phosphorylation of the Acanthamoeba tail domain. These and other results (8) indicative of strong coupling between the head, neck, and tail domains of the heavy chains of Dictyostelium, Acanthamoeba, and smooth muscle myosin II led me to inquire whether the implied coevolution of the head, neck, and tail domains of these, and possibly all, myosins would be supported by phylogenetic analysis of their amino acid sequences.
Experimental Method
Unrooted phylogenetic trees were constructed by progressive multiple sequence alignment using the default mode of the PC version of the clustal x program (9) with 1,000 bootstrap trials, without correction for gaps or multiple substitutions and without correction for any misalignments (9) that may have occurred. Trees were plotted by using the neighbor joining method (9) and manually adjusted by rotating branches around nodes.
Normally, sequences were entered in alphabetical order but, in some instances, the sequence order was randomized, and 10,000 bootstrap trials were performed. Complete heavy chain sequences were divided into head domains and combined neck/tail domains or, for class II myosins, separate head, neck, and tail domains, as described below. The myosins that were analyzed, their accession numbers, and the residues comprising the head and neck/tail domains or head, neck, and tail domains are enumerated in the figure legends.
The first step was to establish the boundaries between the head, neck, and tail domains by multiple alignments within each of the 17 myosin classes defined by the phylogenetic tree (http://www.mrc-lmb.cam.ac.uk/myosin/myosin.html) of the core motor domains. For class II myosins, this was straightforward as all of the head domains terminate with amino acids FFK or FFR, the neck domains are the next 71 amino acids (except for ScMYO1, which has 72 residues in its neck domain) and all of the tail domains begin with a Pro residue. It also was not difficult to define the end of the head domains of the other myosins; for example, class I myosin heads terminate with FIR or FIK and, occasionally, FVK. However, it proved very difficult, if not impossible, to define unequivocally the junctions between the neck and tail domains of the non-class II myosins. Although neck domains characteristically are defined by the presence of one or more IQ motifs (10), to which the calmodulin-like light chains bind, these motifs can be sufficiently degenerate or incomplete that reasonable people can reach different conclusions on the number of IQ motifs in a given heavy chain, especially as there is no independent biochemical information about the nature or number of light chains of most myosins. For this reason, separate phylogenetic analyses of the neck domains for myosins other than class II myosins were not attempted. Instead, phylogenetic trees of the head domains were compared with phylogenetic trees of the combined neck and tail domains. This almost certainly did not affect the conclusions (other than the absence of explicit information about the phylogeny of the neck domains) because the neck domains are substantially shorter than the tail domains and, thus, contribute very little to the combined neck/tail sequences. Also, combining the neck domains of class II myosins with either the head or tail domains did not change the phylogenetic trees obtained for the head and tail domains alone.
Phylogenetic trees can be presented interchangeably as radial (Fig. 1 and see Figs. 4–6) or vertical (Fig. 2 and 3) plots. The sum of the lengths of the lines connecting any two members of a radial tree and the sum of the lengths of the horizontal lines between any two members of a vertical tree is proportional to the divergence between their amino acid sequences; the standard bar with each tree represents 10% sequence divergence. The vertical distances between members of a vertical tree, the angles between branches in a radial tree, and the positions of branches around the nodes of both radial and vertical trees are arbitrary and have no significance.
Results and Discussion
Class I Myosins.
Thirty myosins from 13 species from amoeba to human were analyzed (Fig. 1). Except for the difference in scale (see standard bar on each figure), the phylogenetic trees of the neck/tail domains and head domains are virtually superimposable, and the two trees are equally robust with 18 of 22 nodes obtained in >95% of bootstrap trials. Within branches, the same myosins have essentially the same relationships in the two trees. For example, Btbeta, HsIbeta, and RnMyr2 are very close together, and EhIB, AcIB, and DdIB are rather distant from each other in both trees. Also, in both trees myosins from the same species are on separate, often quite distant, branches, except for ScMYO3 and ScMYO5 and MmIalpha and MmIA, which are together on both trees.
Class II Myosins.
Trees were developed for the head, neck, and tail domains of 47 myosins from 25 species (Fig. 2). The trees for the head and tail domains are very similar. In both trees, vertebrate striated muscle myosins form a subgroup with separate subgroups for cardiac alpha, cardiac beta, and skeletal muscle myosins; the two cardiac myosin subgroups are closer together than either is to vertebrate skeletal muscle myosin; invertebrate striated muscle myosins are a somewhat more distant subgroup; and vertebrate smooth muscle myosins are closer to vertebrate and invertebrate nonmuscle myosins than to striated muscle myosins. Yeast myosins and protozoal myosins form two separate groups in both trees but with an inversion of their relationship to smooth muscle and nonmuscle myosins. With a few exceptions, the relationships of individual myosins within each group and subgroup are quite similar in the two trees—the position of Drosophila muscle myosin II (Dm) is slightly different in the two trees. The tail tree is essentially as robust as the head tree; 33 nodes were found in >90% of the bootstrap trials of the tail tree compared with 36 nodes in the head tree. Bezanilla and Pollard (11) reported similar results in comparing phylogenetic trees of the combined head/neck domains (as defined here) and tail domains of 34 class II myosins.
The phylogenetic tree for the neck domains (Fig. 2) is very similar to the trees for the head and tail domains with similar groupings of cardiac and skeletal striated muscle myosins, smooth muscle myosins, and nonmuscle myosins of vertebrate and invertebrate species. Protozoal myosins also have a similar position in the three trees but yeast myosins are closer to striated muscle myosins in the neck tree rather than to the smooth muscle and nonmuscle myosins as in the head and tail trees. Fewer nodes (17 of them) were found in >90% of bootstrap trials of the neck tree, than in the other two trees, perhaps because the necks contain only 71 amino acids. Generally, the distances between head domains are less than the distances between neck domains and tail domains have the greatest sequence divergence (note the different lengths of the scale bars for the three trees). However, the relative distances between any two myosins are very similar in the three trees.
Because fewer myosins were analyzed, the trees for the 29 essential light chains and 18 regulatory light chains (Fig. 3) cannot be compared precisely either to each other or to the trees for the three heavy chain domains. However, as was found for the heavy chain domains, the essential and regulatory light chains of smooth muscle myosins group with nonmuscle myosin light chains, invertebrate and vertebrate striated myosin light chains fall into separate groups, and cardiac and skeletal muscle myosin light chains also form subgroups. Although the light chain trees are less robust than the heavy chain trees, i.e. a smaller fraction of nodes were found in >90% of the bootstrap trials, the same trees were obtained for the light chains with 10,000 trials as with 1,000 trials and the order of entry of the sequences made no difference.
Myosin Classes V, VI, VII, VIII, IX, XI, and XIV.
Although there may be more myosins yet to be discovered, at this time these seven classes contain only from five (class IX) to 15 (class XI) members from a limited number of species. Class VIII is limited to higher plants and class XIV to two parasitic protozoa. AtMYA1 (GenBank accession number CAA82234), AtMYA2 (GenBank accession number CAA84067), HaHamy2 (GenBank accession number AAB71527), and ZmMyo2 (GenBank accession number AAD34597) were not included in the class XI tree because the available sequences were too short at either the N-terminal or C-terminal end. Note that DdMyoJ was included in both the class V and class XI trees because in the two papers describing this myosin (12, 13) some uncertainty was expressed as to which of these two classes it belongs or if, indeed, these are separate classes.
In general, the head and neck/tail trees are equally robust within each of the seven classes, i.e., the nodes are as strongly supported in the neck/tail tree as in the head tree for each class in Figs. 4 and 5. As for the class I and class II trees, the trees for the head and neck/tail domains of these seven classes (Figs. 4 and 5) are essentially superimposable, when adjusted for the greater distances between the neck/tail domains, and the relative distances between the head domains and neck/tail domains of any two myosins are very similar. The most obvious exceptions are ScMyo2 and ScMyo4 in class V (Fig. 4), whose neck/tail domains are relatively more divergent than their head domains, and Cr, Cco, and AtH in class XI (Fig. 5), whose relative positions are somewhat different in the head domain and neck/tail domain trees. Also, the node where ScMyo2 and ScMyo4 separate is much closer to the DdMyoJ branch in the class V neck/tail tree than in the head tree (Fig. 4). DdMyoJ is more distant from its closest neighbors in class XI than in class V, especially in the neck/tail trees, consistent with the classification of DdMyoJ as a class-V myosin by Cope and Hodge (http://www.mrc-lmb.cam.ac.uk/myosin/myosin.html). Also, when single head and neck/tail domain trees were prepared containing all of the myosins in class V and class XI, the members of each class segregated at opposite ends of both trees with DdMyoJ (closer to the myosin V group) and Cr (closer to the myosin XI group) in the middle (not shown).
Myosin Classes I–XVII.
Phylogenetic trees were produced for the head domains and neck/tail domains of 144 myosins including those analyzed in Figs. 1–5 and members of the eight classes not analyzed in the previous figures: classes III, IV, XII, XV, and XVI, comprising one myosin each (class III contains two myosins but Limulus polyphemus myosin was not included because the sequence in the database (GenBank accession number AAC16332) has a tail of only 46 residues compared with 495 for DmIII), and classes X, XIII, and XVII, each of which has two members.
Because there is much greater sequence identity among the head domains, not surprisingly, the neck/tail domain tree (Fig. 6) is less robust than the head domain tree (not shown), 80 vs. 102 nodes supported by >90 of bootstrap trials. Nonetheless, with a few exceptions, the neck/tail domains segregated into the 17 classes previously defined by analysis of the head domains. Two of the five Dictyostelium class I myosins, DdIA and DdIE, separated from the 28 other class I myosins and from each other; Ce-IX did not segregate with the four other class IX myosins; class V myosins divided into two groups, six vertebrate and invertebrate myosins and three Dictyostelium and yeast myosins; and class VII myosins separated into two groups, three Drosophila and Dictyostelium myosins and two mammalian myosins. The divisions within class V, class VII, and class XI parallel the alignments within the individual class trees (Figs. 4 and 5), but the separations of the two Dictyostelium class I myosins do not. Although not shown in detail, the alignments within each class in Fig. 6 are otherwise essentially the same as the alignments within the nine individual class trees. Interestingly, in the 144-head domain tree (not shown) DdMyoJ and DdMyoI also separated from the other class V and class VII myosins, respectively.
Concluding Comments
This analysis demonstrates that, with a few exceptions, phylogenetic analysis divides the neck/tail domains of the heavy chains of 144 myosins into the same 17 classes previously defined by comparison of their head domain sequences. The “mislocation,” when compared with previous alignments, of head domains of DdMyoJ and DdMyoI may have resulted from using total head sequences rather than the much more highly conserved core motor domains previously used, but may reflect some ambiguity in the classification of these two myosins. The additional divergences within the 144-neck/tail domain tree could indicate either a need for further refinement of the classifications or a divergence between the phylogeny of head and neck/tail domains in these few cases. These discrepancies might disappear, however, if the initial sequence alignments were corrected for gaps, multiple substitutions, and possible misalignments before plotting the trees. Thompson et al. (9) have cautioned that misalignments can occur in any automated method, especially when sequences are less than 30% identical as is commonly the case for myosin neck/tail domains in contrast to head domains.
The head domains and neck/tail domains of the individual myosins within each class, including the large myosin I and myosin II classes, segregate very similarly within their respective trees and, at least for class II myosins, light chain phylogeny parallels heavy chain phylogeny. The coalignments of head and neck/tail domains might be even better if the alignment procedure were refined, as discussed above.
The logical inference from these results, that the apparent coevolution of myosin heavy chain head and tail (and possibly neck) domains reflects a functional interdependence of the two (or three) domains, is consistent with the finding that the catalytic properties of the head domain of Dictyostelium myosin II are greatly affected when the tail domain is replaced by the tail domain of either Acanthamoeba, chicken smooth muscle, or chicken skeletal muscle myosin II (7, 8). Whether such tight coupling between head and tail domains can be generalized to all or most myosins needs to be determined. For example, exchange of head and tail domains within or between the cardiac alpha or cardiac beta heavy chain subgroups may have little or no effect because the sequences of the head, neck, and tail domains are 95–99% identical in pairwise comparisons between species within each of these subgroups and 85–90% identical in pairwise comparisons between the subgroups. This contrasts to only about 50% sequence identity between the head domains and 20–25% sequence identity between tail domains of Dictyostelium myosin II and either Acanthamoeba, chicken smooth muscle, or chicken skeletal muscle myosin II.
Acknowledgments
I thank Dr. James R. Sellers (National Heart, Lung, and Blood Institute) for many helpful discussions during the course of this research and Dr. Sellers and Dr. Robert S. Adelstein (National Heart, Lung, and Blood Institute) for suggestions for improving the manuscript.
Footnotes
Article published online before print: Proc. Natl. Acad. Sci. USA, 10.1073/pnas.230441597.
Article and publication date are at www.pnas.org/cgi/doi/10.1073/pnas.230441597
References
- 1.Sellers J R. Myosins. Oxford, U.K.: Oxford Univ. Press; 1999. [Google Scholar]
- 2.Yamashita, R. A., Sellers, J. R. & Anderson, J. B. (2000) J. Muscle Res. Cell Motil., in press. [DOI] [PubMed]
- 3.Cheney R E, Riley M A, Mooseker M S. Cell Motil Cytoskel. 1993;24:215–223. doi: 10.1002/cm.970240402. [DOI] [PubMed] [Google Scholar]
- 4.Goodson H V, Spudich J A. Proc Natl Acad Sci USA. 1993;90:659–663. doi: 10.1073/pnas.90.2.659. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Truong T, Medley Q G, Côt é. J Biol Chem. 1992;267:9767–9772. [PubMed] [Google Scholar]
- 6.Collins J H, Kuznicki J, Bowers B, Korn E D. Biochemistry. 1982;21:6910–6915. doi: 10.1021/bi00269a045. [DOI] [PubMed] [Google Scholar]
- 7.Liu X, Shu S, Yamashita R, Korn E D. Proc Natl Acad Sci USA. 2000;97:12553–12558. doi: 10.1073/pnas.230441497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Ito K, Liu X, Katayama E, Uyeda T Q P. Biophys J. 1999;76:985–992. doi: 10.1016/S0006-3495(99)77262-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Thompson J D, Gibson T J, Plewniak F, Jeanmougin F, Higgins D G. Nucleic Acids Res. 1997;25:4876–4882. doi: 10.1093/nar/25.24.4876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Cheney R E, Mooseker M S. Curr Biol. 1992;4:27–35. doi: 10.1016/0955-0674(92)90055-h. [DOI] [PubMed] [Google Scholar]
- 11.Bezanilla M, Pollard T D. Mol Biol Cell. 2000;11:79–91. doi: 10.1091/mbc.11.1.79. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Peterson M D, Urioste A S, Titus M A. J Muscle Res Cell Motil. 1996;17:411–424. doi: 10.1007/BF00123358. [DOI] [PubMed] [Google Scholar]
- 13.Hammer J A, III, Jung G. J Biol Chem. 1996;271:7120–7127. doi: 10.1074/jbc.271.12.7120. [DOI] [PubMed] [Google Scholar]