Skip to main content
Plant Signaling & Behavior logoLink to Plant Signaling & Behavior
. 2017 Jul 10;12(8):e1347245. doi: 10.1080/15592324.2017.1347245

Positive selection analysis highlights key positions in plant PP2A regulatory subunits

Matthew A Booker 1,*, Alison DeLong 1,
PMCID: PMC5616155  PMID: 28692336

ABSTRACT

A versatile hub for cellular control, the eukaryotic protein phosphatase 2A (PP2A) enzyme family is thought to achieve specificity through combinatorial complexity. Phylogenetic analysis has revealed that expansion of PP2A gene families resulted from whole genome duplications followed by non-random gene loss, and selection analysis suggests that retention of B56/PPP2R5 gene family members after genome duplication events was driven by functional diversification. Here we identify the sites at which positive selection is detected in the plant B56 gene family, and discuss the significance of selection at these positions in the context of PP2A holoenzyme structure. The pattern of positive selection observed in the B11 subclade is distinctive, and suggests selective pressure on interactions with substrates and the enzymatic core.

Keywords: Molecular phylogeny, phosphatase evolution, phosphatase regulatory subunits, positive selection analysis, protein phosphatase 2A

Results and discussion

Canonical heterotrimeric PP2A holoenzymes contain catalytic (C), regulatory (B), and structural/scaffolding (A) subunits; these complexes regulate diverse cellular transactions by dephosphorylating target proteins (reviewed in refs.1-3). We recently reported the results of our phylogenetic analysis of PP2A gene families encoding A, B and C subunits,4 with particular focus on the 3 regulatory B subunit types known in plants: B55, B56 and B72 (reviewed in refs.1,2). Although each B subunit type is found in heterotrimeric PP2A complexes, the gene families encoding these proteins are unrelated and the protein products are structurally and biochemically distinct. Members of the B55 family are β-propeller proteins5 and B72 family members are calcium-binding proteins,6-9 while B56 proteins comprise a conserved core of α-helical Huntingtin/Elongation factor 3/A subunit/TOR (HEAT) repeats.10 The overall pattern of PP2A gene family evolution is one of expansion driven by paleopolyploidization events, followed by non-random gene loss.4 Interestingly, each regulatory B subunit family includes one clade that deviates from this overall pattern by showing little or no expansion. We also observed several lineage specific losses among plant PP2A gene families; in the B56 family, for instance, the B5/8 clade has been lost in grasses and the BØ clade is absent in plant species that do not support beneficial microbial associations (Fig. 1).4,11,12

Figure 1.

Figure 1.

Overall Topology of the Seed Plant B56 Gene Family. Bayesian analysis was used to construct a phylogenetic tree for the B56 gene family. The tree at left shows a simplified version of the tree constructed through analysis of B56 cDNA sequences from 34 plant species (Chlamydomonas reinhardtii, Physcomitrella patens, Selaginella moellendorffii, Ceratopteris richardii, Adiantum capillus-veneris, Gnetum gnemon, Podocarpus macrophyllus, Pinus taeda, Picea abies, Amborella trichopoda, Spirodela polyrhiza, Phoenix dactylifera, Musa acuminata, Sorghum bicolor, Brachypodium distachyon, Oryza sativa, Aquilegia coerulea, Nelumbo nucifera, Solanum lycopersicum, Mimulus guttatus, Utricularia gibba, Vitis vinifera, Fragaria vesca, Populus trichocarpa, Citrus sinensis, Medicago truncatula, Theobroma cacao, Carica papaya, Cleome/Tarenaya hassleriana, Eutrema salsugineum, Brassica rapa, Capsella rubella, Arabidopsis lyrata, and Arabidopsis thaliana; for details of analysis see ref.4). The first column shows the taxa in which each clade is found (Present) and the second column indicates lineage-specific losses observed for that B subunit clade (Absent). The final column lists the Arabidopsis genes belonging to each clade. For simplicity, 3 clades present in gymnosperms (2 B5/6/8/9/10-like clades and one B11-like clade) and all clades found in non-seed plant species are not shown. For detailed phylogenetic trees including all B56 sequences analyzed, see Supplemental Figures 2 and 3 in ref. 4.

Several clades within the B56 gene family in plants (Fig. 1) show evolutionary evidence of functional diversification.4 We used the branch-site likelihood method13 to map the amino acid positions (sites) at which we could detect positive selection within the B56 protein sequence. Among B56 gene families in seed plant species, the B11 clade exhibited the largest number of sites of positive selection (9; Table 1 and Fig. 2). Several other clades showed evidence of positive selection at 2 (B6 and B3/4/7 clades) or 3 (BØ, B6/9/10 and B5/8/6/9/10 clades) sites. Most of these sites of positive selection mapped to positions of lower sequence conservation (< 50% sequence identity); only 5 of the 24 sites mapped to more highly conserved positions (55 – 75% sequence identity; Table 1 and Fig. 2). Three sequence positions showed evidence of positive selection in 2 distinct clades.

Table 1.

Sites of Positive Selection in the B56 Regulatory Subunit Family Selection analysis of the plant B56 gene family was performed using the branch-site likelihood method13 as described previously.4 Analyses were performed on the amino acid sequence tree (the plant sub-tree shown in Supplemental Fig. S3 of ref.4), and the codon alignment was generated from the amino acid alignment used for the phylogenetic analysis and the nucleotide coding sequence as input into TranslatorX.22 Branch points at the bases of clades containing BØ and A. thaliana isoform lineages were tested. Initial omega values for alternate hypotheses were 1.5 for tests of branch points all clades except the B3 clade (initial omega of 15). All branches were analyzed in independent runs.

Site number B56 Clade B11 positiona BEB valueb Amino acid identity (%)c
1 B5/8/6/9/10 72 0.977
2 93 0.979 55
3 B3/4/7 123 0.986
4 B11 126 0.955
5 B5 127 0.981 45
6 B6/9/10 0.964
7 B3/4 133 0.972
8 B11 138 0.991 45
9 0.995
10 B11 148 0.954 45
11 B11 152 0.98 45
12 B11 155 0.987 64
13 B11 157 0.993 64
14 B6/9/10 161 0.977
15 B11 302 0.986
16 B6/9/10 0.968
17 B11 320 0.97
18 B5/8/6/9/10 340 0.985 45
19 B11 364 0.979 73
20 B6 401 0.97
21 B5/8/6/9/10 405 0.952
22 B3/4/7 413 0.99 64
23 B6 441 0.956 45
24 447 0.993
a

Amino acid residue in the predicted Arabidopsis B11 (AT5G25510) protein sequence

b

Bayes Empirical Bayes value; the criterion for reporting positive selection at any position was a BEB value ≥ 0.950 (and a log likelihood ratio test result of p < 0.05 for positive selection on the clade overall, as reported in ref. 4).

c

–, Less than 40% amino acid sequence identity in Fig. 2 alignment.

Figure 2.

Figure 2.

Sites of Positive Selection within the Core Domain of Plant B56 Regulatory Subunits. Amino acid sequences of B56 domains for all 9 Arabidopsis B56 isoforms plus the Medicago truncatula BØ (PP2AB’112) and human B’γ-1 were aligned using ClustalX.20 Positions at which positive selection was detected in the B11 clade and in other B56 clades are indicated by red and yellow circles, respectively, above the sequence alignment; boxes in the alignment indicate the clade[s] in which selection was detected. Highlighted amino acids in the human B’γ-1 sequence are points of contact with the A (green) or C (blue) subunit,10 or with the SLIM motif from RepoMan and BubR119 (red). The consensus logo was auto-calculated in Jalview.21 Residues in the consensus logo are sized and shaded by percent identity, with the smallest/lightest logo letters indicating 45% identity, and increasing letter size/shading indicating increasing identity (in increments of 9.1%).

We used the structural modeling programs PHYRE214 and ProtMod (http://ffas.burnham.org/protmod-cgi/protModHome.pl) to generate predicted structures for A. thaliana B11 and the Medicago truncatula BØ isoforms, and PyMOL (The PyMOL Molecular Graphics System, Version 1.8.6.0 Schrödinger, LLC) to compare the predicted structures to the crystallographically determined structure of human B’γ-1. B56 regulatory subunits carry relatively short and non-conserved amino- and C-terminal regions that flank a highly conserved core, the B56 domain.15 This core domain comprises 8 HEAT repeat motifs (each consisting of a pair of α helices connected by a short ‘intra-repeat’ loop), followed by an unpaired α helix.10 The alignments exhibit small gaps in loops with variable lengths, but neither modeling program predicted obvious structural differences between the B11 or BØ protein and the human B’γ-1 protein. Superimposition of the predicted B11 structure with the human B’γ-1 structure yields a very tight alignment through the entire B56 domain (dark vs. light gray in Fig. 3A). Superimposition of the predicted BØ structure yielded nearly identical results (data not shown).

Figure 3.

Figure 3.

Mapping Sites of Positive Selection onto the PP2A Complex. Sites at which positive selection was detected in the B11 clade (red) and in other B56 clades (yellow) are highlighted in a structural model for the PP2A complex. A structural model for the B56 domain of the Arabidopsis B11 protein (light gray; residues K81 – K466 in Fig. 2) was generated using FFAS ProtMod and superimposed on the crystal structure of human B’γ-1 (dark gray) in a heterotrimeric PP2A complex (PDB ID 2NPP;10) using PyMOL. Ribbon diagrams of the subunits (panel A) show the amino termini (and first α-helical HEAT repeats) of the B56 structures at top, with the C-termini at bottom. The scaffolding subunit is shown in green, and the catalytic subunit is shown in blue. The long intra-repeat loop of HEAT repeat 2 in the B56 subunit, which includes several sites of positive selection highlighted in red and yellow, projects toward the enzyme's active site. A surface representation of the complex (panel B, which is rotated slightly relative to panel A) highlights the exposed positions of several sites of positive selection in HEAT repeats 5 – 8.

To determine whether selection affects residues that might play specific roles in the context of the heterotrimeric PP2A holoenzyme, we mapped the sites of positive selection onto this structural model for the B56 subunit in the PP2A complex (Fig. 3). More than half of the amino acid positions identified in our positive selection analysis (13/24) mapped to HEAT repeats 1 and 2, in the amino-terminal portion of the B56 domain. Five of those positions were predicted to fall in the extended intra-repeat loop in HEAT repeat 2 that projects toward the active site in the heterotrimeric enzyme complex; residues in this loop are required for PP2A-B56 activity toward tyrosine hydroxylase.16 One site mapped upstream of HEAT repeat 1, in the amino-terminal region not included in the structural model. The remaining 10 positions mapped to HEAT repeats 5, 6, 7, 8, and the C-terminal α helix.

None of the 24 sites of positive selection mapped to the highly conserved third and fourth HEAT repeats, which contain the binding site for a short linear interaction motif (SLIM) recently identified in numerous substrate and scaffolding proteins recognized by B56-containing PP2A complexes17-19 (Fig. 2). However, 5 sites (10, 11, 13, 14 and 18) are predicted to lie at or immediately next to positions in the inter-subunit contact interfaces of the holoenzyme.10 Strikingly, all but one of these sites (number 18) map to the extended intra-repeat loop sequence in HEAT repeat 2 (Figs 2 and 3A), which has previously been linked to recruitment of specific substrates.16 Three of these sites were identified in the B11 clade, suggesting that the B11 clade has experienced unique selective pressure in this region.

The 10 sites that map to the region from HEAT repeat 5 to the C-terminus exhibit a wider phylogenetic distribution, affecting the BØ, B6, B11, B3/4/7, B6/9/10 and B5/8/6/9/10 clades. Only one of these sites (number 18) maps close to a predicted point of contact in the core of the heterotrimeric complex, while 5 of the remaining 9 (numbers 17, 20, 21, 23 and 24) are predicted to affect surface-exposed residues distal to the A and C subunits (Fig. 3A and B). Positive selection at these C-terminal positions thus may reflect selective pressure on interactions with regulatory factors or substrates, rather than inter-subunit interactions within the trimeric complex.

Overall, the distribution of sites of positive selection within plant B56 gene families suggests that the B11 gene family has experienced positive selection at the largest number of sites, including a cluster of positions that are predicted to contact the A/C enzyme core and/or mediate substrate recruitment. In contrast, several other B56 subclades exhibit evidence of positive selection at more dispersed positions that appear less likely to affect interactions with the enzyme core. These data are consistent with the hypothesis that members of the B11 clade may recruit a distinct set of substrates. In contrast, selective pressure that may affect interactions with regulatory factors (e.g. those controlling subcellular localization) is observed in several other B56 clades.

Disclosure of potential conflicts of interest

No potential conflicts of interest were disclosed.

Acknowledgments

We are indebted to G. Jogl for advice on implementing PyMol modeling. We also thank M. Johnson, J. Bender and other members of the Brown Arabidopsis Molecular Genetics Group for helpful discussions. This material is based upon work supported by the National Science Foundation grant IOS-1145585. M.A.B. was partially supported by the National Institutes of Health predoctoral training program (grant no. GM007601).

References

  • 1.Durian G, Rahikainen M, Alegre S, Brosché M, Kangasjärvi S. Protein phosphatase 2A in the regulatory network underlying biotic stress resistance in plants. Front Plant Sci 2016; 7:812; PMID:27375664; https://doi.org/ 10.3389/fpls.2016.00812 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Farkas I, Dombrádi V, Miskei M, Szabados L, Koncz C. Arabidopsis PPP family of serine/threonine phosphatases. Trends Plant Sci 2007; 12:169-76; PMID:17368080; https://doi.org/ 10.1016/j.tplants.2007.03.003 [DOI] [PubMed] [Google Scholar]
  • 3.Uhrig RG, Labandera AM, Moorhead GB. Arabidopsis PPP family of serine/threonine protein phosphatases: many targets but few engines. Trends Plant Sci 2013; 18:505-13; PMID:23790269; https://doi.org/ 10.1016/j.tplants.2013.05.004 [DOI] [PubMed] [Google Scholar]
  • 4.Booker MA, DeLong A. Atypical protein phosphatase 2A gene families do not expand via paleopolyploidization. Plant Physiol 2017; 173:1283-300; PMID:28034953; https://doi.org/ 10.1104/pp.16.01768 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Xu Y, Chen Y, Zhang P, Jeffrey PD, Shi Y. Structure of a protein phosphatase 2A holoenzyme: insights into B55-mediated Tau dephosphorylation. Mol Cell 2008; 31:873-85; PMID:18922469; https://doi.org/ 10.1016/j.molcel.2008.08.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Ahn J-H, Sung JY, McAvoy T, Nishi A, Janssens V, Goris J, Greengard P, Nairn AC. The B″/PR72 subunit mediates Ca2+-dependent dephosphorylation of DARPP-32 by protein phosphatase 2A. Proc Natl Acad Sci 2007; 104:9876-81; PMID:17535922; https://doi.org/ 10.1073/pnas.0703589104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Janssens V, Jordens J, Stevens I, Van Hoof C, Martens E, De Smedt H, Engelborghs Y, Waelkens E, Goris J. Identification and functional analysis of two Ca2+-binding EF-hand motifs in the B'/PR72 subunit of protein phosphatase 2A. J Biol Chem 2003; 278:10697-706; PMID:12524438; https://doi.org/ 10.1074/jbc.M211717200 [DOI] [PubMed] [Google Scholar]
  • 8.Leivar P, Antolín-Llovera M, Ferrero S, Closa M, Arró M, Ferrer A, Boronat A, Campos N. Multilevel control of Arabidopsis 3-hydroxy-3-methylglutaryl coenzyme A reductase by protein phosphatase 2A. Plant Cell 2011; 23:1494-511; PMID:21478440; https://doi.org/ 10.1105/tpc.110.074278 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Wlodarchak N, Guo F, Satyshur KA, Jiang L, Jeffrey PD, Sun T, Stanevich V, Mumby MC, Xing Y. Structure of the Ca2+-dependent PP2A heterotrimer and insights into Cdc6 dephosphorylation. Cell Res 2013; 23:931-46; PMID:23752926; https://doi.org/ 10.1038/cr.2013.77 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Xu Y, Xing Y, Chen Y, Chao Y, Lin Z, Fan E, Yu JW, Strack S, Jeffrey PD, Shi Y. Structure of the protein phosphatase 2A holoenzyme. Cell 2006; 127:1239-51; PMID:17174897; https://doi.org/ 10.1016/j.cell.2006.11.033 [DOI] [PubMed] [Google Scholar]
  • 11.Bravo A, York T, Pumplin N, Mueller LA, Harrison MJ. Genes conserved for arbuscular mycorrhizal symbiosis identified through phylogenomics. Nat Plants 2016; 2:15208; PMID:27249190; https://doi.org/ 10.1038/nplants.2015.208 [DOI] [PubMed] [Google Scholar]
  • 12.Charpentier M, Sun J, Wen J, Mysore KS, Oldroyd GE. Abscisic acid promotion of arbuscular mycorrhizal colonization requires a component of the PROTEIN PHOSPHATASE 2A complex. Plant Physiol 2014; 166:2077-90; PMID:25293963; https://doi.org/ 10.1104/pp.114.246371 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Zhang J, Nielsen R, Yang Z. Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level. Mol Biol Evolution 2005; 22:2472-9; PMID:16107592; https://doi.org/ 10.1093/molbev/msi237 [DOI] [PubMed] [Google Scholar]
  • 14.Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJ. The Phyre2 web portal for protein modeling, prediction and analysis. Nat Protoc 2015; 10:845-58; PMID:25950237; https://doi.org/ 10.1038/nprot.2015.053 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.McCright B, Rivers AM, Audlin S, Virshup DM. The B56 family of protein phosphatase 2A (PP2A) regulatory subunits encodes differentiation-induced phosphoproteins that target PP2A to both nucleus and cytoplasm. J Biol Chem 1996; 271:22081-9; PMID:8703017; https://doi.org/ 10.1074/jbc.271.36.22081 [DOI] [PubMed] [Google Scholar]
  • 16.Saraf A, Oberg EA, Strack S. Molecular determinants for PP2A substrate specificity: charged residues mediate dephosphorylation of tyrosine hydroxylase by the PP2A/B' regulatory subunit. Biochemistry 2010; 49:986-95; PMID:20017541; https://doi.org/ 10.1021/bi902160t [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Hertz EP, Kruse T, Davey NE, Lopez-Mendez B, Sigurethsson JO, Montoya G, Olsen JV, Nilsson J. A Conserved Motif Provides Binding Specificity to the PP2A-B56 Phosphatase. Mol Cell 2016; 63:686-95; PMID:27453045; https://doi.org/ 10.1016/j.molcel.2016.06.024 [DOI] [PubMed] [Google Scholar]
  • 18.Wang J, Wang Z, Yu T, Yang H, Virshup DM, Kops GJ, Lee SH, Zhou W, Li X, Xu W, et al.. Crystal structure of a PP2A B56-BubR1 complex and its implications for PP2A substrate recruitment and localization. Protein Cell 2016; 7:516-26; PMID:27350047; https://doi.org/ 10.1007/s13238-016-0283-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Wang X, Bajaj R, Bollen M, Peti W, Page R. Expanding the PP2A Interactome by Defining a B56-Specific SLiM. Structure 2016; 24:2174-81; PMID:27998540; https://doi.org/ 10.1016/j.str.2016.09.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, et al.. Clustal W and Clustal X version 2.0. Bioinformatics 2007; 23:2947-8; PMID:17846036; https://doi.org/ 10.1093/bioinformatics/btm404 [DOI] [PubMed] [Google Scholar]
  • 21.Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. Jalview Version 2–a multiple sequence alignment editor and analysis workbench. Bioinformatics 2009; 25:1189-91; PMID:19151095; https://doi.org/ 10.1093/bioinformatics/btp033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Abascal F, Zardoya R, Telford MJ. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations. Nucleic Acids Res 2010; 38:W7-13; PMID:20435676; https://doi.org/ 10.1093/nar/gkq291 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Plant Signaling & Behavior are provided here courtesy of Taylor & Francis

RESOURCES