Abstract
Some secreted proteins that assemble into large complexes, such as extracellular matrices or hormones and enzymes in storage granules, must be kept at subaggregation concentrations during intracellular trafficking. We show surfeit locus protein 4 (Surf4) is the cargo receptor that establishes different steady-state concentrations for a variety of soluble cargo proteins within the endoplasmic reticulum (ER) through interaction with the amino-terminal tripeptides exposed after removal of leader sequences. We call this motif the ER-Exit by Soluble Cargo using Amino-terminal Peptide-Encoding motif (ER-ESCAPE motif). Proteins that most readily aggregate in the ER lumen (e.g., dentin sialophosphoprotein [DSPP] and amelogenin, X-linked [AMELX]) have strong ER-ESCAPE motifs to inhibit aggregate formation, while less susceptible cargo exhibits weaker motifs. Specific changes in a single amino acid of the tripeptide result in aggregate formation and failure to efficiently traffic cargo out of the ER. A logical subset of 8,000 possible tripeptides starting a model soluble cargo protein (growth hormone) established a continuum of steady-state ER concentrations ranging from low (i.e., high affinity for receptor) to the highest concentrations associated with bulk flow–limited trafficking observed for nonbinding motifs. Human cells lacking Surf4 no longer preferentially trafficked cargo expressing strong ER-ESCAPE motifs. Reexpression of Surf4 or expression of yeast’s ortholog, ER-derived vesicles protein 29 (Erv29p), rescued enhanced ER trafficking in Surf4-null cells. Hence our work describes a new way of preferentially exporting soluble cargo out of the ER that maintains proteins below the concentrations at which they form damaging aggregates.
Author summary
The majority of proteins secreted from eukaryotic cells are targeted to the endoplasmic reticulum (ER) for folding and maturation and are then transferred by ER-exit vesicles to the Golgi for additional processing and eventual secretion from the plasma membrane. Large protein complexes such as those formed by extracellular matrices and hormones require that monomer concentrations in the ER remain low to inhibit polymerization or aggregation. For such cargo, diffusion into the lumen of ER-exit vesicles alone (bulk flow) is insufficient, and more efficient trafficking is required, necessitating binding to ER-exit vesicle transmembrane cargo receptors. We show that amino-terminal tripeptides of several secreted proteins bind to transmembrane cargo receptor, surfeit locus protein 4 (Surf4), for increased ER-trafficking efficiency. Yeast’s homolog, ER-derived vesicles protein 29 (Erv29p), is functionally conserved, as it could rescue efficient ER trafficking in Surf4-null human cells. Proteome database analyses of candidate secreted proteins and direct experimentation with representative cargo proteins uncovered a range of tripeptide compositions that confer high, modest, or low efficiency of ER trafficking. Furthermore, many soluble chaperone proteins that function within the ER—as well as proteins too large to enter classic coat protein complex II (COPII) ER-exit vesicles, such as fibrillar collagens—start with tripeptides that cannot bind to Surf4/Erv29p. When cargo concentrations exceeded Surf4 levels, trafficking of problematical proteins was prioritized by use of their higher affinity ER-Exit by Soluble Cargo using Amino-terminal Peptide-Encoding motifs (ER-ESCAPE motifs).
Introduction
Approximately one-half of proteins encoded within the human genome start with a leader sequence and/or encode transmembrane domains, suggesting they are translocated into the endoplasmic reticulum (ER) for trafficking to other organelles or into the extracellular environment [1]. Many soluble proteins translocated into the ER are destined to form polymers for a variety of evolutionary reasons, including assembling extracellular matrices. Because many extracellular environment properties of higher eukaryotes (neutral pH, oxidative environment, approximately 1 mM Ca2+, etc.) are similar to that in the lumen of the ER [2], premature assembly of matrix components is possible. Even proteins such as hormones form temporary close-packed arrays within storage-type secretory vesicles. While controlled oligomerization of some proteins in the ER is apparently desired [3], other ER-associated interactions could form damaging aggregates within the lumen. The simplest solution to this problem is to remove problematic cargo proteins from the ER before they can accumulate to concentrations high enough to enable inappropriate or premature associations. Diffusion of cargo proteins from the ER lumen into cargo vesicles destined to the Golgi apparatus (“bulk flow”) is insufficient to keep many proteins at concentrations low enough to prevent aggregation. Cargo receptors have long been proposed to bind specific cargo proteins in coat protein complex II (COPII) exit vesicles for more efficient ER trafficking. Such cargo receptors are generally transmembrane proteins with lumenal domain(s) to bind the cargo while their cytosolic domain(s) interact with cytosolic COPII-vesicle proteins [4]. Since thousands of cargo exist, it is unlikely that cells have a unique cargo receptor for each problematic soluble cargo protein. Hence, cargo receptors must exist that recognize generic signals in proteins with similar trafficking specificities.
One protein that has high tendency to aggregate in the ER lumen and therefore must utilize a cargo receptor is dentin sialophosphoprotein (DSPP). To our knowledge, all nonsyndromic cases of Dentinogenesis Imperfecta (DGI) and the less severe dentin dysplasia (DD) are the result of dominant mutations in the DSPP gene encoding for DSPP. We have shown that mutant DSPP proteins failed to traffic out of the ER [5]. Many of the disease mutations change one of the first three amino acids of the mature protein that is left after removal of the leader sequence. These three amino acids encode for isoleucine-proline-valine (IPV) [5]. These changes were either direct, single-base missense mutations (P17L, P17S, P17T, or V18D) or mutations causing exon-3 to be skipped, also resulting in an acidic, isoleucine-proline-aspartic acid (IPD) amino-terminus. Because these mutations result in the more severe disease, DGI, we proposed that the amino-terminal IPV tripeptide was DSPP’s motif that bound an ER cargo receptor. Failure to interact with this unknown cargo receptor would cause IPD-DSPP proteins, with its fully intact Ca2+-binding repeat domain, to accumulate to higher steady-state concentrations in the Ca2+-rich ER and result in the formation of Ca2+-associated aggregation. Experiments in support of this showed that the amount of wild-type DSPP (IPV) secreted by human embryonic kidney cell line 293A (HEK293A) was inversely correlated with increasing amounts of coexpressed mutant protein (IPD, isoleucine-serine-valine [ISV], or isoleucine-threonine-valine [ITV]-DSPP) [5].
In this current work, we identify that indeed the first three amino acids of many secreted proteins form a tripeptide motif that enhances exit from ER. We call this the ER-Exit by Soluble Cargo using Amino-terminal Peptide-Encoding motif (ER-ESCAPE motif). Furthermore, we defined the location, size, and biochemical properties of the proposed ER-ESCAPE motif. We identify surfeit locus protein 4 (Surf4) and its yeast homolog, ER-derived vesicles protein 29 (Erv29p), as the cargo receptor in human/yeast cells (respectively) that binds to the ER-ESCAPE motif, thereby enhancing ER trafficking of specific soluble cargo proteins. We explore implications that variations of the motif result in different Surf4-binding affinities and different priorities in ER exit for substrates that have differential exit requirements.
Results
Defining required properties of IPV-like motifs for ER trafficking of DSPP and AMELX
Investigating the effectiveness of trafficking soluble cargo out of the ER for the 8,000 possible permutations of amino-terminal tripeptides was unrealistic. Therefore, the investigation was limited to select permutations of amino acid size, charge, and hydrophobicity/hydrophilicity. A survey of mammalian DSPP starting tripeptides (Fig 1, S1 Table) shows a consensus motif of hydrophobic-proline-hydrophobic (Φ-P-Φ) with both hydrophobic amino acids being limited to large ones; isoleucine, valine, leucine, and phenylalanine. For reptilian DSPP-like sequences, polar-but-uncharged amino acids—serine (python) or threonine (alligators and crocodiles)—were observed in the first position.
Mouse DSPP expression constructs (with first 31 amino acids replaced by the human sequence as well as a 6xFLAG tag, when noted), starting with wild-type tripeptide (IPV) or noted tripeptides directly after the leader peptide, were used to test ER trafficking. HEK293A cells express a limited amount of bone morphogenetic protein 1 (BMP1) that cleaves DSPP into dentin sialoprotein (DSP) and dentin phosphoprotein (DPP) fragments during secretion [8]. Therefore, antibody against mouse DSP domain on western blots resulted in both intact (Mr approximately 200 kDa) and DSP fragments (broad Mr approximately 80 kDa) bands in culture media. When serine (serine-proline-valine [SPV]) or threonine (threonine-proline-valine [TPV]) were substituted for the starting isoleucine (IPV), DSPP was trafficked out of the ER and into the conditioned media as effectively as wild-type IPV-DSPP (Fig 2A).
Because it seemed unlikely that the proposed ER’s soluble cargo-trafficking receptor had evolved solely to traffic DSPP (a protein found only in creatures with complex, dentin-based tooth structures [9]), starting tripeptides of other acidic secreted proteins were queried next (Fig 1). Like DSPP, these very acidic proteins could be expected to form problematic, Ca2+-driven aggregates within the ER lumen if not kept to low concentrations by use of a high-affinity ER cargo receptor. The small hydrophobic amino acid, alanine, was found in the first position of chromogranin B (alanine-proline-valine [APV]) in some mammals and reptiles as well as the third position of some fish osteopontin (OPN; arginine-proline-alanine [RPA]) and reptile bone sialoprotein (BSP; phenylalanine-serine-alanine [FSA]). Alanine in the first position (APV) did enable efficient DSPP trafficking out of the ER (Fig 2A). Unpredicted from DSPP evolutionary data, the positively charged amino acid arginine was found starting dentin matrix acidic phosphoprotein 1 (DMP1) proteins of some reptiles (arginine-proline-valine [RPV], arginine-proline-leucine [RPL]) as well as OPN of specific fish (RPL, RPA, RPV, arginine-proline-glycine [RPG]) and one hummingbird (RPV).
Experimentally, DSPP was trafficked efficiently out of HEK293A cells when the positively charged arginine was in the first (RPV) or third (isoleucine-proline-arginine [IPR]) position (Fig 2A). As noted earlier [5], loss of proline in position 2 caused loss of trafficking (Fig 2B). Furthermore, acidic amino acids aspartate (D) or glutamate (E) in the first or third positions of the motif had similar negative effects (Fig 2B). DSPP lacks conserved hydrophobic domains found in stably folded proteins and therefore is like two other small integrin-binding ligand, N-linked glycoprotein (SIBLING) family members, BSP and OPN, which have been shown by NMR to be disordered (unstructured) proteins [10]. Indeed, Fig 2C shows that the accumulation of IPD-DSPP in the ER did not activate the IRE-1 unfolded protein response (UPR, ER-stress) pathway within 52 hr of transfection. In contrast, only 5 hr of treatment with tunicamycin (inhibits N-glycosylation needed by many proteins for correct folding) induced the alternate X-box-binding protein 1 (XBP-1) splice event of inositol-requiring enzyme 1 (IRE1)/UPR pathway. Of course, ER aggregates formed by other soluble cargo proteins could induce UPR pathways and contribute to negative evolutionary selection after loss/change of a beneficial amino-terminal tripeptide.
Our earlier work noted reports that several acidic proteins associated with mineralized matrices made by creatures as diverse as vertebrates, sea urchins, mollusks, and corals start with variations on the IPV motif, suggesting that trafficking acidic proteins via an ER cargo receptor was an ancient process [11]. Logically, however, any protein that is destined to form homopolymers (e.g., extracellular matrices and temporary aggregates in hormone/enzyme storage granules) would also require low ER concentrations to obviate aggregate formation. Furthermore, this hypothesis should include combinations of proteins destined to coassemble into complexes. The first line of defense against premature aggregation is by receptor-directed trafficking out of the ER. AMELX constitutes approximately 90% of the temporary extracellular matrix essential for enamel formation[12, 13]. The isoelectric point of AMELX is neutral and is unlikely to bind/share calcium ions like acidic proteins. However, upon secretion, AMELX self-associates into large structures [14, 15]. AMELX of mammals, reptiles, and amphibians (Fig 1) starts with a Φ-P-Φ, which, like DSPP, uses exclusively larger hydrophobic amino acids to flank the invariant proline.
Experimentally, we show that AMELX starting with the native tripeptide, methionine-proline-leucine (MPL), was efficiently trafficked out of HEK293A cells (Fig 2D). Replacement of proline with serine (ISV, phenylalanine-serine-methionine [FSM]) or starting the protein with an acidic amino acid, glutamic acid-proline-leucine (EPL), caused cellular retention of AMELX (Fig 2D). Use of the proposed ER cargo receptor, therefore, is likely not limited to acidic proteins that aggregate in millimolar Ca2+.
The binding to cargo receptor is limited to the first three amino acids
Several protein hormones known to condense within secretory granules start with IPV-like tripeptides (Fig 1). We next investigated trafficking of growth hormone (GH) because of its ability to accumulate to higher levels than DSPP or AMELX in the ER of HEK293A cells before forming aggregates. Crystal structures of GH show that its amino-terminus is available for binding [16]. GH is also devoid of both N-linked oligosaccharides and glycophosphatidylinositol (GPI) modifications and therefore cannot utilize ER-Golgi intermediate compartment 53 protein (ERGIC-53)/lectin mannose-binding 1 (LMAN1) or p24 ER cargo receptors, respectively [17, 18]. GH is also sufficiently small (about 26 kDa) to be unlikely to interact with transport and Golgi organization 1 (TANGO1)/cTAGE5-associated large-cargo exit vesicles [19, 20].
To query the fourth amino acid’s importance, an asparagine-threonine-threonine (NTT) N-linked oligosaccharide-attachment motif was inserted in position 4 of GH (APVNTT-GH). This oligosaccharide is several times the mass of IPV-like tripeptide itself (Fig 3A). The majority of APVNTT-GH was glycosylated by HEK293A cells, and its steady-state concentration in the cell remained as low as GH lacking the modification (APV-GH, Fig 3B). Because N-linked oligosaccharide addition to GH may have enabled ER trafficking by ERGIC-53/LMAN1, a control cargo protein (glutamic acid-glutamic acid-threonine [EET]-GH) was tested. It has two acidic amino acids and loss of position 2 proline, conditions that reduce/destroy ER trafficking of DSPP and AMELX. Consistent with an inability to interact with HEK293A’s IPV-motif cargo receptor, EET-GH established a much higher intracellular concentration than APV-GH (Fig 3B). Addition of an N-linked oligosaccharide (EETNTT-GH) did not decrease steady-state levels, suggesting that ERGIC-53/LMAN1 could not replace the IPV-motif ER-exit pathway. Endo H removes N-linked oligosaccharides from proteins found within the ER but has no effect on Golgi-modified proteins. All forms of the oligosaccharide can be removed by PNGase F. Susceptibility of both APVNTT-GH and EETNTT-GH in cell lysates (but not from media) to Endo H showed that most cell-associated GH remained within the ER (Fig 3B). These results present strong evidence that the IPV-like motif is limited to the first three amino-terminal amino acids. The abundance of secreted EET-GH suggests that HEK293A has a robust ER-Golgi “bulk flow” process.
Identification of yeast ER-cargo receptor that interacts with proteins starting with IPV-like motifs
Saccharomyces cerevisiae has long been an experimental model for basic eukaryotic cell functions, including ER-Golgi trafficking. Haploid wild-type alpha (α) cells made and secreted IPV-DSPP, while IPD-DSPP was predominantly retained within cells (α lanes, Fig 4A). The amino-terminus of the carboxy-terminal fragment observed in the media was found by microsequencing to start with asparagine-serine-proline [NSP], suggesting that cleavage was the result of the yeast’s endogenous Golgi propeptidase, kexin 2 (Kex2p), at the aspartic acid-lysine-arginine [DKR]-NSP motif near the BMP1 cleavage domain in higher eukaryotes [8]. Clones were selected from an α cell knockout (KO) library that were lacking one of 12 ER-associated, transmembrane proteins considered reasonable candidates for being ER cargo receptors (24 kDa endomembrane protein [Emp24p], 46 kDa endomembrane protein [Emp46p], 47 kDa endomembrane protein [Emp47p], erythrocyte protein 1 [Erp1p], erythrocyte protein 2 [Erp2p], ER-derived vesicles protein 14 [Erv14p], ER-derived vesicles protein 15 [Erv15p], ER-derived vesicles protein 25 [Erv25p], ER-derived vesicles protein 26 [Erv26p], Erv29p, ER-derived vesicles protein 46 [Erv46p], retention of ER proteins 1 [Rer1p]). Each KO strain was modified to express either IPV-DSPP or IPD-DSPP, and only the erv29Δ strain gave results predicted for loss of ER-to-Golgi receptor (S1 Fig). The clone lacking ERV29 gene poorly trafficked both IPV-DSPP and mutant IPD-DSPP proteins (erv29Δ lanes, Fig 4A). Importantly, trafficking of IPV-DSPP but not IPD-DSPP was rescued when each was expressed in the erv29Δ cells cotransformed with second plasmid constitutively expressing Erv29p (rescue lanes, Fig 4A).
To verify that mammalian DSPP results were not just an expression artifact in yeast, two pathogen-related yeast (PRY) proteins starting with Φ-P-Φ motifs, Pry1p (APA) and Pry2p (APV), were expressed in wild-type and erv29Δ cells. Pry1p and Pry2p are both acidic (pI = 3.4), sterol-binding proteins said to reside in the cell for a time to bind toxic sterols before being secreted [21]. Both proteins were labeled near the amino-terminus with a 2xHA-tag. Pry1p in wild-type cells (α) was predominantly an Mr approximately 85 kDa band (Fig 4B). Pry1p expressed in erv29Δ cells resulted in some Mr approximately 85 kDa band but much more Mr approximately 70 kDa band. The smaller band is consistent with an ER-retained protein (i.e., before Golgi-related posttranslational modifications were added). Similarly, Pry2p protein in wild-type cells was predominantly the larger, Golgi-modified size (Mr about 95 kDa), while the smaller ER form (Mr approximately 75 kDa) increased in the cells lacking Erv29p cargo receptors (Fig 4B). Furthermore, changing the starting tripeptide of Pry1p and Pry2p to alanine-proline-aspartic acid (APD) resulted in increased ER-associated Mr bands of both wild-type and erv29Δ cells (Fig 4B). Pry1p and Pry2p proteins modified to express both hemagglutinin (HA)-tag (near amino-terminus) and Myc-tag (carboxy-terminal) were detected by both HA-antiserum (top panel) and Myc-antiserum (bottom panel), thereby showing that the faster electrophoretic bands (ER-retained proteins) were not produced by endogenous proteases (Fig 4C). Therefore, Erv29p that was reported to be a soluble cargo receptor for several yeast proteins including Pry1p [22] and the acidic (pI = 4.8) mating hormone, pro-α-factor [23], likely uses IPV-like motif to bind cargo.
Depletion of Erv29 homolog, Surf4, in human cell line negates enhanced ER-Golgi trafficking for specific soluble cargo
Surf4, the human homolog of ERV29 [24], is considered a housekeeping gene in higher eukaryotes [25, 26]. Surf4 was expressed as a netlike structure including punctate colocalization with the ER exit site (ERES) marker, Sec23 (Fig 5A), and the ERGIC marker ERGIC-53, (Fig 5B). Even when overexpressed, recombinant Surf4 (HA-tagged) showed only low-to-modest residence in cis-Golgi (Fig 5C). However, as predicted [27], changing two of three lysines near the carboxy-terminus of Surf4 to alanines (proposed COPI vesicle-associated recycling motif) greatly increased its expression in cis-Golgi (Fig 5D, HA-Surf4-alanine-alanine-lysine [AAK]). There was little localization of HA-Surf4 in either the rough ER (rER) (Fig 5E) or ER quality control (QC) domain (Fig 5F), suggesting that newly synthesized Surf4 resides predominantly in the ERES and the microenvironment near actively forming COPII exit vesicles.
Using clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated 9 (Cas9) technology, the alleles of Surf4 were deleted in HEK293A cells (S2 Fig). Based on differences in sequence analyses, three independent colonies of Surf4KO cells were obtained. Loss of Surf4 in the fruit fly is lethal [28], but cells of all three Surf4KO clones were viable and had only slightly slower growth rates than their parent cells. A polyclonal antibody against the Surf4 carboxy-terminal cytosol domain detected the Mr approximately 26 kDa protein in wild-type HEK293A membrane extracts but not in Surf4KO extracts (Fig 6A). Surf4KO cells transfected with an expression plasmid encoding human HA-Surf4 rescued the protein’s expression.
Loss of Surf4 had no significant effect on the secretion of two model proteins lacking ER-ESCAPE motifs. Secretion of a commercially available, secreted form of alkaline phosphatase (SEAP, starting with native isoleucine-isoleucine-proline [IIP] tripeptide) or lactoperoxidase with a carboxy-terminal luciferase enzyme for analysis (LPO-Gluc; starting with glutamine-threonine-threonine [QTT]tripeptide) from both wild-type and Surf4KO cells suggests no dramatic changes in critical components of the basic secretory pathway (Fig 6B).
As noted above, wild-type HEK293A cells could make and secrete wild-type AMELX while EPL- AMELX was retained inside the cell (Fig 6C). As would be expected if Surf4 is the Φ-P-Φ motif-binding ER cargo receptor, accumulation of wild-type MPL-AMELX equaled that of mutant EPL-AMELX protein construct in the Surf4KO cells (Fig 6C). The efficient trafficking of native MPL-AMELX but not EPL-AMELX was rescued by coexpression of Surf4 in Surf4KO cells (Fig 6D). Although only about 30% identical to human Surf4 protein (S3 Fig), coexpression of yeast homolog, Erv29p, also specifically rescued trafficking of AMELX starting with native MPL tripeptide in Surf4KO cells (Fig 6D), thereby showing functional conservation in eukaryotic cells. Two lysine-to-alanine changes in the near-carboxy-terminus that caused Surf4 not to cycle back from Golgi to ER also did not rescue MPL-AMELX protein trafficking in Surf4KO cells (Fig 6D, Surf4-AAK lanes). Similar results were found for DSPP constructs with Surf4KO cells being unable to efficiently traffic either wild-type (IPV-DSPP) or mutant (IPD-DSPP; Fig 6E), while coexpression of the same proteins with either Surf4 or Erv29p rescued only DSPP starting with the intact IPV-motif (Fig 6F).
Fig 6G shows that wild-type IPV-DSPP accumulating in Surf4KO cells is in the Ca2+-stabilized aggregate proposed earlier [5]. Permeabilizing intact cells for 10 min with Ca2+-free digitonin released about half of the DSPP into the >100,000 x g supernatant, while including 10 mM Ca2+ stabilized the DSPP aggregate such that it remained predominantly in the pelleted fraction. Wild-type MPL-AMELX aggregates in the Surf4KO cells remained in the >100,000 x g pellet even in the presence of a membrane-solubilizing detergent (Fig 6G).
We next addressed whether Surf4 is a cargo receptor only for proteins starting with the Φ-P-Φ-like motif found in DSPP and AMELX. As the spectrum of human proteins starting with IPV-like tripeptides expanded from acidic, Ca2+-binding acidic proteins (e.g., SIBLINGs) to other matrix proteins known to self-aggregate in the extracellular environment (e.g., AMELX), as well as hormones and enzymes that aggregate temporarily in storage granules (e.g., GH), database searches were expanded to observe variation of IPV-related motifs occurring in proteins of other species. It was logical that some types of amino acids within tripeptides that did not keep DSPP or AMELX at sufficiently low concentrations within the ER to prevent aggregation may be successful for other proteins that form aggregates only at higher concentrations. For example, BSP is an acidic, calcium-binding SIBLING member with a well-conserved hydrophobic-serine-hydrophobic (Φ-S-Φ) in mammals, birds, and reptiles (Fig 1), even though a position number 2 serine for DSPP and AMELX resulted in loss of ER-to-Golgi trafficking. GH appeared to require a higher ER concentration to form aggregates in HEK293A cells and was successfully trafficked when started with tripeptide motifs other than Φ-P-Φ. ISV-GH and arginine-serine-valine (RSV)-GH both trafficked more efficiently in Surf4KO cells when coexpressed with Surf4 (Fig 6H), illustrating the diversity of this receptor for different positive-binding motifs. Because secretion of LPO-Gluc was shown above to be unaffected by loss of Surf4 (Fig 6B), its starting tripeptide, QTT, was changed to two effective motifs (IPV or RSV) to see if intracellular steady-state levels could be decreased by addition of Surf4-interacting motifs. Fig 6I shows that starting lactoperoxidase (LPO) with either IPV or RSV significantly reduced their steady-state levels in HEK293A cells but not in Surf4KO cells. Those results suggest that Surf4 is not only the cargo receptor for proteins starting with the Φ-P-Φ-like motif but also for proteins with differing compositions of the starting tripeptide motif. We propose use of the term ER-ESCAPE motif.
Systematic mapping of ER-ESCAPE motif composition using GH as model soluble cargo
Soluble chaperone proteins must remain in the ER to perform their functions, and any with exposed amino termini should have negligible binding affinities for Surf4/Erv29p. Our results from DSPP and AMELX experiments suggest that lack of proline in positions 2 and/or acidic amino acids in one or more of positions would decrease or even negate binding of proteins to Surf4/Erv29p. Chaperones glucose-related protein 78 (GRP78, or binding immunoglobulin protein [BiP]), glucose-related protein 94 (GRP94 [HSP90B1]), protein disulfide isomerase family A member 2 (PDIA2), protein disulfide isomerase family A member 4 (PDIA4), and calreticulin (CALR) across the eukaryotic evolutionary spectrum have at least one acidic amino acid within their starting tripeptide, and position 2 prolines are rare (Fig 7 and S2 Table). Similarly, vertebrate fibrillar collagens interact with TANGO1/cTAGE5-associated cargo receptor complexes to specifically direct them into alternative exit vesicles large enough to encapsulate them [19, 20]. Classic 60–70 nm COPII exit vesicles are too small for these large proteins. Therefore, any solvent-available starting tripeptides of large collagens should not interact with Surf4. Starting tripeptides of several large, fibrillar collagens of vertebrates (Fig 7, S2 Table) show at least one acidic amino acid or, curiously, one or two glutamines. The abundance of glutamines in position 1 and/or 2 (e.g., Type III collagen’s glutamine-glutamine-glutamine [QQQ], Fig 7) suggests that glutamines help establish motifs as Surf4 nonbinders.
Sixty starting tripeptides for GH were expressed in HEK293A and cell extracts analyzed by GH ELISA to determine steady-state levels. (Technical University of Denmark’s SignalP 4.1 predicted retained amino-terminal tripeptides.) Because not all trial permutations could be tested in a single experiment, candidate tripeptides’ results were normalized with respect to the representative well-trafficked ER-ESCAPE-motif IPV-GH included in each experiment (Fig 8A). This data confirmed earlier DSPP and AMELX results that a variety of Φ-P-Φ motifs established the lowest steady-state GH levels, as did positively charged arginines. When started with conserved BSP motif, Φ-S-Φ, FSM-GH had predicted modest steady-state level. This suggests that FSM, while not as good as a motif with a proline flanked by two hydrophobic amino acids, is successfully used by this Ca2+-binding, acidic protein to keep it from reaching its own problematic ER concentrations. While no harm would likely come to BSP acquiring a Φ-P-Φ motif, there appears to be evolutionary pressure to retain its modest-affinity motif, perhaps to keep from competing with more problematical cargo. Because GH can accumulate to higher levels in HEK293A before forming aggregates, this cargo model also highlighted that a single acidic amino acid within the tripeptide (e.g., IPD-GH, glutamic acid-proline-alanine [EPA]-GH) placed soluble cargo’s steady-state levels in mid to high ranges of ER-accumulation (i.e., low but positive affinity), whereas two or more acidic amino acids always resulted in highest steady-state levels (i.e., probably nonbinding).
The GH ELISA assay was also used to test the hypothesis that the large, uncharged/polar amino acid glutamine, often found in positions 1 and/or 2 of chaperones and other proteins reasonably expected to not bind Surf4/Erv29p (Fig 7), would accumulate to higher levels in cells. Indeed, GH starting with glutamine-proline-valine (QPV), glutamine-serine-valine (QSV), phenylalanine-glutamine-valine (FQV), or glutamine-glutamine-valine (QQV) were found at steady-state levels sufficiently high in cells to suggest weak or no binding to cargo receptor (Fig 8A). Chemically similar asparagine (R-group chain only one CH2 shorter than glutamine) did not substitute for glutamine in the database tripeptide sequences of chaperones or fibrillar collagens. Correspondingly, GH with asparagine in positions 1 (asparagine-proline-valine [NPV]-GH) or 2 (isoleucine-asparagine-valine [INV]-GH) resulted in low or modest GH steady-state levels, respectively, indicating stronger binding constants for Surf4 than similar tripeptides including glutamine.
As results of 60 tripeptide permutations were obtained, some guiding principles seemed to be reasonable: acidic amino acids and glutamine are bad for strong binding to Surf4; proline is good in position 2 but not 3; hydrophobic amino acids are good in positions 1 and 3 but not position 2, etc. (Fig 8B). However, levels of some tripeptide-GH proteins that such “rules” would seem to predict to be excellent binders did not always work as expected. For example, phenylalanine-proline-valine (FPV) is a Φ-P-Φ motif, but the FPV-GH construct had higher steady levels (modest affinity) than predicted (Fig 8A). Perhaps the binding pocket of Surf4 finds the combination of two of the largest hydrophobic amino acids flanking the proline to be slightly destabilizing. In another example, yeast aspartate protease (PrA) is a modestly acidic (pI 4.7) protein described as being dependent on Erv29p for ER-Golgi trafficking [24]. PrA starts with lysine-valine-histidine (KVH) tripeptide, a motif that, lacking a proline or serine in the number 2 position, did not fit the prediction model of a successful ER-ESCAPE motif. However, KVH-GH did have a positive, if modest, ability to enhance trafficking of GH out of the ER (Fig 8A). Therefore, the summary of our interpretation of the relative good/neutral/bad contribution of each of the 20 possible amino acids in positions 1, 2, and 3 in the amino-terminal tripeptide (Fig 8B) is generally useful for predicting strong or nonbinding motifs but is somewhat subjective for more modestly binding tripeptides.
When Surf4/Erv29p is in excess, each soluble cargo protein should come to an independent ER steady-state concentration based on its amino-terminal tripeptide’s receptor affinity. To test this hypothesis, GH starting with a well-trafficked motif, APV, was first separately expressed then coexpressed with either nontrafficking EET-GH or a modestly trafficking ITV-GH. To distinguish the two coexpressed GH proteins on western blots, one protein in each experiment was encoded with an N-linked oligosaccharide motif (APV-NTT, EET-NTT, ITV-NTT), causing it to electrophorese slower on western blots. To control for possible interactions between these carbohydrates and the lectin aspects of ERGIC-53/LMAN1 cargo receptor, each coexpression pairing was conducted twice, with expression of NTT motif alternating between the two proteins. The same amount of plasmid for each protein was used for transfection. Transfections involving a single construct included inert plasmid to make amounts of DNA equal. As expected from earlier GH ELISA results, western blots showed that a single expression of the strong ER-ESCAPE motif, APV-GH, established lowest steady-state concentration in cells, followed by modest levels of ITV-GH, while highest levels were observed with EET-GH whether these proteins contained an N-linked oligosaccharide or not (Fig 9A). Because GH steady-state levels +/− NTT were very similar when directly comparing results of each starting tripeptide (e.g., EET versus EET-NTT), GH trafficking by ERGIC-53/LMAN1 in HEK293A appears to be insignificant for GH. As noted earlier for SEAP and LPO-Gluc proteins (Fig 6B), bulk flow in HEK293A cells was robust, as significant secretion of GH into the media of all constructs including EET-GH was observed (Fig 9B). In contrast, bulk flow processes were unable to traffic similar amounts of DSPP or AMELX out of HEK293A cells because they have much a higher propensity to interact and were entrapped within aggregates shortly after expression commenced. Thus, monomeric DSPP and AMELX proteins apparently rarely escape being added to aggregates to diffuse to ERES domain for bulk flow ER trafficking. Cotransfection of expression plasmids differing only in starting tripeptides (plus NTT motif for one of each pair, as noted) was performed in HEK293A cells. The low steady-state levels of APV and APV-NTT GH proteins within the cell lysates remain unchanged when coexpressed with nonbinding EET-GH or the modestly binding ITV-GH proteins (Fig 9A). Because levels of ITV-GH when coexpressed with APV-GH protein also remained unchanged, Surf4 is in functional excess in HEK293A cells. The combination of robust bulk flow and expression of functional excess Surf4 even during overexpression of cargo by transfection may explain why HEK293A cells have long been used for recombinant protein production.
When HEK293A cells were treated for a total of 54 hr with Surf4-directed small interfering RNA (siRNA), Surf4 levels were significantly depleted compared to transfection with control siRNA (top panel, Fig 9C). Eighteen hr before harvesting, cells were transfected a second time with equal amounts of expression plasmids for APV-GH and the more modestly binding ITVNTT-GH. Under these conditions, levels of Surf4 were sufficiently low to become limiting and cause higher steady-state levels of both APV-GH and ITVNTT-GH (bottom panel, Fig 9C). However, quantification of new steady-state levels of GH in Surf4 knockdown cells resulted in the ITV:APV ratio being significantly increased (P < 0.001), showing that APV-GH was preferentially trafficked when Surf4 was not in excess (Fig 9D).
Binding of GH with strong, modest, and poor enhanced-trafficking ER-ESCAPE motifs to Surf4 microsomes
True binding-constant analyses require purified proteins and robust reporting systems that produce proportional signals only while the two agents are bound together [29]. We were unable to solubilize and purify Surf4 protein that retained its high-affinity state for APV-GH, an outcome not uncommon for multipass membrane proteins. To obtain an estimate of Surf4’s affinity for GH, microsomes were made from Surf4KO cells without and with Surf4 expression by plasmid transfection. As expected if orientation of the microsomes were predominantly in ER’s original configuration (lumen = inside, cytosol = outside), microsomes from Surf4-expressing cells failed to exhibit preferential binding of 400 nM APV-GH (Lane 1) over the same amount of cargo lacking a functional ER-ESCAPE motif, EET-GH (Lane 2) (Fig 10A). Digitonin has long been used to selectively permeabilize cholesterol-enriched plasma membranes. Like other mammalian cells, digitonin-treated HEK293A released cytosolic proteins but only trace levels of soluble lumenal proteins associated with rER and QC domains (e.g., cyclophilin-B, CALR, S4 Fig) because of the low abundance of cholesterol in membranes of these two ER regions. To our knowledge, the amount of cholesterol in purified ERES membranes has not been directly measured, although acute depletion of cholesterol has been reported to cause loss of ER-to-Golgi trafficking of membrane proteins [30]. We found at least a portion of microsomes associated with Surf4 are effectively permeabilized by digitonin such that GH interacted with the necessarily lumenal domain of its cargo receptor. Therefore, additional aliquots of the two microsome preparations were treated with digitonin and assayed for binding. These showed a >5-fold increase in binding of strong ER-ESCAPE motif APV-GH (Fig 10A, Lane 3), while there was no significant increase in binding of EET-GH (Lane 4) by permeabilization. The increased binding of APV-GH to microsomes was dependent on Surf4 because microsomes made from nontransfected Surf KO cells had no significant change in binding of either GH construct, with or without digitonin treatment (Lanes 5–8).
To overcome difficulties of working with microsome-based binding assays dependent on multiple centrifugation steps, microsomes were bound to magnetic beads precoated with antibodies to carboxy-terminus (final cytosolic domain) of Surf4. Western blot analysis of microsomes associated with washed beads showed the presence of both Surf4 and ERES/COPII protein, Sec23 (Fig 10B insert), thus supporting microscopy localization of Surf4 in ERES domains. Indeed, there was a dose-dependent increase in binding of 400 nM APV-GH to microsomes as the concentrations of digitonin increased, plateauing at about 30 μg/ml. (Thirty μg/ml concentration of digitonin is similar to that reported by Holden and Horton [29] for Cytosol or Buffer 1, commercially available as Cytosol Extraction Buffer [CEB]) (Fig 10B). CEB was used for all subsequent experiments.
We next tracked the levels of GH retained by digitonin-treated microsomes when equal aliquots of beads were incubated with increasing amounts of APV-GH. GH ELISA results show a sigmoidal increase in APV-GH with half-maximal binding, suggesting a binding constant in the 200–300 nM range. The same concentrations of GH starting with a diacidic tripeptide (EET-GH) showed little retention by permeabilized microsomes (similar to that seen after treatment of permeabilized Surf4KO–derived microsomes), verifying apparent lack of affinity by this motif (Fig 10C).
Then, three GH isoforms starting with tripeptides corresponding to modest ability to enhance ER trafficking in vivo (FSM-GH, ISV-GH, and ITV-GH)—as well as a second, poorly trafficked acidic tripeptide, EEE-GH—were compared to the ability of APV-GH and EET-GH to bind to Surf4-associated microsomes. EET-GH and EEE-GH failed to bind Surf4 in permeabilized ERES microsome/beads (Fig 10D). These results correspond well to higher steady-state levels observed in live cell experiments discussed above for GH starting with two or more acidic amino acids. Also, significantly less FSM-GH, ISV-GH, and ITV-GH was retained by the microsomes compared to APV-GH (Fig 10D), results similarly predicted by their steady-state intracellular levels seen earlier in live-cell experiments.
Discussion
The vesicular nature of ER-to-Golgi trafficking was established by electron microscopy in the 1960s and 1970s [31]. Beginning in 1979 with Novick and Schekman [32], a series of papers identified temperature-sensitive secretion mutants in yeast that opened the door for dissection of underlying biochemical processes. These studies culminated in reports of transport vesicle genesis (COPII) from ER microsomes by addition of purified proteins [33, 34]. The default ER-to-Golgi pathway for soluble cargo proteins is to passively diffuse into the lumen of forming ER exit vesicles and subsequent release into the lumenal fluid of the ERGIC/Golgi (bulk flow). Early pulse-chase experiments of secreted proteins [35] likely underestimated bulk flow rates because of delays caused by chaperone-assisted folding, disulfide bond formation, etc. within rER and QC compartments. Thor and colleagues [36] engineered disulfide-free, glycosylation-free, and rapid, chaperone-independent folding domain of Semliki virus capsid protein as a model protein for bulk flow analysis. This elegant approach resulted in the fastest rate of protein transport reported for mammalian cells. However, the authors were unaware that addition of the HA-tag peptide (YPYDVPDYA) immediately after their leader sequence added a very effective ER-ESCAPE motif. In HEK293A, amino-terminal YPY tripeptide motif effectively trafficked our two most problematical proteins, DSPP and AMELX.
When ER-Golgi bulk flow cannot keep a protein’s concentration below its aggregation potential, use of a cargo receptor becomes necessary. Such cargo receptors should have two basic properties: (1) The binding affinity of a receptor for its cargo must be lower than the concentration at which the protein forms aggregates within the ER. In other words, a cargo receptor cannot effectively bind/traffic a protein until the cargo protein’s local concentration is high enough to significantly interact (≥ Kd). (2) Receptors must remain in functional excess by either always having an excess of cargo receptors made by the cell and/or by establishing a priority system by which the most problematical proteins can preferentially leave the ER.
Approximately 70% of secreted proteins may acquire cotranslational asparagine-linked (N-linked) oligosaccharide modifications [37]. The discovery that ERGIC-53/LMAN1’s lumenal domain exhibited mannose-binding lectin properties [38] suggested a solution for ER trafficking of many proteins. Indeed, blood-clotting proteins Factor V and Factor VIII are incorrectly trafficked in patients with ERGIC-53/LMAN1 mutations [39]. However, ERGIC-53/LMAN1 was later shown to have a lumenal, soluble helper protein—multiple coagulation factor deficiency protein 2 (MCFD2)—for the clotting factors [40], suggesting that a protein-related motif can be required for this receptor [41]. Alpha1-antitrypsin’s use of ERGIC-53/LMAN1 was also shown to be protein-conformation dependent [42]. Pro-matrix metalloproteinase-9 (proMMP-9) was recently reported to use ERGIC-53/LMAN1 [43]. The presence of a predicted high-affinity ER-ESCAPE motif for proMMP-9 throughout vertebrate evolution (Fig 1) suggests this protease can use either ERGIC-53/LMAN1 or Surf4 for ER trafficking. Therefore, as appealing as use of N-linked oligosaccharides alone for trafficking may be, surprisingly few examples have come to light. Indeed, in our hands, a human GH isoform that is unable to interact with Surf4 was not efficiently trafficked even when a N-linked oligosaccharide was added to this normally nonglycosylated protein.
Direct evidence for receptor-enhanced ER trafficking of soluble cargo using conserved protein/peptide motifs is similarly rare. Belden and Barlowe [23] showed yeast vacuole carboxypeptidase Y (CPY, PRC1) and mating factor alpha-1 (mfα-1, MAT1) required Erv29p for enhanced ER trafficking. They also showed more pro-mfα-1 trafficked into exit vesicles when purified COPII proteins were added to wild-type ER microsomes, compared to microsomes lacking Erv29p, and that 35S-labeled pro-mfα-1 could be crosslinked to Erv29p in microsomes. Caldwell and colleagues [24] also reported Erv29p-enhanced trafficking of CPY as well as a second vacuole protease, PrA (PEP4). What is the cargo motif/domain that interacts with Erv29p? S. cerevisiae mfα-1 has three N-linked oligosaccharides spaced along its 64-amino-acid prepro-domain. Caplan and colleagues [44] showed that successive and combinatorial loss of these modifications resulted in progressive but incomplete loss of hormone secretion. Deletions within this domain of amino acids 23–37 or 29–63 (both left intact the ER-ESCAPE motif at amino acids 20–22) also caused significant but incomplete loss of hormone processing/secretion. Later, Otte and Barlowe [45] focused on a 25-amino-acid domain (residues 29–53) using a series of alanine substitutions to determine the efficiency of trafficking of 35S-labeled pro-mfα-1 in the microsome/COPII vesicle in vitro assay. The authors proposed that three spaced (large) hydrophobic amino acids (I39, L42, and V52) together constituted pro-mfα-1’s motif that binds to Erv29p. The authors noted that Erv29p-trafficked protein, pro-CPY, had similar hydrophobic amino acids at same spacing in two of three positions. However, for yeast’s Pry1p and Pry2p, only two of these six primary sequence positions are hydrophobic. Furthermore, in 2009, Rakestraw and colleagues [46] performed mutagenesis on the mfα-1 prepro domain before fusing it to a single-chain antibody sequence and selecting for increased secretion rates (3- to 10-fold over wild type). Four of their eight positive constructs had several mutations that included changes in I39, L42, or V52 (to alanine or a polar/hydrophilic amino acids), with their best construct including both L42S and V52A. More recently, Lin-Cereghino and colleagues [47] reported effects of a series of deletions and alanine substitutions within the mfα-1 prepro domain fused to horseradish peroxidase expressed in Pichia pastoris. Deletions of amino acids 30–43 increased the amount of peroxidase activity secreted into the media, although replacement of all 14 amino acids with alanines reduced total secreted activity below that of controls. Therefore, it remains unknown at this time how these three large, hydrophobic amino acids (I39, L42, and V52) in the pro domain that played a role in enhanced trafficking of yeast’s pro-mfα-1 can be extended to other cargo proteins.
Using results of our experiments, we predict that acidic (pI = 4.5) pro-mfα-1 from S. cerevisiae with its classic Φ-P-Φ tripeptide, APV, has high affinity for Surf4/Erv29p. A limited search for starting tripeptides of pro-mfα-1 in Saccharomycetales (S3 Table) shows 36 starting with Φ-P-Φ (24 APV, 8 alanine-proline-isoleucine [API], plus IPV, valine-proline-alanine [VPA], and alanine-proline-alanine [APA]), 4 alanine-proline-threonine [APT], 4 alanine-isoleucine-alanine [AIA], and 1 threonine-alanine-isoleucine [TAI]). One tripeptide our prediction model would suggest has modest to poor binding to ERV29p/Surf4 is pro-mfα-1 for Candida glabrata with a QPV motif. This yeast is widely thought to be asexual although recently reported to have an intact mating gene set as well as a pattern of genetic sequences, suggesting a limited sexual cycle [48]. Perhaps the amount of mating factor secreted by this species is low, and the species has lost its requirement for cargo receptor–assisted ER exit. P. kudriavzevii maintains an mfα-1 starting with an APV motif (S3 Table) but is otherwise lacking the pro-domain. Pro-mfα-1’s motif is similar to two other acidic (pI = 4.3), secreted yeast proteins Pry1p (APA or APV depending on species) and Pry2p (APV), both of which in our experiments trafficked well in wild-type yeast cells but remained predominantly in the ER of erv29Δ cells. Lacking a proline or arginine in the number 2 position, the starting tripeptides of yeast proteins CPY (isoleucine-serine-leucine [ISL] ≈ ISV-GH) and PrA (KVH-GH) had more modest binding in our GH assay than APV-GH, but both still had sufficient affinity for the receptor to explain the reported low ER steady-state levels in wild-type yeast when compared to erv29Δ cells [23, 24]. Invertase (starting with serine-methionine-threonine [SMT] and repeatedly reported not to use Erv29p) we predict to be a weak or nonbinding cargo protein for Erv29p. One indication that proline in amino-terminal number 2 position could be related to trafficking of at least Golgi/ER-resident proteins in higher eukaryotes was in 2009 when Tsukumo and colleagues [49] used a series of alanine substitutions within the first nine amino acids (after removal of signal peptide) of nucleobindin-1 (NUCB1). They showed the P28A mutant of this normally Golgi-resident protein had increased localization in the ER, decreased Golgi-related modifications, and decreased secretion. They also reported three other ER/Golgi-resident proteins (reticulocalbin-1, calumenin, and 45 kDa Ca2+-binding protein [Cab45]) were more highly localized to ER when number 2 prolines were changed to alanines.
In 2001, Caldwell and colleagues [24] noted that Surf4 was a human homolog of Erv29p sharing approximately 30% amino acid identity (S3 Fig). Surf4 is one of four genes closely spaced in the Surfeit cluster of higher eukaryotic species [26]. These genes have no known sequence relationships and were proposed to be housekeeping genes because of their expression in a number of differentiated mouse cell lines as well as the presence of unmethylated, CpG-rich islands in their 5′ domains [25]. Human Surf4 protein sequence was first described in 1995 by Reeves and Fried [27] as a 30 kDa membrane protein with up to seven predicted transmembrane domains and an ER-like cytolocalization consistent with a proposed dilysine ER-retrieval motif near the carboxy-terminal end of the protein. In 2008, Mitrovic and colleagues [50] showed HeLa cell Surf4 predominantly colocalized with ERGIC-53-associated structures with some costaining with early Golgi domains. The authors also verified that Surf4 was likely an ER-Golgi-ER cycling protein by replacing Surf4’s three near-carboxy-terminal lysines with serines, causing the protein to accumulate in the Golgi. We found similar immunocytological localization although with much more ERES expression and weblike structures around ERES for both wild-type and HA-tagged Surf4 in HEK293A cells. Our results extended these observations to note that Surf4 was generally not abundant in rER or QC domains. We support the COPI-retrieval motif hypothesis by showing mutation of two of three carboxy-terminal lysines to alanines caused both a significant increase cis-Golgi accumulation and failure to rescue efficient trafficking of specific cargo in Surf4KO cells. Mitrovic and colleagues also noted that siRNA knockdown of Surf4 to <10% of normal had no effect on the ER-associated degradation (ERAD) of the Z mutant (E366K) of alpha1-antitrypsin, leading them to argue against Surf4 functioning in higher eukaryotic species (as Erv29p was proposed by Caldwell and colleagues [24] to do for yeast) by trafficking misfolded proteins into the Golgi for subsequent degradation. While we have no comment on the ultimate role Surf4 may play in ERAD of misfolded proteins, it is noteworthy that for Caldwell and colleagues’ misfolded proteins, CPY* and PrA*, the proteins both apparently started with their functional ER-ESCAPE motifs discussed in the above paragraph, possibly retaining their ability to be trafficked out of yeast ER by Erv29p. The mutant α1-antitrypsin construct, however, apparently retained its nonbinding diacidic-starting tripeptide (glutamic acid–aspartate acid–proline [EDP]), rendering it unlikely to interact with Surf4 in either its native or misfolded forms. To our knowledge, no one has directly shown that the homolog of Erv29p, Surf4, is a functioning ER cargo receptor enabling more efficient trafficking of soluble proteins to the ERGIC/Golgi in higher eukaryotic cells. Indeed, Mitrovic and colleagues [50] noted that knockdown of Surf4 by siRNA in HeLa cells caused no significant changes in total 35S-methionine-labeled protein secretion.
After showing yeast Erv29p used the ER-ESCAPE motif to enhance trafficking of acidic proteins out of the ER, we next used CRISPR/Cas9 technology to delete the human homolog, SURF4, in HEK293A cells. The failed trafficking of our two most problematical matrix proteins, the Ca2+-binding DSPP and self-assembling AMELX, in Surf4KO cells was rescued by coexpression of Surf4 for wild-type cargo proteins but not when the cargo included a motif-damaging acidic amino acid (IPD-DSPP and EPL- AMELX). Because Erv29p rescued ER trafficking of cargo in the Surf4KO cells, it appears that many major functions of this receptor are conserved across all eukaryotes. The ER-ESCAPE motif for Surf4 is limited to the first three amino-terminal amino acids because introduction of bulky N-linked oligosaccharide to the fourth amino acid of GH did not hinder enhanced ER trafficking. This same cotranslational modification is naturally found on the fourth amino acid of yeast’s classic Erv29p cargo protein, pro-mfα-1 [51, 52], and may ensure the amino-terminus is readily available for receptor binding. Based on this work, use of a high-affinity ER-ESCAPE motif (IPV, RSV, etc.) directly after a strong leader sequence is highly recommended for production of secreted recombinant proteins in eukaryotic cell expression systems. When an amino-terminal antibody tag is required, an HA-tag directly after the leader sequence offers the benefit of a very strong ER-ESCAPE motif, YPY.
The Golgi KDEL receptor binds the carboxy-terminal tetrapeptide KDEL (sometimes, HDEL) of escaped ER-resident soluble proteins such as chaperones and protein-modifying enzymes, returning them to the ER [53]. Our estimated binding constant of Surf4 for a strong Φ-P-Φ ER-ESCAPE motif was 200–300 nM, an affinity similar to 78–200 nM reported for the KDEL receptor [53]. For the energetically valuable process of scavenging escaped chaperones, it makes sense such proteins would have a single or narrow range of high-affinity motifs for receptor binding. In contrast, a range of peptide motifs with different binding affinities is of benefit for prioritizing the ordered ER exit of problematic proteins.
Our current model consists of several points. First, each of the three amino-terminal amino acids contribute to affinity of a cargo protein for the binding pocket located in lumenal domain(s) of Surf4/Erv29p (Fig 11A). Second, each cargo can bind only when its local concentration is bigger than or equal to its binding constant (Fig 11B). Cargo with high affinity may be quickly removed from the ER lumenal fluid, while proteins with more modest affinity ER-ESCAPE motifs bind and release more frequently while being processed for entry into the COPII vesicle. Some cargo (e.g., collagens too large to fit into standard-sized exit vesicles) start with tripeptides with no affinity for Surf4 and are directed to specific exit vesicles lacking Surf4. Small soluble cargo proteins (e.g., chaperones) with no affinity for Surf4 exit the ER only by diffusion into the lumenal fluid of vesicles (bulk flow) (Fig 11).
The chaperone CALR is reported to be secreted to some extent into the extracellular environment [54, 55] perhaps due, in part, to its low- but positive-affinity ER trafficking by Surf4. However, a more interesting consequence of CALR having a low-affinity ER-ESCAPE motif (Fig 1) may be in controlling binding of Surf4 cargo in the rER and QC regions. CALR’s high concentration in the rER/QC domains may outcompete most soluble cargo proteins for newly synthesized Surf4 such that the receptor does not interfere with protein folding. However, low levels of Surf4 in the rER and QC domains may bind high-affinity motifs of the most problematic proteins (e.g., DSPP), even in the presence of high CALR levels, and keep cargo from aggregating within rER/QC lumen. Furthermore, disordered proteins like DSPP and OPN lack appropriate hydrophobic domains used in proposed “ratcheting” mechanisms whereby BiP/GRP78 binds to early hydrophobic folding domains of cargo as they are translocated into the rER and keeps the growing protein from diffusing back out of translocon pores and into the cytosol [56, 57]. In this model, Surf4 could substitute for BiP by binding to the high-affinity ER-ESCAPE motif on these disordered proteins early in translocation. Our Surf4 siRNA experiments support the concept that when Surf4/Erv29p is not locally in excess, cargo with higher-affinity motifs (i.e., those most likely to form damaging aggregates) are preferentially trafficked out of the ER (Fig 12).
Surf4/Erv29p are multipass proteins, with the number of transmembrane domains predicted/reported ranging from four to seven [24, 27, 58]. Surf4/Erv29p’s tripeptide-binding pocket may result from cooperative three-dimensional alignment of two or more lumenal domains. By analogy, binding of extracellular ligands causes some plasma membrane–associated G-protein-coupled seven-transmembrane receptors to undergo conformational changes, resulting in corresponding realignment of cytosolic domains and binding/activation of G-alpha proteins [59]. Thus, binding of cargo to Surf4/Erv29p could result in conformational changes in its transmembrane domains that reorient cytosolic domains and enable interaction with COPII proteins such as Sec24 (Fig 12A). Of course, the opposite is an equally attractive model whereby only after Surf4/Erv29p interacts with a COPII-associated protein such as Sec24 does formation of the high-affinity, ER-ESCAPE motif–binding pocket occur (Fig 12B). Only then could ER-ESCAPE motif–presenting cargo proteins bind to Surf4/Erv29p and preferentially join developing exit vesicles. Future studies defining binding and functional domains of Surf4/Erv29p will help us understand the order in which cargo binding and COPII vesicle formation occur, as well as how the receptor releases its cargo after leaving the ERES.
In summary, a cell’s ER-to-ERGIC/Golgi bulk flow process can be insufficient for keeping specific abundantly secreted proteins’ ER concentration sufficiently low to prevent premature or inappropriate aggregate formation. We have shown that functionally conserved ERES transmembrane protein Surf4 enhances trafficking efficiency of several problematic cargo proteins by binding to their amino-terminal tripeptides. The combinatorial chemical composition of the amino-terminal three amino acids results in different Surf4-binding affinities and therefore the corresponding steady-state levels of each cargo within the ER lumen. The lower the protein’s concentration that can result in aggregate formation, the higher affinity the starting tripeptide evolutionally acquired and maintained. Generally, Φ-P-Φ tripeptides (with arginine substitutions in any position) result in high-affinity binding and associated low steady-state ER levels. In contrast, soluble proteins that should not enter standard COPII vesicles (chaperones and large fibrillar collagens) generally have amino-terminal tripeptides with no Surf4 affinity by including glutamine or acidic amino acids in the motif. Whenever local Surf4/Erv29p levels become limiting, the differing binding affinities automatically prioritize exit of the high-affinity cargo over proteins with lower-affinity motifs. At this time, interactions of the 8,000 possible permutations of three starting amino acids with Surf4 are too complex to be unfailingly predicted by our proposed ER-ESCAPE-motif formula (particularly for modest-/low-affinity motifs). However, using a combination of the current formula, as well as verification of the tripeptide’s conservation among the homologous proteins of many species, can be a useful indication of the strength of protein–Surf4 (Erv29p) interactions. Future high-throughput binding experiments of all 8,000 tripeptides in conjunction with protein database bioinformatics research of all known ER-translocated soluble proteins (with solution-available amino-termini) will result in more comprehensive prediction models.
Materials and methods
Cell culture and transfection
HEK293A (Thermo Fisher Scientific) and Surf4KO HEK293A cells (see following subsection, Generation of Surf4KO HEK293A cells) were maintained in Dulbecco's Modified Eagle Medium (DMEM, Thermo Fisher Scientific) supplemented with 10% fetal bovine serum (Sigma), GlutaMAX (2 mM, Thermo Fisher Scientific), penicillin 100 (IU/mL), and streptomycin (100μg/mL). Expression constructs were transfected into cells using Lipofectamine 2000 reagent (Thermo Fisher Scientific). Briefly, 5×105 cells/well were plated into 6-well plates. After overnight culture (60%–80% confluent), cells were transfected with 2μg of plasmid DNA preincubated with 4μL Lipofectamine 2000 in Opti-MEM (Thermo Fisher Scientific). Transient expression of proteins was analyzed 16–24 hr posttransfection as noted.
Generation of Surf4KO HEK293A cells
Surf4KO HEK293A cells were generated under contract by Applied StemCell using CRISPR/Cas9 technology by targeting two sites in exon-3 of human Surf4, with two independent gRNA (S2A Fig). The genomic deletion was confirmed in three independent single-cell clones—clone 5.3, clone 5.5, and clone 5.19—by PCR (S2B Fig) and sequencing of TOPO clones (S2C Fig). After preliminary confirmation of similar results by all three clones, clone 5.3 was selected for subsequent experiments. KO of Surf4 protein was confirmed by immunoblot using affinity-purified rabbit antibody to Surf4 (S4CT, against carboxy-terminal peptide: CALGPGGVSMDEKKKEW).
Expression constructs
Plasmid vectors for wild-type and mutant DSPP (including 6xFLAG-tag, when noted) were previously described [5]. Full-length human AMELX (variant-3) was amplified by high-fidelity PCR from cDNA (OriGene Technologies) and subcloned into Gateway pENTR/D-TOPO (Thermo Fisher Scientific). AMELX, variant-1, the most abundant variant in human enamel, was generated by deleting exon-4 of variant-3 and used in all experiments. Both Myc-tag at carboxy-terminus and first tripeptide mutations EPL, ISV, and FSM were introduced in AMELX by Q5 Site-Directed Mutagenesis (Q5 Site-Directed Mutagenesis Kit; NEB) using NEB’s web-based oligonucleotide design program. Full-length human growth hormone-1 (hGH) was amplified by high-fidelity PCR of cDNA (TransOMIC Technologies) and subcloned into Gateway pENTR/D-TOPO. First tripeptide mutations and fourth position NTT N-linked oligosaccharide motif mutations were introduced by Q5 Site-Directed Mutagenesis. Full-length human Surf4 (isoform-1) was obtained by high-fidelity PCR of cDNA (TransOMIC Technologies) and subcloned into Gateway pENTR/D-TOPO. HA-tag was introduced to amino-terminus of Surf4 immediately after start methionine by Q5 Site-Directed Mutagenesis. Full-length PRY1, PRY2, and ERV29 genes in S. cerevisiae were amplified by high-fidelity PCR of genomic DNA and cloned into Gateway pENTR/D-TOPO. Amino-terminal HA-tag was introduced to plasmid for ERV29 after Ser37 by Q5 Site-Directed Mutagenesis. Amino-terminal 2×HA-tag and carboxy terminal Myc-tag were introduced by Q5 Site-Directed Mutagenesis to plasmids for PRY1 and PRY2. Then, 2×HA-tag was inserted after Thr27 for Pry1p and Thr26 for Pry2p. APD-Pry1 and APD-Pry2 were introduced by Q5 Site-Directed Mutagenesis to 2XHA-tagged plasmids for PRY1 and PRY2. All pENTR clones were shuttled into Gateway pT-Rex-DEST 30 eukaryotic expression vector (DSPP, AMELX, GH, Surf4, and ERV29) or Gateway pYES-Dest52 S. cerevisiae expression vector (PRY1, PRY2, DSPP) (Thermo Fisher Scientific), or pAG425GPD-ccdB (a gift from Susan Lindquist, Addgene plasmid # 14154) for ERV29. Plasmids were confirmed in every case by Sanger DNA sequencing.
Yeast media and strains
The S. cerevisiae strains are listed in S4 Table. Transformation was performed using S.c. EasyComp kit (Thermo Fisher Scientific). Isolated colonies were selected and grown in 2% (w/v) raffinose and synthetic drop-out medium (without uracil; without both uracil and leucine; or without histidine, leucine, tryptophan, and uracil) depending on plasmid-selection needed (S5 Table) for approximately 48 hr until OD600 ≈ 2–3. Cultures were diluted to OD600 = 0.4 and induced in same drop-out medium containing 2% (w/v) galactose and 1% (w/v) raffinose for 5 hr before harvesting. Cells were lysed with Y-PER Yeast Protein Extraction Reagent (Thermo Fisher Scientific). Total protein concentration was determined by Pierce BCA protein assay (Thermo Fisher Scientific). Conditioned media were concentrated by Amicon Ultra-4 Centrifugal Filter Units, 10 kDa cutoff (Millipore Sigma). All yeast cultures were grown at 30°C with shaking at 200 rpm.
Immunoblotting
Rabbit antiserum against DSP portion of mouse DSPP (LF-153) was as described [60]. Other primary antibodies used for western analysis include mouse anti-Flag M2 (1:1,000, Sigma, Cat #: F1804), mouse anti-Myc (1:1,000, Cell Signaling Tech. Cat #: 2276), mouse anti-β-Actin (1:1,000, Sigma, Cat #: A1978), rabbit anti-HA (1:1,000, Cell Signaling Tech. Cat #: 3724), goat anti-hGH (1:500, Santa Cruz Biotech. Cat #: sc-10365), goat anti-SEC23 (1:200, Abcam, Cat #: AB99522), mouse anti-CALR (FMC75) (1:1,000, Calbiochem, Cat #: 208912), rabbit anti-cyclophilin B (1:1,000, Abcam, Cat #: ab16045) and custom-prepared and affinity-purified rabbit anti-S4CT (0.5 μg/ml). Secondary antibodies include IRDye 680 donkey anti-rabbit IgG, IRDye 800 donkey anti-mouse IgG, and IRDye 680 donkey anti-goat IgG (1:20,000, LI-COR Biosciences).
Whole-cell lysates
Cells were harvested and lysed in NP-40 lysis buffer (1% Nonidet P-40 [Calbiochem], 50 mM Tris-HCl [pH 8.0], 100 mM NaCl, 2 mM EDTA, and cOmplete EDTA-free protease inhibitor cocktail [Roche Holding AG]). DM lysis buffer (0.5% n-Dodecyl-α-D-Maltopyranoside [Anatrace], 50 mM Tris-HCl [pH 8.0], 100 mM NaCl, 2 mM EDTA and cOmplete EDTA-free protease inhibitor cocktail) was used to solubilize Surf4. Total protein concentration was determined by Pierce BCA protein kit (Thermo Fisher Scientific). Media were concentrated by Amicon Ultra-4 Centrifugal Filter Units, 10 kDa cutoff. Lysates or concentrated conditioned media were added to NuPAGE LDS sample buffer (Thermo Fisher Scientific) and incubated at 70°C for 10 min before electrophoresis. Surf4 preparations were treated at RT overnight in NuPAGE LDS sample buffer containing 4 M urea. Equal amounts of total protein of cell lysates or conditioned media volumes were electrophoresed on NuPAGE 4%–12% Bis-Tris PAGE gels in MOPS Buffer (Thermo Fisher Scientific) and transferred onto Immunobilon-FL membranes (Millpore). PageRuler Prestained NIR Protein Ladder is used as MW marker (Thermo Fisher Scientific). Membranes were blocked in Odyssey Blocking Buffer (LI-COR Biosciences), followed by incubation with indicated antibodies overnight in PBS with 1% Tween 20 (PBS-T). After 4× 5 min washes in PBS-T, blots were incubated for 1 hr at RT with noted species-specific IRDye-conjugated secondary antibody in PBS-T plus 0.02% SDS. After 3 final washes, blots were quantified by densitometry using LI-COR Odyssey infrared imaging system.
RT-PCR analysis of XBP-1 mRNA splicing
Total RNA was extracted from HEK293A cells 52 hr after transfection or 5 hr of tunicamycin (2 μg/mL) treatment using an RNeasy Kit (Qiagen). Aliquots of 1 μg of total RNA were reverse-transcribed with SuperScript III First-Strand Synthesis System, PCR-amplified with Platinum Taq DNA polymerase (both from Thermo Fisher Scientific) using primers corresponding to nucleotides 374–395, (5′-GCCTTGTAGTTGAGAACCAGGA-3′) and 623–642 (5′-TGACTGGGTCCAAGTTGTCC-3′) of human XBP-1, resolved on a 6% TBE gel.
Deglycosylation and protein analysis
Indicated cell lysates and cultured media containing noted N-linked glycosylated GH proteins were treated with Endoglycosidase H or PNGase F (New England BioLabs). Briefly, 9 μl of sample containing either 3 μg of lysate protein or 6% of concentrated media was denatured with 1 μL 10×Glycoprotein Denaturing buffer at 95°C for 10 min. Then, 10×GlycoBuffer-3 plus 250 units of Endo H, or 10× GlycoBuffer-2 + NP-40 and 250 units of PNGase F, was added and incubated at 37°C for 2 hr.
Immunocytochemistry
Primary antibodies used include mouse anti-HA (1:200, Abcam, Cat #: AB18181), rabbit anti-HA (1:200, Cell Signaling Tech. Cat #: 3724), goat anti-SEC23 (1:50, Abcam, Cat #: AB99522), rabbit anti-Giantin (1:1,000, BioLegend, Cat #: 924302), mouse anti-calnexin (1:200, Thermo Fisher Scientific, Cat #: MA3-027), mouse anti-ERGIC-53 (1:100, Enzo, Cat #: ENZ-ABS300-0100), goat anti-SEC61α (1:100, Santa Cruz Biotech. Cat #: sc-12322), or rabbit anti-S4CT (2.5 μg/ml) antibodies. Secondary antibodies (1:500, Thermo Fisher Scientific) used in these experiments include Alexa Fluor 568-conjugated goat anti-mouse, Alexa Fluor 488-conjugated goat anti-rabbit, Alexa Fluor 568-conjugated donkey anti-goat, Alexa Fluor 488-conjugated donkey anti-rabbit, Alexa Fluor 568-conjugated donkey anti-goat, Alexa Fluor 488-conjugated donkey anti-mouse, and Alexa Fluor 488-conjugated with Alexa Fluor 568-conjugated goat anti-rabbit.
Eight-well Nunc Lab-Tek II-CC2 Chamber Slide was pretreated for 1 hr with 500 μl of 15 μg/ml fibronectin (Sigma) per well before brief washing with PBS. Wild-type or Surf4KO HEK293A cells were plated at 40,000 cells/well. After 15 hr, cells in Opti-MEM were transfected with 0.2μg of plasmid DNA prebound to 0.4μL Lipofectamine-2000. Eighteen hr posttransfection, cells were fixed for 10 min using 4% paraformaldehyde/PBS and then permeabilized with 0.2% saponin (Sigma) in LI-COR Odyssey Blocking Buffer-PBS for 2 hr at RT. Primary antibody was diluted in blocking buffer containing 0.2% saponin. Cells were incubated in primary antibody overnight at 4°C, washed 3× in PBS, and incubated for 45 min at RT with indicated secondary antibodies in PBS. Cells were washed 3x in PBS and mounted with VECTASHIELD antifade mounting medium with DAPI (Vector Laboratories). Images were obtained using a Zeiss LSM 780 Confocal microscope (63×/1.40 oil objective, Carl Zeiss) or Zeiss Axio Imager Z1 with Apotome 2 (single Z stack slice, 63×/1.40 oil or 100×/1.40 oil objective, Carl Zeiss). Images were analyzed using Zeiss’ Zen software.
SEAP detection by Quanti-Blue assay
HEK293A and Surf4KO cells were transfected as describe above with pSELECT -zeo-SEAP (InvivoGen). Twenty-two hr posttransfection, 5μL conditioned media was assayed for SEAP activity using QUANTI-Blue (InvivoGen).
Luciferase-tagged LPO secretion detection by BioLux Gaussia Luciferase Assay
pGAUS3 plasmid for Gaussia luciferase-tagged LPO (VL5) was a gift from Peter Burbelo, NIDCR/NIH. First tripeptide mutations RSV and IPV were introduced in LPO by Q5 Site-Directed Mutagenesis (Q5 Site-Directed Mutagenesis Kit; NEB). HEK293A and Surf4KO cells were separately transfected with wild-type LPO (QTT) and two first tripeptide mutations (RSV-LPO and IPV-LPO). Cells and conditioned media were harvested 22 hr posttransfection. Washed cells were lysed in NP-40 lysis buffer containing cOmplete EDTA-free protease inhibitor cocktail. Total protein concentration was determined by Pierce BCA protein assay kit. Luciferase activity was determined using 5 μl of conditioned media or 5 μl cell lysate by BioLux Gaussia Luciferase Assay kit (NEB) following assay protocol II using Berthold Technologies CentroXS3 LB960 Luminometer. The Luciferase activities from cell lysates were normalized to Luciferase units/mg protein.
DSPP and AMELX intracellular aggregation detection
HEK293A Surf4KO cells at about 70% confluency in 6-well cell plates were transfected with IPV-DSPP expression plasmid (as above) and harvested 24 hr later. Cell pellet was treated with CEB from Subcellular Protein Fractionation Kit (Thermo Fisher Scientific) in +/− 10 mM CaCl2 on ice for 10 min and centrifuged at >100,000 x g for 10 min (A100/18 rotor in Beckman Airfuge). Ten percent of concentrated supernatant and 10% of pellet were analyzed by western blot. HEK293A Surf4KO cells were transfected with MPL-AMELX expression plasmid (as above) and harvested 24 hr later. Cell pellet was treated with MEB from Subcellular Protein Fractionation Kit on ice for 10 min and centrifuged at >100,000 x g for 10 min (A100/18 rotor in Beckman Airfuge). Ten percent of concentrated supernatant and 10% of pellet were analyzed by western blot.
hGH ELISA for transfected HEK293A cell lysates
HEK293A cells were separately transfected with 60 expression plasmids encoding noted changes in post-leader sequence tripeptide of hGH. Each experiment included both positive (IPV-GH) and negative (EET-GH) trafficking controls. Cells were harvested 18 hr posttransfection with 0.5% trypsin (3 min); the reaction was stopped by washing with PBS, followed by single-cell suspension in 10% fetal bovine serum in DMEM. Cells were briefly washed in PBS, pelleted, and lysed in NP-40 lysis buffer containing cOmplete EDTA-free protease inhibitor cocktail. Total protein concentration was determined by Pierce BCA protein assay. Concentration of GH in cell lysates was determined in triplicate in each experiment by hGH ELISA (IBL America, Cat #: IB19101). GH concentrations from all cell lysates were calculated by standard curve (GraphPad Prism 7, Graphpad Software) and normalized to standard total protein concentration (ng GH/mg protein). The amount of GH present in 59 non-IPV variants of starting tripeptides was normalized to concentrations of control levels of IPV-GH associated with that transfection series. Data represent means ± SEM of at least 3 transfection experiments for each GH variant. (Means of triplicate hGH ELISA analyses were used for each extract).
siRNA transfection
Silencer Select predesigned siRNA for human Surf4 and Silencer Select Negative Control No. 1 siRNA. Surf4 was knocked down using 5 nM of both 5′-AGAAUGAUGCAGCAUUAAAtt-3′ as sense and 5′-UUUAAUACUGCAUCAUUCUtt-3′ as antisense oligo with Lipofectamine RNAiMAX transfection reagent (all from Thermo Fisher Scientific). ITVNTT-GH and APV-GH were cotransfected 36 hr later into same HEK293A cells. Cells were harvested 18 hr later and processed for Surf4 and GH western blot following procedures noted above.
Microsome preparation and GH-binding assay
Tripeptide variant GH proteins (APV, ITV, ISV, FSM, EEE, and EET) were prepared from conditioned media collected 24 hr after transfection with corresponding GH plasmids into HEK293A cells. Concentrations were measured by hGH ELISA and adjusted to noted concentrations by dilution into KHMC buffer.
Crude microsome preparation
Surf4KO cells, either untreated or transfected for 24 hr with 12 μg of HA-Surf4 expression plasmid preincubated with 24 μl Lipofectamine 2000 in Opti-MEM, were released from 100 mm culture dishes, suspended in 6 ml of ice-cold hypotonic buffer (5 mM EDTA, 20 mM HEPES, pH 8.0, cOmplete EDTA-free protease inhibitor cocktail), and homogenized with a 7 mL Dounce glass homogenizer using a tight pestle (Wheaton Industries) until >90% of cells were ruptured. Homogenates were cleared of large debris by centrifugation at 6,000 ×g (10 min, supernatant) and then 60 min at 100,000 ×g (pellets). The microsomes were suspended in KHMC buffer by repeated pipetting. All above steps were conducted on ice or at 4°C. Half of the microsome preparation was pelleted and treated with CEB from Subcellular Protein Fractionation Kit for 30 min on ice before resuspension to starting volume with KHMC buffer. Microsomes, plus or minus CEB treatment, were equally aliquoted into separate experimental tubes and centrifuged at >100,000 x g for 10 min (A100/18 rotor in Beckman Airfuge). Pellets were suspended in 175 μl of 400 nM APV-GH or EET-GH media in KHMC buffer for 1 hr on ice. After >100,000 xg centrifugation (10 min), pellets were suspended for 10 min in 1 mg/ml APVNTTGGGC peptide to release bound GH. After final 10 min >100,000 xg centrifugation, amount of GH in supernatant was assayed using hGH ELISA kit.
ERES-associated microsome preparation using Surf4 antibody-coated magnetic beads
HEK293A cells were transfected with HA-Surf4 expression plasmid as above. Homogenates prepared by methods described above were centrifuged at 6,000 xg for 10 min to clear cell debris and supernatants collected for following studies. Surf4-enriched microsomes were prepared using 10 μg affinity-purified rabbit anti-Surf4-CT prebound to each 50 μl aliquot of magnetic Protein G-Dynabeads (Thermo Fisher Scientific). Briefly, microsome-containing supernatants (containing protease inhibitors) were gently mixed by rotation with washed anti-Surf4-Dynabeads for 1 hr at RT. Dynabeads were recovered using a magnetic particle separator and washed 3 times in KHMC buffer. Where noted, aliquoted microsome/beads were treated with CEB or 10–40 μg/ml digitonin (SIGMA, Cat # D141) in KHMC buffer for 30 min at RT. Final microsome/beads were quickly washed 3 times with KHMC buffer and recovered using a magnetic particle separator. For estimation of APV-GH binding constant, equal aliquots (in triplicate) of CEB-treated microsome-Dynabeads were incubated/tumbled with 0 nM, 100 nM, 200 nM, 300 nM, 400 nM, or 800 nM of APV-GH or EET-GH for 1 hr at RT. For relative binding levels of APV-GH, ITV-GH, ISV-GH, FSM-GH, EEE-GH, or EET-GH to microsome/beads, 400 nm of each GH variant was added to equal aliquots (in triplicate) of CEB-permeabilized microsome-Dynabeads in KHMC buffer and incubated for 1 hr with RT tumbling. (Note that one triplicate experiment involved incubation of 400 nM APV-GH with an equal aliquot of microsome/beads not treated with CEB.) Microsome/beads were then briefly washed twice with KHMC buffer and GH eluted by APVNTTGGGC peptide (1 mg/ml) for quantification by ELISA assay.
Statistical analysis
Statistical analyses were performed using GraphPad Prism 7 (Graphpad Software). One-way ANOVA analysis was carried out at IBM SPSS software. Results are expressed as mean±SEM.
The numerical data used in all figures are included in S1 Data.
Supporting information
Acknowledgments
DNA sequencing was by NIDCR Combined Technical Research Core (ZIC-DE000729-09).
Abbreviations
- Φ-P-Φ
hydrophobic-proline-hydrophobic
- Φ-S-Φ
hydrophobic-serine-hydrophobic
- AAK
alanine-alanine-lysine
- AMELX
amelogenin, X-linke
- AMBN
ameloblastin
- AMTN
amelotin
- APA
alanine-proline-alanine
- APD
alanine-proline-aspartic acid
- API
alanine-proline-isoleucine
- APT
alanine-proline-threonine
- APV
alanine-proline-valine
- AIA
alanine-isoleucine-alanine
- BiP
binding immunoglobulin protein
- BMP1
bone morphogenetic protein 1
- BSP
bone sialoprotein
- Cab45
45 kDa Ca2+-binding protein
- CALR
calreticulin
- Cas9
CRISPR-associated 9
- CEB
Cytosol Extraction Buffer
- COL1A1
collagen type 1 alpha 1
- COL2A1
collagen type 2 alpha 1
- COL3A1
collagen type 3 alpha 1
- COL6A1
collagen type 6 alpha 1
- COL6A2
collagen type 6 alpha 2
- COL6A3
collagen type 6 alpha 3
- COL7A1
collagen type 7 alpha 1
- COPII
coat protein complex II
- CPY
carboxypeptidase Y
- CRISPR
clustered regularly interspaced short palindromic repea
- DD
dentin dysplasia
- DGI
Dentinogenesis Imperfecta
- DKR
aspartic acid-lysine-arginine
- DMEM
Dulbecco's Modified Eagle Medium
- DMP1
dentin matrix acidic phosphoprotein 1
- DPP
dentin phosphoprotein
- DSP
dentin sialoprotein
- DSPP
dentin sialophosphoprotein
- DYP
aspartic acid-tyrosine-proline
- EDP
glutamic acid–aspartic acid–proline
- EEE
glutamic acid–glutamic acid–glutamic acid
- EEI
glutamic acid–glutamic acid–isoleucine
- EET
glutamic acid-glutamic acid-threonine
- EGT
glutamic acid-glycine-threonine
- Emp24p
24 kDa endomembrane protein
- EMP46p
46 kDa endomembrane protein
- Emp47p
47 kDa endomembrane protein
- ENAM
enamelin
- EPA
glutamic acid-proline-alanine
- EPL
glutamic acid-proline-leucine
- EPT
glutamic acid-proline-threonine
- EPV
glutamic acid-proline-valine
- ER
endoplasmic reticulum
- ERAD
ER-associated degradation
- ERES
ER exit site
- ER-ESCAPE motif
ER-Exit by Soluble Cargo using Amino-terminal Peptide-Encoding motif
- ERGIC
ER-Golgi intermediate compartment
- ERO1
ER oxidoreductase 1
- Erp1p
erythrocyte protein 1
- Erp2p
erythrocyte protein 2
- Erv14p
ER-derived vesicles protein 14
- Erv15p
ER-derived vesicles protein 15
- Erv25p
ER-derived vesicles protein 25
- Erv26p
ER-derived vesicles protein 26
- Erv29p
ER-derived vesicles protein 29
- Erv46p
ER-derived vesicles protein 46
- EST
glutamic acid-serine-threonine
- F-GRP78
fungal GRP78
- FGT
phenylalanine-glycine-threonine
- FLT
phenylalanine-leucine-threonine
- FPE
phenylalanine-proline-glutamic acid
- FPM
phenylalanine-proline-methionine
- FPR
phenylalanine-proline-arginine
- FPT
phenylalanine-proline-threonine
- FPV
phenylalanine-proline-valine
- FQV
phenylalanine-glutamine-valine
- FSA
phenylalanine-serine-alanine
- FSM
phenylalanine-serine-methionine
- FST
phenylalanine-threonine-valine
- FTV
phenylalanine-threonine-valine
- FVN
phenylalanine-valine-asparagine
- GH
growth hormone
- GPI
glycophosphatidylinositol
- GPV
glycine-proline-valine
- GRP78
glucose-related protein 78
- GRP94
glucose-related protein 94
- HA
hemagglutinin
- HEK293A
human embryonic kidney cell line 293A
- hGH
human growth hormone-1
- HSV
histidine-serine-valine
- IEV
isoleucine-glycine-valine
- IGV
isoleucine-glycine-valine
- IIP
isoleucine-isoleucine-proline
- ILV
isoleucine-leucine-valine
- INV
isoleucine-asparagine-valine
- IPA
isoleucine-proline-alanine
- IPD
isoleucine-proline-aspartic acid
- IPE
isoleucine-proline-glutamic acid
- IPP
isoleucine-proline-proline
- IPR
isoleucine-proline-arginine
- IPS
isoleucine-proline-serine
- IPV
isoleucine-proline-valine
- IRE1
inositol-requiring enzyme 1
- IRV
isoleucine-arginine-valine
- ISH
isoleucine-serine-histidine
- ISL
isoleucine-serine-leucine
- ISP
isoleucine-serine-proline
- ISQ
isoleucine-serine-glutamine
- ISR
isoleucine-serine-arginine
- ISV
isoleucine-serine-valine
- ITV
isoleucine-threonine-valine
- KAV
lysine-alanine-valine
- Kex2p
kexin 2
- KGV
lysine-glycine-valine
- KO
knockout
- KSV
lysine-serine-valine
- KVH
lysine-valine-histidine
- LMAN1
lectin mannose-binding 1
- LPO
lactoperoxidase
- LPO-Gluc
LPO with carboxy-terminal luciferase
- MCFD2
multiple coagulation factor deficiency protein 2
- mDSP
mouse DSP
- mfα-1
mating factor alpha-1
- MPL
methionine-proline-leucine
- NCBI
National Center for Biotechnology Information
- NPV
asparagine-proline-valine
- NSP
asparagine-serine-proline
- NTT
asparagine-threonine-threonine
- NUCB1
nucleobindin-1
- OPN
osteopontin
- PBS-T
PBS with 1% Tween 20
- PDIA2
protein disulfide isomerase family A member 2
- PDIA4
protein disulfide isomerase family A member 4
- PrA
yeast aspartate protease
- proMMP-9
pro-matrix metalloproteinase-9
- PRY
pathogen-related yeast
- PTH
parathyroid hormone
- QC
quality control
- QEE
glutamine-glutamic acid-glutamic acid
- QPV
glutamine-proline-valine
- QQQ
glutamine-glutamine-glutamine
- QQV
glutamine-glutamine-valine
- QSV
glutamine-serine-valine
- QTT
glutamine-threonine-threonine
- rER
rough ER
- Rer1p
retention of ER proteins 1
- RGV
arginine-glycine-valine
- RLV
arginine-leucine-valine
- RPA
arginine-proline-alanine
- RPG
arginine-proline-glycine
- RPK
arginine-proline-lysine
- RPL
arginine-proline-leucine
- RPV
arginine-proline-valine
- RRR
arginine-arginine-arginine
- RSV
arginine-serine-valine
- SEAP
secreted alkaline phosphatase
- SIBLING
small integrin-binding ligand, N-linked glycoprotein
- siRNA
small interfering RNA
- SLT
serine-leucine-threonine
- SMT
serine-methionine-threonine
- SPT
serine-proline-threonine
- SPV
serine-proline-valine
- SRT
serine-arginine-threonine
- SST
serine-serine-threonine
- Surf4
surfeit locus protein 4
- TANGO1
transport and Golgi organization 1
- TAI
threonine-alanine-isoleucine
- TPV
threonine-proline-valine
- VPA
valine-proline-alanine
- XBP-1
X-box-binding protein 1
- YPD
tyrosine-proline-aspartic acid
- YPY
tyrosine-proline-tyrosine
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
NIH/NIDCR (grant number 1ZIADE000074-45). This research was supported by the Intramural Research Program of the NIH, NIDCR. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Uhlen M, Fagerberg L, Hallstrom BM, Lindskog C, Oksvold P, Mardinoglu A, et al. Proteomics. Tissue-based map of the human proteome. Science. 2015;347(6220):1260419 Epub 2015/01/24. 10.1126/science.1260419 . [DOI] [PubMed] [Google Scholar]
- 2.Vitale A, Denecke J. The endoplasmic reticulum-gateway of the secretory pathway. Plant Cell. 1999;11(4):615–28. Epub 1999/04/24. ; PubMed Central PMCID: PMCPMC144197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Hurtley SM, Helenius A. Protein oligomerization in the endoplasmic reticulum. Annu Rev Cell Biol. 1989;5:277–307. Epub 1989/01/01. 10.1146/annurev.cb.05.110189.001425 . [DOI] [PubMed] [Google Scholar]
- 4.Barlowe C, Helenius A. Cargo Capture and Bulk Flow in the Early Secretory Pathway. Annu Rev Cell Dev Biol. 2016;32:197–222. 10.1146/annurev-cellbio-111315-125016 . [DOI] [PubMed] [Google Scholar]
- 5.von Marschall Z, Mok S, Phillips MD, McKnight DA, Fisher LW. Rough endoplasmic reticulum trafficking errors by different classes of mutant dentin sialophosphoprotein (DSPP) cause dominant negative effects in both dentinogenesis imperfecta and dentin dysplasia by entrapping normal DSPP. J Bone Miner Res. 2012;27(6):1309–21. 10.1002/jbmr.1573 ; PubMed Central PMCID: PMC3390460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat Methods. 2011;8(10):785–6. 10.1038/nmeth.1701 . [DOI] [PubMed] [Google Scholar]
- 7.Kall L, Krogh A, Sonnhammer EL. A combined transmembrane topology and signal peptide prediction method. J Mol Biol. 2004;338(5):1027–36. 10.1016/j.jmb.2004.03.016 . [DOI] [PubMed] [Google Scholar]
- 8.von Marschall Z, Fisher LW. Dentin sialophosphoprotein (DSPP) is cleaved into its two natural dentin matrix products by three isoforms of bone morphogenetic protein-1 (BMP1). Matrix Biol. 2010;29(4):295–303. 10.1016/j.matbio.2010.01.002 ; PubMed Central PMCID: PMC2862847. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.McKnight DA, Fisher LW. Molecular evolution of dentin phosphoprotein among toothed and toothless animals. BMC Evol Biol. 2009;9:299 10.1186/1471-2148-9-299 ; PubMed Central PMCID: PMC2803795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Fisher LW, Torchia DA, Fohr B, Young MF, Fedarko NS. Flexible structures of SIBLING proteins, bone sialoprotein, and osteopontin. Biochem Biophys Res Commun. 2001;280(2):460–5. 10.1006/bbrc.2000.4146 . [DOI] [PubMed] [Google Scholar]
- 11.Nam AS, Yin Y, von Marschall Z, Fisher LW. Efficient trafficking of acidic proteins out of the endoplasmic reticulum involves a conserved amino terminal IleProVal (IPV)-like tripeptide motif. Connect Tissue Res. 2014;55 Suppl 1:138–41. 10.3109/03008207.2014.923852 ; PubMed Central PMCID: PMC4153534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Termine JD, Belcourt AB, Christner PJ, Conn KM, Nylen MU. Properties of dissociatively extracted fetal tooth matrix proteins. I. Principal molecular species in developing bovine enamel. J Biol Chem. 1980;255(20):9760–8. . [PubMed] [Google Scholar]
- 13.Margolis HC, Beniash E, Fowler CE. Role of macromolecular assembly of enamel matrix proteins in enamel formation. J Dent Res. 2006;85(9):775–93. 10.1177/154405910608500902 . [DOI] [PubMed] [Google Scholar]
- 14.Bromley KM, Kiss AS, Lokappa SB, Lakshminarayanan R, Fan D, Ndao M, et al. Dissecting amelogenin protein nanospheres: characterization of metastable oligomers. J Biol Chem. 2011;286(40):34643–53. 10.1074/jbc.M111.250928 ; PubMed Central PMCID: PMC3186391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zhang X, Ramirez BE, Liao X, Diekwisch TG. Amelogenin supramolecular assembly in nanospheres defined by a complex helix-coil-PPII helix 3D-structure. PLoS One. 2011;6(10):e24952 Epub 2011/10/11. 10.1371/journal.pone.0024952 ; PubMed Central PMCID: PMCPMC3184955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Clackson T, Ultsch MH, Wells JA, de Vos AM. Structural and functional analysis of the 1:1 growth hormone:receptor complex reveals the molecular basis for receptor affinity. J Mol Biol. 1998;277(5):1111–28. 10.1006/jmbi.1998.1669 . [DOI] [PubMed] [Google Scholar]
- 17.Hauri HP, Kappeler F, Andersson H, Appenzeller C. ERGIC-53 and traffic in the secretory pathway. J Cell Sci. 2000;113 (Pt 4):587–96. . [DOI] [PubMed] [Google Scholar]
- 18.Muniz M, Nuoffer C, Hauri HP, Riezman H. The Emp24 complex recruits a specific cargo molecule into endoplasmic reticulum-derived vesicles. J Cell Biol. 2000;148(5):925–30. ; PubMed Central PMCID: PMC2174538. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Saito K, Chen M, Bard F, Chen S, Zhou H, Woodley D, et al. TANGO1 facilitates cargo loading at endoplasmic reticulum exit sites. Cell. 2009;136(5):891–902. 10.1016/j.cell.2008.12.025 . [DOI] [PubMed] [Google Scholar]
- 20.Saito K, Yamashiro K, Ichikawa Y, Erlmann P, Kontani K, Malhotra V, et al. cTAGE5 mediates collagen secretion through interaction with TANGO1 at endoplasmic reticulum exit sites. Mol Biol Cell. 2011;22(13):2301–8. 10.1091/mbc.E11-02-0143 ; PubMed Central PMCID: PMC3128532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Choudhary V, Schneiter R. Pathogen-Related Yeast (PRY) proteins and members of the CAP superfamily are secreted sterol-binding proteins. Proc Natl Acad Sci U S A. 2012;109(42):16882–7. 10.1073/pnas.1209086109 ; PubMed Central PMCID: PMC3479496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Herzig Y, Sharpe HJ, Elbaz Y, Munro S, Schuldiner M. A systematic approach to pair secretory cargo receptors with their cargo suggests a mechanism for cargo selection by Erv14. PLoS Biol. 2012;10(5):e1001329 10.1371/journal.pbio.1001329 ; PubMed Central PMCID: PMC3358343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Belden WJ, Barlowe C. Role of Erv29p in collecting soluble secretory proteins into ER-derived transport vesicles. Science. 2001;294(5546):1528–31. 10.1126/science.1065224 . [DOI] [PubMed] [Google Scholar]
- 24.Caldwell SR, Hill KJ, Cooper AA. Degradation of endoplasmic reticulum (ER) quality control substrates requires transport between the ER and Golgi. J Biol Chem. 2001;276(26):23296–303. 10.1074/jbc.M102962200 . [DOI] [PubMed] [Google Scholar]
- 25.Williams T, Yon J, Huxley C, Fried M. The mouse surfeit locus contains a very tight cluster of four "housekeeping" genes that is conserved through evolution. Proc Natl Acad Sci U S A. 1988;85(10):3527–30. ; PubMed Central PMCID: PMC280245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Duhig T, Ruhrberg C, Mor O, Fried M. The human Surfeit locus. Genomics. 1998;52(1):72–8. 10.1006/geno.1998.5372 . [DOI] [PubMed] [Google Scholar]
- 27.Reeves JE, Fried M. The surf-4 gene encodes a novel 30 kDa integral membrane protein. Mol Membr Biol. 1995;12(2):201–8. . [DOI] [PubMed] [Google Scholar]
- 28.Buszczak M, Paterno S, Lighthouse D, Bachman J, Planck J, Owen S, et al. The carnegie protein trap library: a versatile tool for Drosophila developmental studies. Genetics. 2007;175(3):1505–31. Epub 2006/12/30. 10.1534/genetics.106.065961 ; PubMed Central PMCID: PMCPMC1840051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Holden P, Horton WA. Crude subcellular fractionation of cultured mammalian cell lines. BMC Res Notes. 2009;2:243 10.1186/1756-0500-2-243 ; PubMed Central PMCID: PMC2802353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Ridsdale A, Denis M, Gougeon PY, Ngsee JK, Presley JF, Zha X. Cholesterol is required for efficient endoplasmic reticulum-to-Golgi transport of secretory membrane proteins. Mol Biol Cell. 2006;17(4):1593–605. Epub 2006/02/03. 10.1091/mbc.E05-02-0100 ; PubMed Central PMCID: PMCPMC1415298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Palade G. Intracellular aspects of the process of protein synthesis. Science. 1975;189(4200):347–58. . [DOI] [PubMed] [Google Scholar]
- 32.Novick P, Schekman R. Secretion and cell-surface growth are blocked in a temperature-sensitive mutant of Saccharomyces cerevisiae. Proc Natl Acad Sci U S A. 1979;76(4):1858–62. ; PubMed Central PMCID: PMC383491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Barlowe C, Orci L, Yeung T, Hosobuchi M, Hamamoto S, Salama N, et al. COPII: a membrane coat formed by Sec proteins that drive vesicle budding from the endoplasmic reticulum. Cell. 1994;77(6):895–907. . [DOI] [PubMed] [Google Scholar]
- 34.Otte S, Belden WJ, Heidtman M, Liu J, Jensen ON, Barlowe C. Erv41p and Erv46p: new components of COPII vesicles involved in transport between the ER and Golgi complex. J Cell Biol. 2001;152(3):503–18. ; PubMed Central PMCID: PMC2195992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Lodish HF, Kong N, Snider M, Strous GJ. Hepatoma secretory proteins migrate from rough endoplasmic reticulum to Golgi at characteristic rates. Nature. 1983;304(5921):80–3. . [DOI] [PubMed] [Google Scholar]
- 36.Thor F, Gautschi M, Geiger R, Helenius A. Bulk flow revisited: transport of a soluble protein in the secretory pathway. Traffic. 2009;10(12):1819–30. 10.1111/j.1600-0854.2009.00989.x . [DOI] [PubMed] [Google Scholar]
- 37.Apweiler R, Hermjakob H, Sharon N. On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database. Biochim Biophys Acta. 1999;1473(1):4–8. . [DOI] [PubMed] [Google Scholar]
- 38.Arar C, Carpentier V, Le Caer JP, Monsigny M, Legrand A, Roche AC. ERGIC-53, a membrane protein of the endoplasmic reticulum-Golgi intermediate compartment, is identical to MR60, an intracellular mannose-specific lectin of myelomonocytic cells. J Biol Chem. 1995;270(8):3551–3. . [DOI] [PubMed] [Google Scholar]
- 39.Nichols WC, Seligsohn U, Zivelin A, Terry VH, Hertel CE, Wheatley MA, et al. Mutations in the ER-Golgi intermediate compartment protein ERGIC-53 cause combined deficiency of coagulation factors V and VIII. Cell. 1998;93(1):61–70. . [DOI] [PubMed] [Google Scholar]
- 40.Zhang B, Cunningham MA, Nichols WC, Bernat JA, Seligsohn U, Pipe SW, et al. Bleeding due to disruption of a cargo-specific ER-to-Golgi transport complex. Nat Genet. 2003;34(2):220–5. 10.1038/ng1153 . [DOI] [PubMed] [Google Scholar]
- 41.Appenzeller-Herzog C, Nyfeler B, Burkhard P, Santamaria I, Lopez-Otin C, Hauri HP. Carbohydrate- and conformation-dependent cargo capture for ER-exit. Mol Biol Cell. 2005;16(3):1258–67. 10.1091/mbc.E04-08-0708 ; PubMed Central PMCID: PMC551490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Nyfeler B, Reiterer V, Wendeler MW, Stefan E, Zhang B, Michnick SW, et al. Identification of ERGIC-53 as an intracellular transport receptor of alpha1-antitrypsin. J Cell Biol. 2008;180(4):705–12. 10.1083/jcb.200709100 ; PubMed Central PMCID: PMC2265576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Duellman T, Burnett J, Shin A, Yang J. LMAN1 (ERGIC-53) is a potential carrier protein for matrix metalloproteinase-9 glycoprotein secretion. Biochem Biophys Res Commun. 2015;464(3):685–91. 10.1016/j.bbrc.2015.06.164 ; PubMed Central PMCID: PMC4697752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Caplan S, Green R, Rocco J, Kurjan J. Glycosylation and structure of the yeast MF alpha 1 alpha-factor precursor is important for efficient transport through the secretory pathway. J Bacteriol. 1991;173(2):627–35. ; PubMed Central PMCID: PMC207053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Otte S, Barlowe C. Sorting signals can direct receptor-mediated export of soluble proteins into COPII vesicles. Nat Cell Biol. 2004;6(12):1189–94. 10.1038/ncb1195 . [DOI] [PubMed] [Google Scholar]
- 46.Rakestraw JA, Sazinsky SL, Piatesi A, Antipov E, Wittrup KD. Directed evolution of a secretory leader for the improved expression of heterologous proteins and full-length antibodies in Saccharomyces cerevisiae. Biotechnol Bioeng. 2009;103(6):1192–201. 10.1002/bit.22338 ; PubMed Central PMCID: PMC2847895. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Lin-Cereghino GP, Stark CM, Kim D, Chang J, Shaheen N, Poerwanto H, et al. The effect of alpha-mating factor secretion signal mutations on recombinant protein expression in Pichia pastoris. Gene. 2013;519(2):311–7. 10.1016/j.gene.2013.01.062 ; PubMed Central PMCID: PMC3628533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Carrete L, Ksiezopolska E, Pegueroles C, Gomez-Molero E, Saus E, Iraola-Guzman S, et al. Patterns of Genomic Variation in the Opportunistic Pathogen Candida glabrata Suggest the Existence of Mating and a Secondary Association with Humans. Curr Biol. 2018;28(1):15–27 e7. 10.1016/j.cub.2017.11.027 ; PubMed Central PMCID: PMC5772174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Tsukumo Y, Tsukahara S, Saito S, Tsuruo T, Tomida A. A novel endoplasmic reticulum export signal: proline at the +2-position from the signal peptide cleavage site. J Biol Chem. 2009;284(40):27500–10. 10.1074/jbc.M109.021592 ; PubMed Central PMCID: PMC2785679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Mitrovic S, Ben-Tekaya H, Koegler E, Gruenberg J, Hauri HP. The cargo receptors Surf4, endoplasmic reticulum-Golgi intermediate compartment (ERGIC)-53, and p25 are required to maintain the architecture of ERGIC and Golgi. Mol Biol Cell. 2008;19(5):1976–90. 10.1091/mbc.E07-10-0989 ; PubMed Central PMCID: PMC2366877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Kurjan J, Herskowitz I. Structure of a yeast pheromone gene (MF alpha): a putative alpha-factor precursor contains four tandem copies of mature alpha-factor. Cell. 1982;30(3):933–43. . [DOI] [PubMed] [Google Scholar]
- 52.Julius D, Schekman R, Thorner J. Glycosylation and processing of prepro-alpha-factor through the yeast secretory pathway. Cell. 1984;36(2):309–18. . [DOI] [PubMed] [Google Scholar]
- 53.Wilson DW, Lewis MJ, Pelham HR. pH-dependent binding of KDEL to its receptor in vitro. J Biol Chem. 1993;268(10):7465–8. . [PubMed] [Google Scholar]
- 54.Somogyi E, Petersson U, Hultenby K, Wendel M. Calreticulin—an endoplasmic reticulum protein with calcium-binding activity is also found in the extracellular matrix. Matrix Biol. 2003;22(2):179–91. . [DOI] [PubMed] [Google Scholar]
- 55.De Almeida I, Oliveira NM, Randall RA, Hill CS, McCoy JM, Stern CD. Calreticulin is a secreted BMP antagonist, expressed in Hensen's node during neural induction. Dev Biol. 2017;421(2):161–70. 10.1016/j.ydbio.2016.12.001 ; PubMed Central PMCID: PMC5231319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Matlack KE, Misselwitz B, Plath K, Rapoport TA. BiP acts as a molecular ratchet during posttranslational transport of prepro-alpha factor across the ER membrane. Cell. 1999;97(5):553–64. . [DOI] [PubMed] [Google Scholar]
- 57.Miesbauer M, Pfeiffer NV, Rambold AS, Muller V, Kiachopoulos S, Winklhofer KF, et al. alpha-Helical domains promote translocation of intrinsically disordered polypeptides into the endoplasmic reticulum. J Biol Chem. 2009;284(36):24384–93. 10.1074/jbc.M109.023135 ; PubMed Central PMCID: PMC2782031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Foley DA, Sharpe HJ, Otte S. Membrane topology of the endoplasmic reticulum to Golgi transport factor Erv29p. Mol Membr Biol. 2007;24(4):259–68. 10.1080/09687860601178518 . [DOI] [PubMed] [Google Scholar]
- 59.Millar RP, Newton CL. The year in G protein-coupled receptor research. Mol Endocrinol. 2010;24(1):261–74. Epub 2009/12/19. 10.1210/me.2009-0473 ; PubMed Central PMCID: PMCPMC5428143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Ogbureke KU, Fisher LW. Expression of SIBLINGs and their partner MMPs in salivary glands. J Dent Res. 2004;83(9):664–70. 10.1177/154405910408300902 . [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.