ABSTRACT
In recent years, there has been increased appreciation that a whole category of proteins, small proteins of around 50 amino acids or fewer in length, has been missed by annotation as well as by genetic and biochemical assays. With the increased recognition that small proteins are stable within cells and have regulatory functions, there has been intensified study of these proteins. As a result, important questions about small proteins in bacteria and archaea are coming to the fore. Here, we give an overview of these questions, the initial answers, and the approaches needed to address these questions more fully. More detailed discussions of how small proteins can be identified by ribosome profiling and mass spectrometry approaches are provided by two accompanying reviews (N. Vazquez-Laslop, C. M. Sharma, A. S. Mankin, and A. R. Buskirk, J Bacteriol 204:e00294-21, 2022, https://doi.org/10.1128/JB.00294-21; C. H. Ahrens, J. T. Wade, M. M. Champion, and J. D. Langer, J Bacteriol 204:e00353-21, 2022, https://doi.org/10.1128/JB.00353-21). We are excited by the prospects of new insights and possible therapeutic approaches coming from this emerging field.
KEYWORDS: proteome, transmembrane proteins, protein evolution, small proteins
INTRODUCTION
For the purposes of this discussion, we consider “small proteins” to be proteins of 50 amino acids (aa) or fewer in length for which there is evidence that the small protein accumulates to sufficient levels for it to have a function. It should be noted that this definition is not strict, and we mention some proteins that are somewhat longer, as do other articles in this special issue. In general, the main defining and unifying characteristics of these proteins are their small size and the fact that they commonly have been missed. There are several interrelated reasons why small proteins have remained hidden. First, while automated genome annotation pipelines are adept at detecting long open reading frames (ORFs) in genome sequences, the false discovery rates balloon as the allowable length of an ORF drops (reviewed in reference 1). Consequently, many pipelines for protein annotation in bacteria ignore proteins smaller than 50 aa (reviewed in reference 2). Second, proteomic workflows similarly are optimized for large proteins such that the majority of small proteins are not retained, resolved, or detected unless specialized conditions are used. Similarly, most mass spectrometric methods rely on the ability to obtain multiple tryptic peptides from a single protein, which is not possible for small proteins (reviewed in reference 3). A compounding problem is that many of the search engines for peptide sequences are based on genome annotations, in which, as mentioned above, small proteins are underrepresented. The latter issue also impacts the identification of small protein genes by mutational screens.
Although their small sizes limited their identification and characterization for a long time, the increasing discovery of small proteins that are associated with mutant phenotypes or interact with larger proteins establishes a precedent for function. As such, overlooking an entire class of proteins means that investigators may be missing a whole level of regulation, critical structural components, as well as unique mechanisms of action. Until now, small proteins generally have been studied from perspectives and methods focused on large proteins. The recognition that there is a group of proteins missed by multiple approaches is an important step in discerning the diverse functions of these small proteins.
Peptide chains of 50 aa or fewer can be encoded by a defined short ORF (sORF) positioned at a variety of genetic locations (Fig. 1). They can be entirely separate sORFs, not overlapping other genes but encoded as part of operons or as independent transcripts (Fig. 1A). A subset of operonic sORFs are regulatory upstream ORFs (uORFs) (also denoted leader peptides) (Fig. 1B), whose translation impacts the expression of the downstream gene (reviewed in reference 4). sORFs also can be encoded antisense to a longer ORF or partially or fully overlapping a longer ORF on the sense strand (Fig. 1C). sORFs that are completely overlapped by a longer ORF, in the same or an alternate frame, are sometimes denoted embedded (or nested). Additionally, sORFs can be encoded on a small regulatory RNA, either overlapping or adjacent to base-pairing sequences (Fig. 1D). These protein-coding regulatory RNAs are often designated dual-function RNAs (reviewed in reference 5). Finally, small proteins can be cleaved from larger polypeptides (Fig. 1E). We refer to these as small peptides to distinguish this group from the independently translated small proteins. Here, we primarily focus on small proteins encoded as separate sORFs given that more of these small proteins have been characterized.
FIG 1.

Locations of small protein genes. (A to C) Small proteins can be encoded by independent sORFs (A), uORFs that act as regulators of the downstream gene (B), or sORFs encoded antisense to a larger ORF or embedded or nested within a larger ORF in the same or an alternative frame (C). (D) They also can be encoded on a regulatory small RNA that has a dual function as a base-pairing RNA and an mRNA encoding a small protein. (E) Functional small peptides can be cleaved from larger proteins.
While a 50-aa limit is useful as a working definition, slightly larger proteins that are unannotated thus far can be discovered and characterized with approaches developed for small proteins. It is not yet clear if there will be a lower size limit. Is there a size that should be considered too small? The surface area available for stable interaction with other molecules diminishes with size. Until now, the smallest characterized proteins, such as the antibacterial microcin C7 protein encoded as a separate sORF (6) and microbial signaling peptides processed from larger precursors (reviewed in reference 7), are 5 to 10 aa.
HOW CAN SMALL PROTEINS BE IDENTIFIED?
Investing in the development of approaches and standards for the identification of small proteins will facilitate small protein research and help to unify the field. Thus far, most identification has come from approaches used to identify larger proteins, although more and more improvements that allow better detection of small proteins are being developed.
While computational annotation for long ORFs can rely almost exclusively on the detection of long continuous coding potential, additional parameters are needed to mitigate the bioinformatic prediction problem of the inflated false discovery of sORFs. Thus, many computational searches for small proteins weight multiple parameters, which can include the presence of a ribosome binding site and amino acid conservation (8), although the latter parameter is progressively less effective the shorter the sORF. More and more computational searches also incorporate information from extensive RNA sequencing or mass spectrometric data sets currently available (9–11). One caution in using these data sets is that the transcription and even the translation of an sORF do not guarantee that a stable small protein with function is generated. Computational approaches are less effective at identifying candidate sORFs that are conserved in only a few bacteria or that overlap longer ORFs. Nevertheless, as the computational approaches continue to be refined, small protein genes can be predicted for any sequenced genome, with the caveat that the numbers of false-negative and false-positive results generally are unknown.
Ribosome binding to an RNA suggests that the transcript is being translated. Thus, ribosome profiling (also called Ribo-seq or ribosome footprinting), which allows ribosome binding to be interrogated on a genome-wide basis upon the isolation of translating ribosomes, is increasingly being used to identify small protein genes. As described in an accompanying review (12), modifications of the initial ribosome profiling protocols, such as treatment with antibiotics that enrich for initiation and termination complexes, have greatly improved this approach for identifying small proteins, although investigators who use the resulting data still need to be aware of limitations. An advantage of ribosome profiling is that the approach can identify actively translated ORFs in intergenic regions as well as provide evidence for the translation of antisense or embedded sORFs.
Improved methods for the direct detection of the translated polypeptide are also crucial, particularly because these may be the only approaches for identifying small peptides cleaved from larger translation products. Proteomic detection by mass spectrometry is the best method thus far for identifying both small proteins and small peptides, but again, the small sizes pose barriers to identification. The challenges related to small protein identification by mass spectrometry as well as recent developments in this field are discussed in another accompanying review (13). We anticipate that innovative developments in direct detection will have a significant impact on small protein research. While both ribosome profiling and proteomic detection can provide insights into differential expression for samples collected under various growth conditions, each approach has advantages and disadvantages. Ribosome profiling is considerably more sensitive than the proteomic approaches, while the proteomic approaches document that a small protein actually accumulates to a meaningful level.
It should be noted that the number of small proteins identified by ribosome profiling and direct detection will vary with the growth conditions, particular strain examined, or experimental setup, leading to potential differences among reports of the proteins. Related to this variation is the important question of how much validation is required to be confident that a small protein is synthesized and accumulates. Until now, the gold standard has been the independent detection of chromosomally encoded epitope-tagged small proteins. However, this gold standard may not be achievable for all legitimate bona fide small proteins predicted, as protein tagging is difficult in some organisms. Phenotypes associated with reduced or elevated levels of a small protein, if carried out with appropriate stop codon controls, can provide additional evidence for the existence of a small protein.
We advocate that in the absence of the detection of a chromosomally encoded tagged small protein, at least two lines of evidence be required for the annotation of a small protein gene. Thus, a small protein may be independently identified by conservation and ribosome profiling or ribosome profiling and proteomic approaches. We note that there is nuance in evaluating the quality of different types of evidence, however, and that appropriate quantitation, reproducibility, and control groups should be expected for all data sets used. Different discovery methods have the added value that they can provide distinct but often complementary information. For example, conservation is an indication of function but does not provide information about expression. On the other hand, mass spectrometric analysis shows that a small protein is stable enough to be detected but not whether it has function.
A corollary to the limitations in identifying translated sORFs and small proteins is ambiguity and variability with respect to information in databases. Currently, there are both under- and overannotation of small protein genes. Once new small proteins are stringently vetted for their synthesis as described above, the genes that encode them need to be added to genome annotations. There is value to also capturing information about small proteins for which corroborating evidence is not available as long as the level of confidence is clearly denoted. Websites collating evidence for small proteins are being developed (14), along with expanding databases of ribosome profiling and mass spectrometric data. We encourage the establishment of reliable, rigorous data sets that the small protein research community can continue to build on as this nascent field develops.
The approaches described for the discovery of small proteins and their genes will also lead to the identification of more uORFs. In fact, uORFs and sORFs cannot be distinguished in ribosome profiling data, other than through information about the location of the ORF (15). The identification of uORFs is an added benefit of ribosome profiling studies, and the encoded products of regulatory uORFs may have independent functions as small proteins (16).
The number of small proteins encoded by a genome is not known for any organism. At least 150 small proteins have been documented as chromosomally encoded tagged proteins for the well-studied organism Escherichia coli, but that is not the complete count, and only a fraction of these have been characterized sufficiently to know about function (reviewed in reference 2). Even less is known about what small proteins are synthesized or their functions in other organisms.
HOW CAN SMALL PROTEIN FUNCTIONS BE ELUCIDATED?
Along with the problems associated with the discovery of sORFs and small proteins, studies of the biological roles of small proteins come with several challenges. The first major issue is whether the product of a translated sORF has a function in the cell. It is possible that ribosomes are associated with an ORF as a result of noisy translation initiation or incidental translation. Alternatively, as mentioned above, the translation of the ORF itself may be regulatory in cis, while the product of translation has no function in trans.
The conservation of an sORF and also surrounding genes is one clue that the translated small protein might have a function, and information about conserved nearby genes can help predict that function. For example, the genes encoding toxic small proteins belonging to toxin-antitoxin (TA) systems generally are genetically linked to the genes encoding their antitoxin counterparts and thus can be identified through gene synteny (reviewed in reference 17). The value of searches for conserved sequences is limited, however, when the putative small protein is produced from an sORF embedded in a larger gene (18). Here, the analyses become muddled by the features of the larger gene product. Dual-function RNAs harboring a small protein-encoding sORF as well as a base-pairing sequence element, which can overlap the sORF, similarly are difficult to predict using bioinformatics (5). By definition, conservation and gene synteny analyses also cannot be used to detect functional strain-specific small proteins.
Other clues that translated small proteins have function come from in vivo and in vitro experiments. For instance, differential expression of the small protein and the mRNAs that encode them, such as induction in response to a specific stress, is a hint that the protein might have a function under that condition. This differential expression can be detected through the generation and examination of transcriptomic or proteomic data sets. Existing data sets can also be interrogated to detect these differentially expressed small proteins. However, as many small protein ORFs are currently lacking in standard genome annotations, these analyses necessitate remapping of previous data sets. Given that large transcriptome repositories such as the Gene Expression Omnibus (GEO) (19) currently hold thousands of published data sets, it is likely that mining existing data could provide new information on the expression of putative small protein genes. Such expression analyses can aid in the identification of environmental conditions under which the small protein of interest is produced and thus is more likely to carry out its biological function.
Phenotypes associated with either the overexpression of a small protein or, better yet, the deletion of the corresponding gene suggest biological functions and can provide important hints about these functions. Collections of overexpression plasmids or genome-wide mutational libraries such as transposon insertion sequencing (Tn-Seq) libraries (20) can be screened for a variety of growth phenotypes under various conditions. The effects of over- or underexpression can also be monitored by global transcriptome and proteome analyses. Again, it is worth noting some caveats. These approaches typically rely on existing annotation, and thus, unannotated small protein genes might evade analysis. Additionally, due to their size, mutation of small protein genes in Tn-Seq experiments requires a much higher transposon density than for conventionally sized genes in order to obtain the number of insertions necessary to achieve statistical confidence that the altered recovery of mutations in an sORF is meaningful. These approaches again are more challenging for genes where there is overlapping coding potential. Nonetheless, reevaluating existing data sets focusing on sORFs could promote a better understanding of small protein functions in various microbial species.
Once a candidate small protein has been identified, it is crucial to confirm that the protein accumulates in the cell before carrying out more detailed functional studies. To this end, various technologies are employed. Most frequently, the addition of an epitope tag to the N- or C-terminal end of the small protein at the chromosomal locus followed by Western blot analysis is used to validate small protein accumulation and further examine regulated expression (15). This approach comes with the advantage that the required genetic modifications are typically quick and straightforward. However, it is possible that the added tag interferes with or alters the expression and/or the function of the protein. While this is true for all proteins, it is particularly a concern for small proteins where even a small epitope tag is disproportionately large relative to the small protein and may obstruct interactions or significantly change the overall charge or hydrophobicity. As a consequence, it is important to ultimately test whether the expression of the tagged protein phenocopies the expression of the untagged small protein in complementation assays. These assays can be simplified by constructing reporters to genes whose expression is contingent on the function of the small protein. It may be necessary to test a variety of small protein derivatives with different types of tags and linker sequences. Alternatively, the small protein may need to be tagged through the introduction of a nonnatural amino acid. Antibodies specific to the small protein of interest would also be useful but are often difficult to obtain due to the limited number of epitopes present in a small protein (reviewed in reference 21).
The next steps in elucidating the biological role of a small protein usually involve determining its subcellular localization and identifying cellular interaction partners. Similar to the transcriptomic data sets mentioned above, small proteins typically are missing from large-scale protein localization and interaction studies, such as those carried out for E. coli (22, 23). Hence, independent analysis is usually required. Coimmunoprecipitation experiments using functional tagged small protein variants or specific antibodies have proven most successful for identifying interacting proteins that can lead to strong hypotheses regarding the cellular function of a small protein. Given the examples of small proteins that act as DNA or RNA binding factors (2, 24–26), copurification with nucleic acids should also be considered. In addition, it is conceivable that small proteins could act by binding to other cellular components such as lipids or small metabolites; however, no dedicated searches for such interactions have been reported. The unique advantage that small proteins can be chemically synthesized in their entirety can also be exploited in biochemical assays. Thus far, different combinations of experiments have led to the elucidation of individual small protein functions and are setting the stage for structural and further mutational studies for a mechanistic understanding of small protein mechanisms of action.
WHAT SMALL PROTEIN FUNCTIONS ARE KNOWN?
Several bacterial small proteins have now been functionally characterized. While we are not able to discuss all of these proteins in the context of this summary article, it is useful to mention examples that represent the general functional categories discovered thus far. A surprisingly large percentage of characterized small proteins act at the membrane. More detail about these small proteins is provided by another review in this issue (27).
One obvious category is the proteins that act as toxins, including some TA system toxins, microcins, phenol-soluble modulins like Staphylococcus alpha-hemolysin that act as a defense against host cells, and holin proteins produced by lytic bacteriophage (reviewed in references 28, to ,31). Many of these proteins are highly hydrophobic and contain an α-helical transmembrane domain. The overexpression of these transmembrane domain toxins typically results in depolarized membranes or membrane disruption, which has been attributed to the insertion of the small proteins into the cell membrane generating a pore or otherwise disrupting the membrane or membrane proteins.
Toxic proteins are not the only small proteins that consist of a transmembrane α-helix. A number of other small proteins found in the membrane are not toxic but rather associate with larger membrane proteins to regulate their levels or activities. It is not yet clear what features distinguish toxic α-helical transmembrane small proteins from those that are not toxic. The regulation of magnesium homeostasis in enteric bacteria illustrates how small nontoxic transmembrane proteins can modulate the levels of larger proteins. The 30-aa MgtR protein from Salmonella interacts with MgtA, MgtB, and MgtC (32, 33), which are inner membrane proteins required for adaptation to low magnesium concentrations, and induces their degradation by the FtsH protease (34). In contrast, the 28-aa MgtU protein stabilizes the same proteins and thus increases intercellular magnesium concentrations in Salmonella (32). MgtR and MgtU are not found in E. coli. However, in this organism, the 31-aa MgtS protein binds and stabilizes the MgtA magnesium transporter by blocking FtsH-mediated degradation (35). MgtS additionally binds and negatively regulates the PitA cation-phosphate symporter, resulting in increased intracellular magnesium (36).
Additional examples of transmembrane small proteins that modulate the activities of transporters are the 30-aa PmrR protein found in Salmonella and the more broadly distributed 49-aa AcrZ and 37-aa SgrT proteins. PmrR, whose expression is under the control of the iron (Fe3+)-responsive PmrAB two-component system (TCS), downregulates the activity of LpxT, an inner membrane protein that increases the negative charge of the lipopolysaccharide (LPS) (37). A less electronegative form of the LPS attracts fewer Fe3+ ions to the outer membrane, thereby dampening PmrAB activation as part of a negative-feedback loop. AcrZ associates with and modulates the activity of the AcrAB-TolC drug efflux pump, which confers resistance to many antibiotics (38). In conjunction with the surrounding membrane lipids, AcrZ has an allosteric effect on AcrB, leading to a conformational change that facilitates the transport of certain compounds (39, 40). E. coli SgrT specifically interacts with the unphosphorylated form of the PtsG glucose transporter inhibiting glucose uptake (41).
Other transmembrane small proteins appear to be integral parts of larger enzymatic protein complexes where they can have more of a structural role. Several different small proteins have been found to interact with cytochrome oxidase complexes. In E. coli, the 37-aa CydX and 29-aa CydH proteins are part of the cytochrome bd oxidase and likely assist in the folding and stability of the complex (42–44). Work on photosynthetic bacteria has also provided several examples of transmembrane small proteins affecting the activity of metabolic enzyme complexes (45), where they impact complex assembly (46, 47) as well as protein turnover (48, 49).
Another general category of small proteins is composed of those that interact with the membrane as amphipathic helices or through other membrane anchors and thus can serve as tethers to larger proteins. Here, the best-characterized example is the 26-aa SpoVM protein of Bacillus subtilis, which specifically binds curved membranes to recruit the SpoIVA protein during spore formation (50). Detailed structural and biophysical studies have given significant insights into how SpoVM recognizes membrane curvature (51–53). Another example in the human pathogen Listeria monocytogenes is the 31-aa Prli42 protein, which is predicted to be a tail-anchored membrane protein. Prli42 interacts with the stressosome complex, inducing the complex upon oxidative stress (54).
Small proteins do not need to act at the membrane to affect enzyme activity. In Synechocystis, the 44-aa AcnSP protein binds to, and stimulates the activity of, aconitase, thus impacting flux through the citric acid cycle (55), while the 51-aa PirA protein modulates global nitrogen metabolism by binding regulatory protein PII (56). Examples can also be found in archaea where, in Methanosarcina mazei, binding of the 23-aa sP26 protein to glutamine synthetase (GlnA1) is predicted to stabilize the dodecameric oligomer and might also promote the association of GlnK1 (57). Archaeal small proteins are described more extensively in another review in this issue (58).
In addition to regulating the activities of transporters and enzymatic complexes, small proteins can control the activities of other regulators. Another E. coli small protein impacting magnesium homeostasis is the 47-aa MgrB transmembrane protein, which modulates signaling by inhibiting the PhoQ sensor kinase part of the PhoPQ TCS (59). The mgrB gene itself is activated by PhoPQ, and thus, the small protein provides autoregulation to the PhoPQ TCS (60). We expect that other small proteins that modulate kinases will be identified.
Small proteins can also act by affecting the activity of transcription regulators. For example, small, secreted peptides (5 to 10 aa) produced by various Gram-positive bacteria bind to and activate transcription factors belonging to the RRNPP (named for prototypical members) protein family (reviewed in reference 7). These peptides are typically part of bacterial communication systems and are frequently processed from larger proteins by specific proteases. Intriguingly, a similar mechanism is employed by temperate phages to control lysis versus lysogeny (61). We anticipate that more proteins of fewer than 50 aa will be found to bind and modulate transcription factors. Additionally, it is likely that there are other small proteins that bind specifically to DNA or RNA and have independent regulatory functions, particularly since some well-studied transcription factors (the Cro repressor of bacteriophage λ is 66 aa) and RNA binding proteins (the global posttranscriptional regulator CsrA in E. coli is 61 aa) are relatively small. Finally, it is noteworthy that many core ribosomal proteins (r-proteins) as well as several r-proteins that are assembled into the ribosome at times of specific stresses fit our definition of small proteins (2, 62).
WHAT ARE THE FUNDAMENTAL PROPERTIES OF SMALL PROTEINS?
In conjunction with the need to identify functions for small proteins, much remains to be learned about the fundamental properties of these proteins. Some of these properties, such as how the proteins are localized within the cells, whether the proteins are posttranslationally modified, and how the small proteins are degraded, are the same questions asked about larger proteins. Even the fundamental question of the relative concentration of small proteins has not been answered. Here again, the small size poses challenges given the caveats associated with detection by custom antibodies and epitope tags. The basic properties that have been investigated most extensively are the subcellular localization and membrane translocation of small membrane proteins. Initial tagging with larger tags such as green fluorescent protein suggested that small E. coli proteins can have various orientations in the membrane (with the N terminus on either the cytoplasmic or periplasmic side of the membrane) and be translocated by several different mechanisms (63). More recent studies of the E. coli 27-aa YohP and 33-aa YkgR modified with much smaller tags and assayed in vitro showed that the posttranslational insertion of these proteins into the membrane requires the signal recognition particle (64).
Given that many small proteins identified thus far are thought to function as regulators induced under specific conditions, there need to be mechanisms to downregulate the levels or activities of the small proteins when they are no longer needed. This downregulation could be accomplished by the binding of small molecules, amino acid modifications, and regulated proteolysis. Unfortunately, most global approaches to monitor protein modification or proteolysis are blind to the small proteins. Hence, these techniques will need to be adapted to specifically capture small proteins before more conclusions can be drawn about how proteins of this class themselves are regulated.
Some questions about fundamental properties relate to features of small proteins that may differ from those of larger proteins. These include questions about the translation and folding of the small proteins. For example, are there any barriers to the release of a small protein from the ribosome if the protein is shorter than the length of the ribosome exit channel? In the case of nested ORFs, does the translation of the longer ORF affect the translation of the embedded shorter ORF or vice versa? It is also not yet clear whether or how small proteins fold independent of the molecules that they bind. For some approaches such as nuclear magnetic resonance (NMR) spectroscopy, small size is an advantage, and several groups have initiated studies to systematically examine the structures of small proteins in isolation (65). However, these structures need to be viewed with some caution in the absence of information about whether the small proteins examined in vitro are functional. Mutational analyses that test specific predictions should also be carried out to provide support for the physiological relevance of these structures.
Related to questions about structure is the expectation that small proteins will have a more limited number of protein folds. However, the range of small protein folds and whether some structural features are more likely to be associated with small proteins are not yet known given the limited data available. Most of the small protein structures solved in complex with their interacting partners until now correspond to α-helices (40, 44, 66). However, these helices still can have unusual features. For example, it is noteworthy that a proline residue in the middle of the transmembrane helix of AcrZ introduces a kink that is critical for AcrZ binding to the AcrB transporter (40).
HOW DO SMALL PROTEINS EVOLVE?
Another intriguing question is how small protein genes evolve (Fig. 2). Generally, small proteins are not broadly conserved, so it appears that small protein genes can evolve rapidly. This assumption is supported by studies in Saccharomyces cerevisiae, where small protein genes that have arisen de novo increase fitness (67). Some small protein genes may be derived from the degeneration of a larger protein gene where only a small segment of the original ORF and translation signals remain intact (Fig. 2A). Alternatively, it is possible that a small region of a larger protein, such as a transmembrane domain, is duplicated and acquires a separate function as well as the ability to be expressed. Systematic assessments of whether small protein components of larger complexes are fused to a larger protein in some organisms and comparisons of synteny across closely related organisms could help to distinguish between these possibilities.
FIG 2.
Models for the evolutionary emergence and establishment of small protein genes in bacterial genomes. (A) A large protein-coding gene (gray) is expressed from the combined activities of a transcriptional start site (raised black arrow) and initiation codon (M). Small protein genes can evolve from this large ORF through the degeneration of the large ORF leaving only the signal for the expression and synthesis of part of the larger protein. This may occur after the duplication of the original large ORF. The small protein gene can also result from the duplication of just part of the large ORF. In this case, the synthesis of the small protein from this partial duplication may require the evolution of signals for transcription and translation. (B) Stochastic evolution of promoter and translation signals can lead to the expression of small proteins from random ORFs found in the genomic sequence. (C) Similarly, the stochastic evolution of a new proteolytic cleavage site can lead to the generation of a functional small peptide.
Small proteins may also be encoded by sORFs that arise by the de novo expression of a formerly noncoding nucleic acid sequence (Fig. 2B). Transcriptomic studies indicate that much of a genome can be transcribed. Additionally, there is increasing evidence for pervasive translation (reviewed in reference 12). Thus, there is not a high barrier to the chance occurrence of a combination of features resulting in the incidental expression of a functional small protein that confers a selective advantage. Sequence changes that result in the generation of a new protease cleavage site similarly can lead to the generation of a new functional small peptide (Fig. 2C).
It is interesting to ponder observations about small proteins in the context of evolutionary considerations. For example, although E. coli and Salmonella are quite closely related, the MgtB and MgtU small proteins are found only in Salmonella, and the Salmonella homolog of MgtS does not appear to act in the same way as the E. coli protein (32), even though all three of the small proteins bind to the MgtA magnesium importer. Strikingly, when an expression library allowing the expression of diverse synthetic proteins was used in selection for resistance to the antibiotic colistin, the six de novo colistin resistance-conferring (Dcr) peptides identified were hydrophobic and interacted with and activated the PmrB sensor kinase of the PmrAB TCS (68). Both of these findings suggest that it may be relatively facile for cells to evolve new small proteins, particularly transmembrane small proteins, with advantageous functions.
An even more challenging task is understanding the evolution of sORFs nested within larger ORFs, particularly since fewer examples have been characterized (18). The availability of more and more related genome sequences allowing expanded, fine-scale genome comparisons tuned to detecting additional conservation in intergenic regions as well as within ORFs could be a starting point for addressing some of these questions.
HOW CAN SMALL PROTEINS BE EXPLOITED?
Some of the characterized small proteins have already been shown to have medical consequences: the TA small proteins impact the occurrence of bacterial cells that persist under antibiotic exposure (reviewed in reference 69), and the modulins manipulate host cells. Small proteins are being detected in mixed microbial communities such as the human microbiome, which could well affect the composition of the microbiome and the interaction with host cells (9, 70). Additionally, small proteins that regulate transporters or regulators of key metabolic processes such as magnesium homeostasis could have important roles during bacterial infection (71). Thus, future experiments with an eye toward exploiting small proteins, which have a greater potential to cross membranes, should be considered in new antibacterial strategies as well as for optimizing bacteria for biosynthetic purposes.
OUTLOOK
As the discovery and functional assessment of small proteins are gaining momentum, a multitude of interesting and important questions await answers. Addressing these questions will require appropriate vigilance regarding the limitations of current approaches as well as the development of new technologies and more extensive studies of small proteins in organisms beyond E. coli and Salmonella enterica. However, even in the absence of global approaches, which generally have been more challenging for small proteins, the characterization of an expanding number of individual small proteins by many researchers studying various organisms undoubtedly will reveal more commonalities, general principles, as well as novel mechanisms. We expect that innovations in methods and ideas will drive an exponential increase in exciting fundamental discoveries about small proteins.
ACKNOWLEDGMENTS
We thank A. Buskirk and J. Wade for their comments on this review.
Small protein research is supported by R01GM139277 in the Gray lab and by the Intramural Research Program of the Eunice Kennedy Shriver National Institute of Child Health and Human Development in the Storz lab. K.P. acknowledges support by the German Research Foundation (SPP2002 and EXC 2051, project ID 390713860) and the Vallee Foundation.
Contributor Information
Todd Gray, Email: todd.gray@health.ny.gov.
Gisela Storz, Email: storzg@mail.nih.gov.
Kai Papenfort, Email: kai.papenfort@uni-jena.de.
Tina M. Henkin, Ohio State University
REFERENCES
- 1.Andrews SJ, Rothnagel JA. 2014. Emerging evidence for functional peptides encoded by short open reading frames. Nat Rev Genet 15:193–204. 10.1038/nrg3520. [DOI] [PubMed] [Google Scholar]
- 2.Hemm MR, Weaver J, Storz G. 2020. Escherichia coli small proteome. EcoSal Plus 9:ESP-0031-2019. 10.1128/ecosalplus.ESP-0031-2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Chu Q, Ma J, Saghatelian A. 2015. Identification and characterization of sORF-encoded polypeptides. Crit Rev Biochem Mol Biol 50:134–141. 10.3109/10409238.2015.1016215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Turnbough CL, Jr. 2019. Regulation of bacterial gene expression by transcription attenuation. Microbiol Mol Biol Rev 83:e00019-19. 10.1128/MMBR.00019-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Raina M, King A, Bianco C, Vanderpool CK. 2018. Dual-function RNAs. Microbiol Spectr 6:RWR-0032-2018. 10.1128/microbiolspec.RWR-0032-2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.González-Pastor JE, San Millán JL, Moreno F. 1994. The smallest known gene. Nature 369:281. 10.1038/369281a0. [DOI] [PubMed] [Google Scholar]
- 7.Neiditch MB, Capodagli GC, Prehna G, Federle MJ. 2017. Genetic and structural analyses of RRNPP intercellular peptide signaling of Gram-positive bacteria. Annu Rev Genet 51:311–333. 10.1146/annurev-genet-120116-023507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Richardson EJ, Watson M. 2013. The automatic annotation of bacterial genomes. Brief Bioinform 14:1–12. 10.1093/bib/bbs007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sberro H, Fremin BJ, Zlitni S, Edfors F, Greenfield N, Snyder MP, Pavlopoulos GA, Kyrpides NC, Bhatt AS. 2019. Large-scale analyses of human microbiomes reveal thousands of small, novel genes. Cell Syst 178:1245–1259. 10.1016/j.cell.2019.07.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Venturini E, Svensson SL, Maaß S, Gelhausen R, Eggenhofer F, Li L, Cain AK, Parkhill J, Becher D, Backofen R, Barquist L, Sharma CM, Westermann AJ, Vogel J. 2020. A global data-driven census of Salmonella small proteins and their potential functions in bacterial virulence. Microlife 1:uqaa002. 10.1093/femsml/uqaa002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Miravet-Verde S, Ferrar T, Espadas-García G, Mazzolini R, Gharrab A, Sabido E, Serrano L, Lluch-Senar M. 2019. Unraveling the hidden universe of small proteins in bacterial genomes. Mol Syst Biol 15:e8290. 10.15252/msb.20188290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Vazquez-Laslop N, Sharma CM, Mankin AS, Buskirk AR. 2022. Identifying small open reading frames in prokaryotes with ribosome profiling. J Bacteriol 204:e00294-21. 10.1128/JB.00294-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ahrens CH, Wade JT, Champion MM, Langer JD. 2022. A practical guide to small protein discovery and characterization using mass spectrometry. J Bacteriol 204:e00353-21. 10.1128/JB.00353-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hao Y, Zhang L, Niu Y, Cai T, Luo J, He S, Zhang B, Zhang D, Qin Y, Yang F, Chen R. 2018. SmProt: a database of small proteins encoded by annotated coding and non-coding RNA loci. Brief Bioinform 19:636–643. 10.1093/bib/bbx005. [DOI] [PubMed] [Google Scholar]
- 15.Weaver J, Mohammad F, Buskirk AR, Storz G. 2019. Identifying small proteins by ribosome profiling with stalled initiation complexes. mBio 10:e02819-18. 10.1128/mBio.02819-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Melior H, Li S, Stötzel M, Maaß S, Schütz R, Azarderakhsh S, Shevkoplias A, Barth-Weber S, Baumgardt K, Ziebuhr J, Förstner KU, Chervontseva Z, Becher D, Evguenieva-Hackenberg E. 2021. Reprograming of sRNA target specificity by the leader peptide peTrpL in response to antibiotic exposure. Nucleic Acids Res 49:2894–2915. 10.1093/nar/gkab093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Harms A, Brodersen DE, Mitarai N, Gerdes K. 2018. Toxins, targets, and triggers: an overview of toxin-antitoxin biology. Mol Cell 70:768–784. 10.1016/j.molcel.2018.01.003. [DOI] [PubMed] [Google Scholar]
- 18.Meydan S, Marks J, Klepacki D, Sharma V, Baranov PV, Firth AE, Margus T, Kefi A, Vazquez-Laslop N, Mankin AS. 2019. Retapamulin-assisted ribosome profiling reveals the alternative bacterial proteome. Mol Cell 74:481–493. 10.1016/j.molcel.2019.02.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, Yefanov A, Lee H, Zhang N, Robertson CL, Serova N, Davis S, Soboleva A. 2013. NCBI GEO: archive for functional genomics data sets—update. Nucleic Acids Res 41:D991–D995. 10.1093/nar/gks1193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.van Opijnen T, Camilli A. 2013. Transposon insertion sequencing: a new tool for systems-level analysis of microorganisms. Nat Rev Microbiol 11:435–442. 10.1038/nrmicro3033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Makarewich CA, Olson EN. 2017. Mining for micropeptides. Trends Cell Biol 27:685–696. 10.1016/j.tcb.2017.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Butland G, Peregrin-Alvarez JM, Li J, Yang W, Yang X, Canadien V, Starostine A, Richards D, Beattie B, Krogan N, Davey M, Parkinson J, Greenblatt J, Emili A. 2005. Interaction network containing conserved and essential protein complexes in Escherichia coli. Nature 433:531–537. 10.1038/nature03239. [DOI] [PubMed] [Google Scholar]
- 23.Kitagawa M, Ara T, Arifuzzaman M, Ioka-Nakamichi T, Inamoto E, Toyonaga H, Mori H. 2005. Complete set of ORF clones of Escherichia coli ASKA library (a complete set of E. coli K-12 ORF archive): unique resources for biological research. DNA Res 12:291–299. 10.1093/dnares/dsi012. [DOI] [PubMed] [Google Scholar]
- 24.Quendera AP, Seixas AF, Dos Santos RF, Santos I, Silva JPN, Arraiano CM, Andrade JM. 2020. RNA-binding proteins driving the regulatory activity of small non-coding RNAs in bacteria. Front Mol Biosci 7:78. 10.3389/fmolb.2020.00078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Orr MW, Mao Y, Storz G, Qian SB. 2020. Alternative ORFs and small ORFs: shedding light on the dark proteome. Nucleic Acids Res 48:1029–1042. 10.1093/nar/gkz734. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Grutzner J, Billenkamp F, Spanka DT, Rick T, Monzon V, Forstner KU, Klug G. 2021. The small DUF1127 protein CcaF1 from Rhodobacter sphaeroides is an RNA-binding protein involved in sRNA maturation and RNA turnover. Nucleic Acids Res 49:3003–3019. 10.1093/nar/gkab146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Yadavalli SS, Yuan J. 2022. Bacterial small membrane proteins: the Swiss Army knife of regulators at the lipid bilayer. J Bacteriol 204:e00344-21. 10.1128/JB.00344-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Berghoff BA, Wagner EGH. 2017. RNA-based regulation in type I toxin-antitoxin systems and its implication for bacterial persistence. Curr Genet 63:1011–1016. 10.1007/s00294-017-0710-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Fozo EM, Hemm MR, Storz G. 2008. Small toxic proteins and the antisense RNAs that repress them. Microbiol Mol Biol Rev 72:579–589. 10.1128/MMBR.00025-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Cahill J, Young R. 2019. Phage lysis: multiple genes for multiple barriers. Adv Virus Res 103:33–70. 10.1016/bs.aivir.2018.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Li Y, Li Y, Mengist HM, Shi C, Zhang C, Wang B, Li T, Huang Y, Xu Y, Jin T. 2021. Structural basis of the pore-forming toxin/membrane interaction. Toxins (Basel) 13:128. 10.3390/toxins13020128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Yeom J, Shao Y, Groisman EA. 2020. Small proteins regulate Salmonella survival inside macrophages by controlling degradation of a magnesium transporter. Proc Natl Acad Sci USA 117:20235–20243. 10.1073/pnas.2006116117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Alix E, Blanc-Potard AB. 2008. Peptide-assisted degradation of the Salmonella MgtC virulence factor. EMBO J 27:546–557. 10.1038/sj.emboj.7601983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Blanc-Potard AB, Groisman EA. 2021. How pathogens feel and overcome magnesium limitation when in host tissues. Trends Microbiol 29:98–106. 10.1016/j.tim.2020.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Wang H, Yin X, Wu Orr M, Dambach M, Curtis R, Storz G. 2017. Increasing intracellular magnesium levels with the 31-amino acid MgtS protein. Proc Natl Acad Sci USA 114:5689–5694. 10.1073/pnas.1703415114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Yin X, Wu Orr M, Wang H, Hobbs EC, Shabalina SA, Storz G. 2019. The small protein MgtS and small RNA MgrR modulate the PitA phosphate symporter to boost intracellular magnesium levels. Mol Microbiol 111:131–144. 10.1111/mmi.14143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kato A, Chen HD, Latifi T, Groisman EA. 2012. Reciprocal control between a bacterium’s regulatory system and the modification status of its lipopolysaccharide. Mol Cell 47:897–908. 10.1016/j.molcel.2012.07.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Hobbs EC, Yin X, Paul BJ, Astarita JL, Storz G. 2012. Conserved small protein associates with the multidrug efflux pump AcrB and differentially affects antibiotic resistance. Proc Natl Acad Sci USA 109:16696–16701. 10.1073/pnas.1210093109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wang Z, Fan G, Hryc CF, Blaza JN, Serysheva II, Schmid MF, Chiu W, Luisi BF, Du D. 2017. An allosteric transport mechanism for the AcrAB-TolC multidrug efflux pump. Elife 6:e24905. 10.7554/eLife.24905. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Du D, Neuberger A, Orr MW, Newman CE, Hsu PC, Samsudin F, Szewczak-Harris A, Ramos LM, Debela M, Khalid S, Storz G, Luisi BF. 2020. Interactions of a bacterial RND transporter with a transmembrane small protein in a lipid environment. Structure 28:625–634. 10.1016/j.str.2020.03.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Lloyd CR, Park S, Fei J, Vanderpool CK. 2017. The small protein SgrT controls transport activity of the glucose-specific phosphotransferase system. J Bacteriol 199:e00869-16. 10.1128/JB.00869-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.VanOrsdel CE, Bhatt S, Allen RJ, Brenner EP, Hobson JJ, Jamil A, Haynes BM, Genson AM, Hemm MR. 2013. The Escherichia coli CydX protein is a member of the CydAB cytochrome bd oxidase complex and is required for cytochrome bd oxidase activity. J Bacteriol 195:3640–3650. 10.1128/JB.00324-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hoeser J, Hong S, Gehmann G, Gennis RB, Friedrich T. 2014. Subunit CydX of Escherichia coli cytochrome bd ubiquinol oxidase is essential for assembly and stability of the di-heme active site. FEBS Lett 588:1537–1541. 10.1016/j.febslet.2014.03.036. [DOI] [PubMed] [Google Scholar]
- 44.Safarian S, Hahn A, Mills DJ, Radloff M, Eisinger ML, Nikolaev A, Meier-Credo J, Melin F, Miyoshi H, Gennis RB, Sakamoto J, Langer JD, Hellwig P, Kuhlbrandt W, Michel H. 2019. Active site rearrangement and structural divergence in prokaryotic respiratory oxidases. Science 366:100–104. 10.1126/science.aay0967. [DOI] [PubMed] [Google Scholar]
- 45.Brandenburg F, Klahn S. 2020. Small but smart: on the diverse role of small proteins in the regulation of cyanobacterial metabolism. Life 10:322. 10.3390/life10120322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Zabret J, Bohn S, Schuller SK, Arnolds O, Moller M, Meier-Credo J, Liauw P, Chan A, Tajkhorshid E, Langer JD, Stoll R, Krieger-Liszkay A, Engel BD, Rudack T, Schuller JM, Nowaczyk MM. 2021. Structural insights into photosystem II assembly. Nat Plants 7:524–538. 10.1038/s41477-021-00895-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Schuller JM, Birrell JA, Tanaka H, Konuma T, Wulfhorst H, Cox N, Schuller SK, Thiemann J, Lubitz W, Setif P, Ikegami T, Engel BD, Kurisu G, Nowaczyk MM. 2019. Structural adaptations of photosynthetic complex I enable ferredoxin-dependent electron transfer. Science 363:257–260. 10.1126/science.aau3613. [DOI] [PubMed] [Google Scholar]
- 48.Krauspe V, Fahrner M, Spat P, Steglich C, Frankenberg-Dinkel N, Macek B, Schilling O, Hess WR. 2021. Discovery of a small protein factor involved in the coordinated degradation of phycobilisomes in cyanobacteria. Proc Natl Acad Sci USA 118:e2012277118. 10.1073/pnas.2012277118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Collier JL, Grossman AR. 1994. A small polypeptide triggers complete degradation of light-harvesting phycobiliproteins in nutrient-deprived cyanobacteria. EMBO J 13:1039–1047. 10.1002/j.1460-2075.1994.tb06352.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Ramamurthi KS, Lecuyer S, Stone HA, Losick R. 2009. Geometric cue for protein localization in a bacterium. Science 323:1354–1357. 10.1126/science.1169218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Gill RL, Jr, Castaing J-P, Hsin J, Tan IS, Wang X, Huang KC, Tian F, Ramamurthi KS. 2015. Structural basis for the geometry-driven localization of a small protein. Proc Natl Acad Sci USA 112:E1908–E1915. 10.1073/pnas.1423868112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Kim EY, Tyndall ER, Huang KC, Tian F, Ramamurthi KS. 2017. Dash-and-recruit mechanism drives membrane curvature recognition by the small bacterial protein SpoVM. Cell Syst 5:518–526. 10.1016/j.cels.2017.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Peluso EA, Updegrove TB, Chen J, Shroff H, Ramamurthi KS. 2019. A 2-dimensional ratchet model describes assembly initiation of a specialized bacterial cell surface. Proc Natl Acad Sci USA 116:21789–21799. 10.1073/pnas.1907397116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Impens F, Rolhion N, Radoshevich L, Becavin C, Duval M, Mellin J, Garcia Del Portillo F, Pucciarelli MG, Williams AH, Cossart P. 2017. N-terminomics identifies Prli42 as a membrane miniprotein conserved in Firmicutes and critical for stressosome activation in Listeria monocytogenes. Nat Microbiol 2:17005. 10.1038/nmicrobiol.2017.5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.de Alvarenga LV, Hess WR, Hagemann M. 2020. AcnSP—a novel small protein regulator of aconitase activity in the cyanobacterium Synechocystis sp. PCC 6803. Front Microbiol 11:1445. 10.3389/fmicb.2020.01445. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Bolay P, Rozbeh R, Muro-Pastor MI, Timm S, Hagemann M, Florencio FJ, Forchhammer K, Klahn S. 2021. The novel PII-interacting protein PirA controls flux into the cyanobacterial ornithine-ammonia cycle. mBio 12:e00229-21. 10.1128/mBio.00229-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Gutt M, Jordan B, Weidenbach K, Gudzuhn M, Kiessling C, Cassidy L, Helbig A, Tholey A, Pyper DJ, Kubatova N, Schwalbe H, Schmitz RA. 2021. High complexity of glutamine synthetase regulation in Methanosarcina mazei: small protein 26 interacts and enhances glutamine synthetase activity. FEBS J 288:5350–5373. 10.1111/febs.15799. [DOI] [PubMed] [Google Scholar]
- 58.Weidenbach K, Gutt M, Cassidy L, Chibani C, Schmitz RA. 2022. Small proteins in Archaea, a mainly unexplored world. J Bacteriol 204:e00313-21. 10.1128/JB.00313-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Lippa AM, Goulian M. 2009. Feedback inhibition in the PhoQ/PhoP signaling system by a membrane peptide. PLoS Genet 5:e1000788. 10.1371/journal.pgen.1000788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Salazar ME, Podgornaia AI, Laub MT. 2016. The small membrane protein MgrB regulates PhoQ bifunctionality to control PhoP target gene expression dynamics. Mol Microbiol 102:430–445. 10.1111/mmi.13471. [DOI] [PubMed] [Google Scholar]
- 61.Erez Z, Steinberger-Levy I, Shamir M, Doron S, Stokar-Avihail A, Peleg Y, Melamed S, Leavitt A, Savidor A, Albeck S, Amitai G, Sorek R. 2017. Communication between viruses guides lysis-lysogeny decisions. Nature 541:488–493. 10.1038/nature21049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Hentschel J, Burnside C, Mignot I, Leibundgut M, Boehringer D, Ban N. 2017. The complete structure of the Mycobacterium smegmatis 70S ribosome. Cell Rep 20:149–160. 10.1016/j.celrep.2017.06.029. [DOI] [PubMed] [Google Scholar]
- 63.Fontaine F, Fuchs RT, Storz G. 2011. Membrane localization of small proteins in Escherichia coli. J Biol Chem 286:32464–32474. 10.1074/jbc.M111.245696. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Steinberg R, Origi A, Natriashvili A, Sarmah P, Licheva M, Walker PM, Kraft C, High S, Luirink J, Shi WQ, Helmstädter M, Ulbrich MH, Koch H-G. 2020. Posttranslational insertion of small membrane proteins by the bacterial signal recognition particle. PLoS Biol 18:e3000874. 10.1371/journal.pbio.3000874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Kubatova N, Pyper DJ, Jonker HRA, Saxena K, Remmel L, Richter C, Brantl S, Evguenieva-Hackenberg E, Hess WR, Klug G, Marchfelder A, Soppa J, Streit W, Mayzel M, Orekhov VY, Fuxreiter M, Schmitz RA, Schwalbe H. 2020. Rapid biophysical characterization and NMR structural analysis of small proteins from bacteria and archaea. Chembiochem 21:1178–1187. 10.1002/cbic.201900677. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Huang CS, Pedersen BP, Stokes DL. 2017. Crystal structure of the potassium-importing KdpFABC membrane complex. Nature 546:681–685. 10.1038/nature22970. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Vakirlis N, Acar O, Hsu B, Castilho Coelho N, Van Oss SB, Wacholder A, Medetgul-Ernar K, Bowman RW, II, Hines CP, Iannotta J, Parikh SB, McLysaght A, Camacho CJ, O’Donnell AF, Ideker T, Carvunis A-R. 2020. De novo emergence of adaptive membrane proteins from thymine-rich genomic sequences. Nat Commun 11:781. 10.1038/s41467-020-14500-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Knopp M, Babina AM, Gudmundsdóttir JS, Douglass MV, Trent MS, Andersson DI. 2021. A novel type of colistin resistance genes selected from random sequence space. PLoS Genet 17:e1009227. 10.1371/journal.pgen.1009227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Harms A, Maisonneuve E, Gerdes K. 2016. Mechanisms of bacterial persistence during stress and antibiotic exposure. Science 354:aaf4268. 10.1126/science.aaf4268. [DOI] [PubMed] [Google Scholar]
- 70.Fremin BJ, Sberro H, Bhatt AS. 2020. MetaRibo-Seq measures translation in microbiomes. Nat Commun 11:3268. 10.1038/s41467-020-17081-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Belon C, Rosas Olvera M, Vives E, Kremer L, Gannoun-Zaki L, Blanc-Potard AB. 2016. Use of the Salmonella MgtR peptide as an antagonist of the Mycobacterium MgtC virulence factor. Future Microbiol 11:215–225. 10.2217/fmb.15.134. [DOI] [PubMed] [Google Scholar]

