Skip to main content
Journal of Bacteriology logoLink to Journal of Bacteriology
. 2022 Jan 18;204(1):e00313-21. doi: 10.1128/JB.00313-21

Small Proteins in Archaea, a Mainly Unexplored World

Katrin Weidenbach a, Miriam Gutt a, Liam Cassidy b, Cynthia Chibani a, Ruth A Schmitz a,
Editor: Tina M Henkinc
PMCID: PMC8765429  PMID: 34543104

ABSTRACT

In recent years, increasing numbers of small proteins have moved into the focus of science. Small proteins have been identified and characterized in all three domains of life, but the majority remains functionally uncharacterized, lack secondary structure, and exhibit limited evolutionary conservation. While quite a few have already been described for bacteria and eukaryotic organisms, the amount of known and functionally analyzed archaeal small proteins is still very limited. In this review, we compile the current state of research, show strategies for systematic approaches for global identification of small archaeal proteins, and address selected functionally characterized examples. Besides, we document exemplarily for one archaeon the tool development and optimization to identify small proteins using genome-wide approaches.

KEYWORDS: small proteins, short ORFs, archaea

INTRODUCTION

Short open reading frame (sORF)-encoded small proteins with a size of less than 70 amino acids (aa) have been reported in all three domains of life (e.g., references 1 to 3). Although the first small protein was reported in 1965 (4), small proteins have been overlooked in prokaryotes for a long time due to technical limitations in detection. However, modern genomics and transcriptomics technologies, combined with systematic genome-wide approaches, have uncovered an unexpected genome complexity in prokaryotes. As a result, over the past decade, global approaches have discovered a wealth of hidden small genes containing short open reading frames in many prokaryotic genomes in addition to genes encoding larger (standard) proteins and genes of noncoding RNAs (ncRNAs) (5, 6). In consequence, the number of experimentally identified sORFs and verified small proteins is constantly increasing. However, information regarding a specific cellular physiological role is lacking for the majority of confirmed small proteins, and only a small number identified in diverse prokaryotes have been functionally characterized in detail. They have been reported to play crucial roles in various cellular processes, including stress response, transcription and translation regulation, metabolism, and antibiotic resistance (reviewed in references 1, 3, and 6 to 18). In the respective reviews and original reports, the definitions of small proteins encoded by sORFs differ, particularly in their maximal size, which ranges between 50 to 100 aa. Here, we focused on proteins with a maximal length of 70 residues.

Due to the late discovery of archaea, as well as their frequent occurrence in extreme habitats (with high temperatures, high salt, extreme pH, or anoxic conditions), only a small proportion of the Archaea species has been described and successfully cultured in the lab so far. In the NCBI database are 1,985 genome assemblies of archaea, in contrast to 40,312 genome assemblies of bacteria (data retrieved 26 May 2021), clearly showing the underrepresentation of archaea. Consequently, in comparison to bacteria, significantly fewer small proteins have been identified in archaea. In this review, we report on the current research on small proteins in archaea and introduce the tools for genome-wide screening approaches generally used for prokaryotes. Finally, the development and optimization of tools to identify small proteins is exemplarily documented for the methanogenic archaeon Methanosarcina mazei.

STRATEGIES AND SYSTEMATIC APPROACHES FOR GLOBAL IDENTIFICATION OF SMALL PROTEINS IN ARCHAEA

In addition to the relatively small number of archaea described so far and the fact that they are often difficult to culture, several methodological reasons, not archaeon-specific but more general, have complicated the identification of small proteins in archaea; for example, automated gene prediction algorithms used are missing the small proteins (1, 3, 1924). Another profound reason is the technical limitations of detecting small proteins (e.g., SDS-PAGE resolution or mass spectrometry) (2529). Overall, a standard procedure for detecting small proteins in archaea is missing. Different strategies, including systematic approaches for the global identification of small proteins, are often combined. Since these methods have been used for both bacteria and archaea and are discussed in detail in a separate review within this issue, they are only briefly summarized here, as follows.

The majority of the so far identified and described archaeal small proteins (summarized in Table 1) have been found by serendipity. Small proteins were identified using various experimental approaches not explicitly aiming to identify small proteins, like the 7-kDa DNA-binding proteins from Crenarchaea (Sulfolobales) detected by isopycnic centrifugation of cell extracts and subsequent isolation of the DNA-binding proteins (30). Brz1 from Halobacterium salinarum was identified by inverse structural genomics. The proteome, which is smaller than 20 kDa, was first enriched by filter membrane centrifugation, then separated by Tricine–SDS-PAGE, followed by a short digestion, and then analyzed by One-dimensional liquid chromatography–mass spectrometry/tandem mass spectrometry (LTQ-FT system) and mapped to the genome (31). In this approach, 20 proteins containing a characteristic DNA/RNA-binding zinc finger motif were discovered, including 10 proteins smaller than 70 aa (32). The small protein rubredoxin was originally described for Clostridium pasteurianum (4). Rubredoxin of Pyrococcus furiosus was isolated under anaerobic conditions by several different chromatographic separations (including ion exchange, hydroxyapatite chromatography, and size exclusion chromatography), followed by amino acid sequencing (repetitive Edman degradation) (33). Small proteins similar to experimentally discovered ones were later identified in the databases based on sequence comparisons (similarity-based identification). Today, modern and sophisticated sequencing techniques allow global approaches to predict sORFs and verify small proteins. In a first genome-wide transcriptomics approach comparing M. mazei growing under various nitrogen availability conditions, 44 short ORFs encoding proteins 30 to 90 amino acids (aa) in length were discovered. The identification of sORFs was achieved by manual curation of the cDNA reads, mainly in the noncoding region of the genome (34). Three of those were later experimentally verified to be translated by LC-MS/MS analysis and were predicted to be involved in regulation of nitrogen metabolism (2). Other examples of archaeal sORFs identified by transcriptome analysis are the respective ORFs encoding four small heat shock proteins and four TRAM-domain containing proteins with low molecular weight in the psychrophilic methanoarchaeon Methanolobus psychrophilus R15, which are upregulated at 4°C compared to expression during growth at 18°C (35).

TABLE 1.

Identified and functionally characterized small proteins in archaea

Small protein Length (aa) Organism(s) Function Reference(s)
VhuU 69 Methanococcus voltae Posttranslational processes, crucial for [NiFe] hydrogenase activity 88, 89
Histone-like 55–69 Euryarcheota Stabilization of DNA 63
7-kDa binding family 51–78 Crenaechaeota (Sulfolobis sp.) Stabilization of DNA, transcription factors 30, 61, 6567
Cold shock proteins 60–70 Several groups RNA-binding chaperons, TRAM domain 35, 58, 82, 83
sORF56 56 Sulfolobus islandicus Transcription factor 69, 70, 72
Nop10 60 Methanocaldococcus jannaschii, Pyrococcus abyssi, Pyrococcus furiosus Crucial for pseudouridine synthetase 85 87
TIP 64 Thermococcus kodakarensis PCNA inhibitor 76, 77
Spt4 59–61 Methanocaldococcus jannaschii, Pyrococcus furiosus Crucial part of transcription elongation factor 7880, 133
SecE 61 Pyrococcus furiosus Crucial for SecYEG protein transport channel 114 119
SecG 53 Methanocaldococcus jannaschii Plug of SecYEG protein transport channel
CedA2 52 Sulfolobus acidocaldarius Part of the Ced DNA import channel 120
RelB 51–82 Pyrococcus horikoshii, Methanococcus jannaschii, Archaeoglobus fulgidus Antitoxin of a TA system II 97, 98, 134
SAMP1/SAMP2 66, 69 Haloferax volcanii, Pyrococcus furiosus, Methanosarcina acetivorans Ubiquitin-like modifiers 105107, 135, 136
AfDmpI 63 Archaeoglobus fulgidus Keto-enol tautomerase activity 137
Brz-1 60 Halobacterium salinarum Transcription factor 31, 32
Dodecin 68 Halobacterium salinarum Quenching based on dealkylation of riboflavin 127 129
Rubredoxin 45–55 Pyrococcus furiosus Detoxification of reactive oxygen species 33, 121
HVO_0416 58 Haloferax volcanii Affects biofilm formation
HVO_0649 68 Haloferax volcanii Affects swarming
HVO_0758 56 Haloferax volcanii Affects biofilm formation and swarming 56, 57, 59
HVO_2142 50 Haloferax volcanii Affects biofilm formation and swarming
HVO_2523 50 Haloferax volcanii Affects biofilm formation and swarming
HVO_2753 59 Haloferax volcanii Affects biofilm formation and swarming
HVO_2901 54 Haloferax volcanii Affects biofilm formation and swarming
HVO_2922 60 Haloferax volcanii Upregulated under iron limitation
HVO_A0556 47 Haloferax volcanii Affects biofilm formation and swarming
sP36 61 Methanosarcina mazei Related to nitrogen metabolism
sP41 53 Methanosarcina mazei Putative transcription factor 2
sP44 23 Methanosarcina mazei Putative transcription factor
sP26 23 Methanosarcina mazei Regulator of glutamine synthetase 52

Another global approach for detecting small proteins is based on a proteome-wide approach. Small proteins are generally underrepresented in most proteomics studies, since the identification by classical methods of bottom-up proteomics is challenging and shows an inherent bias against small proteins (36, 37). These biases arise due to a number of factors, such as the sequence length, which following enzymatic digestion can result in only a limited number of peptides, or in extreme cases no or only a single peptide(s). Additionally, in many cases, small proteins are assumed to be in low abundance, which further hampers their detection against a background of high-abundance proteins capable of producing a large number of potentially detectable peptides. Thus, in recent years, tools have been developed and adapted to optimize for the detection of small proteins (often termed peptidomics approaches). One major issue is to reduce the complexity of the samples and to enrich for the small proteins or deplete for larger proteins. Examples for enrichment of small proteins are gel-based methods (38, 39), organic solvent depletion methods (40), solid-phase extraction (37, 41, 42), molecular weight cutoff filters (43), size exclusion chromatography (44), and adapted protein extraction methods (45). To reduce or circumvent the loss of small proteins due to binding to larger protein complexes during sample processing, chaotropic reagents, such as guanidinium hydrochloride, can be used (37, 42). Recently, a combined bottom-up and top-down proteomics workflow was utilized for the analysis of small proteins (37). The combined analysis provides the benefits of both methods, allows characterization of proteoforms and a limited number of posttranslational modifications, and results in a more confident identification of small proteins than has previously been reported (see “Small proteins identified in M. mazei”). Finally, a multiprotease approach can be used for the improved identification and characterization of small proteins, as recently shown for M. mazei (46).

The global tool of ribosome profiling (also called RiboSeq or ribosome foot-printing), which was originally established in eukaryotes (47), has been recently successfully adapted for prokaryotes to identify new small proteins (e.g., references 4850). In general, this method can identify the translation of predicted ORFs (including sORFs). Initially, extracted RNA is depleted for rRNA and nucleolytically digested, except for those fragments that are protected by ribosomes. By sequencing the remaining mRNA fragments that are protected from nuclease digestion by actively translating ribosomes (covering ∼30 to 40 nucleotides), the method provides a global picture of transcripts that are actively translated. The method further enables determination of the ORF boundaries, or refining of sORF annotations (27, 51); see also the respective review in this special issue. Very recently, the first RiboSeq analysis for an archaeon, Haloferax volcanii, was reported (50). Evaluating the data sets showed that 75% of the footprints map to the first nucleotide of the start codon, observing 3-nucleotide periodicity, as reported for eukaryotes. Overall, 160 unknown translation start sites were detected in H. volcanii, of which 68 represent small proteins.

CATEGORIES AND STRUCTURE OF SMALL PROTEINS IN ARCHAEA

Depending on the localization, the experimentally verified sORFs can be differentiated into different categories (Fig. 1). The first category is translated sORFs located in intergenic regions, which have been identified in, for example, M. mazei (e.g., sORF36, sORF41, sORF44, and sORF26) (2, 52), as well as in Halobacterium salinarum (Brz1) (32) (Fig. 1A). The second category is sORFs encoded in the 5′ untranslated region (UTR) of an operon or gene, so-called upstream ORFs (uORFs) (Fig. 1B), which might regulate the translation of the downstream gene(s) as known for bacteria (e.g., attenuation [6, 53, 54]). Translation of uORFs has recently been shown in H. volcanii (50); however, little is currently known about the effects of the respective small proteins and whether they regulate the translation of the downstream ORF. Besides, internal out-of-frame sORFs (alternative ORFs) and antisense translation ORFs (asORFs) have been identified in H. volcanii by the recent ribosome profiling approach (Fig. 1C and D) (50). A recently published study reports on the structural analysis of 27 small proteins, identified to be expressed in nine different bacteria and archaea, by circular dichroism (CD) and nuclear magnetic resonance (NMR) spectroscopy (55). Overall, the systematic screen demonstrated that most of these small proteins appeared to be unstructured or only partially folded, especially if they contain fewer than 30 aa. Most interestingly, bioinformatics tools such as the FuzPred algorithm predict that some of these unstructured small proteins might fold into a structure through disorder-to-order transitions upon interacting with their target protein (55). Besides, the structures of two archaeal small proteins of H. volcanii have been recently determined and characterized by NMR analysis, namely, those of the HVO_2753 protein (59 aa), which is potentially involved in biofilm formation (56), and the stress-regulated HVO_2922 protein (60 aa) (57).

FIG 1.

FIG 1

Localization of different types of short ORFs. (A) Intergenic localization, has its own regulatory elements; (B) localization within 5′-UTR (uORFs); (C) localization within a large ORF; and (D) localization in antisense. Panels B, C, and D created based on the data from Gelsinger et al. (50).

(POTENTIAL) FUNCTIONS OF SMALL PROTEINS IDENTIFIED IN ARCHAEA

In archaea overall, 335 small proteins (<70 aa) have been reported based on UniProt data (Fig. 2). Well-known examples of archaeal small proteins are able to interact with DNA or RNA or with other proteins. Some of these represent subunits of complexes. The distribution of these archaeal small proteins based on protein category and archaeal genus is summarized and depicted in Fig. 3. The color coding additionally indicates the respective characterization status, namely, structural evidence, evidence at protein or transcriptional levels, inferred from similarity, or bioinformatics prediction. The majority are regulated in response to a specific stress and, as mentioned, most have been found by serendipity or, more recently, by global screens. However, for most of these proteins, experimentally verified functional analysis is still scarce or missing (e.g., references 2, 38, 40, 50, and 58 to 60). Only for a few has a physiological function been predicted or experimentally identified. The current knowledge on small archaeal proteins and their respective predicted or experimentally verified functions is summarized in Table 1; selected examples will be introduced in more detail in the following sections.

FIG 2.

FIG 2

Distribution of small proteins within prokaryotes. Recorded reviewed small proteins (≤70 aa) within prokaryotes in the database UniProt (retrieved 5 October 2020), excluding ribosomal proteins and subunits of DNA polymerase.

FIG 3.

FIG 3

Distribution of small proteins within archaea. UpSet plot representing the number of small proteins (≤70 aa) in archaeal genera. Vertical bars represent the number of proteins per category shared between the specific archaeal genera. Protein categories are color coded by their characterization status based on the UniProt data (retrieved 5 October 2020). Horizontal bars in the lower panel indicate the total number of proteins contained in each genus. The different dots denote the presence of the protein category by genus. UpSet plot was generated using the library ggplot2 in R (132).

Small proteins interacting with DNA or RNA.

The majority of archaeal small proteins described to date are DNA/RNA-binding proteins. Archaeal histone-like proteins are generally smaller than eukaryotic histones (ranging from 55 to 69 aa), contain a simple histone fold (HF), and establish chromatin structures similar to those of eukaryotes; however, they form tetramers or hexamers (6164). Some Crenarchaeota lack histone-like proteins but contain members of the so-called “7-kDa DNA-binding” protein family (ranging from 51 to 78 aa), which show similar chromatin structuring function. Within this protein family are small basic proteins like the homologous Sul7d (Sulfolobus islandicus), Sac7d (Sulfolobus acidocaldarius), and Sso7d (Sulfolobus solfataricus). Those proteins are thermostable up to 100°C and stable under an extreme pH range from 0 to 12 (65). The proteins bind double-stranded DNA (dsDNA) in the minor groove without sequence specificity, resulting in an increase of stability of the DNA, especially at high temperatures (61, 6567). Moreover, it has been reported that some of the 7-kDa DNA-binding proteins can be covalently modified. Sul7d can be methylated on the solvent-exposed surface (61), whereas Sso7d can be ADP-ribosylated, which modulates its thermoprotective effect on bound dsDNA (66).

Another DNA-binding protein is the extremely stable ORF56 gene product (56 aa) encoded on a conserved plasmid (pRN1) from S. islandicus (6870). This small protein is a dimer in solution, but binds as a tetramer to its own promoter region at inverted repeats (autoregulation). ORF56 is organized in an operon together with ORF904, which encodes a multifunctional replication protein (mediating primase, DNA polymerase, and helicase activities), and overlaps the start site of ORF904. The ORF56-ORF904 operon has been shown to be essential for plasmid replication (71). Modulation of the expression of the operon on a transcriptional level results in regulation of the plasmid copy number by ORF56 (68, 72).

Two small proteins have been reported to be involved in replication and transcription. Proliferating cell nuclear antigen (PCNA), a trimeric protein that acts as a DNA clamp on the lagging strand (73), plays an essential role in replication and repair (74, 75). The small protein TIP (Thermococcales inhibitor of PCNA) in Thermococcus kodakarensis, which consists of 64 aa, was reported to inhibit PCNA-dependent activities (76). TIP disrupts the intact PCNA complex by binding to one monomer and blocking the interaction side for the trimer formation; consequently, the PCNA complex dissociates. This interaction with a small protein represents a novel mechanism to regulate PCNA activity (77). The transcription elongation factor complex Spt4/5 in P. furiosus (78) and Methanocaldococcus jannaschii (79), where Spt4 contains 61 and 59 aa, respectively, binds within the RNA polymerase claw encircling the DNA-binding channel. Due to this interaction, the small proteins prevent an early disengagement of the RNA polymerase and, furthermore, inhibit the formation of DNA secondary structures inside the transcription bubble. Thereby, the Spt4/5 complex improves the processivity of the RNA polymerase and transcription (7880).

Besides small heat shock proteins, proteins analogous to bacterial cold shock proteins (Csps) have been reported in archaea. In Methanolobus psychrophilus and Methanococcoides burtonii, several small proteins (60 to 70 aa) containing a single TRAM (TRM2 and MiaB) domain (81) were detected that are strongly induced under low temperatures (35, 82). The TRAM domains (60 to 75 aa) are supposed to bind RNA and belong to RNA-modifying proteins (81). Four M. psychrophilus TRAM-containing small proteins have been demonstrated to functionally complement an Escherichia coli Csp-deficient strain, proving its function as nonspecific RNA chaperone (58, 83). Recent work showed that the small TRAM0076 protein (69 aa) from Methanococcus maripaludis globally effects gene expression at the posttranscriptional level, in addition to the RNA chaperone activity (84).

The small protein Nop10 (60 aa), is essential for the function of pseudouridine synthase Cbf5 from Methanocaldococcus jannaschii, Pyrococcus abyssi, and P. furiosus. The conversion of uridine into pseudouridine is the most frequent modification of structured RNAs (85). Nop10 consists of a conserved N-terminal zinc ribbon and a C-terminal α-helix, which are connected by a linker. Both ends seem to interact with the pseudouridine synthase Cbf5, which potentially enables RNA binding. It has been shown that binding of Nop 10 to Cfb5 results in masking its negative charges and mediates an interaction site of Cfb5 for pseudouridylation to target RNA (8587).

Small proteins interacting with proteins or representing subunits of complexes.

The small protein VhuU (25 aa) is one of the first functional characterized small proteins (88, 89). It represents a subunit of the [NiFeSe] hydrogenase (Vhu) from Methanococcus voltae, which is a key enzyme in methanogenesis under sufficient selenium conditions. The small protein VhuU contains selenocysteine and is a part of the primary reaction center of the Vhu hydrogenase. It has been shown to be posttranslationally processed. The preprotein consists of 69 aa and is shortened, leading to a 25-aa small protein (89). Interestingly, another [NiFeSe] hydrogenases from M. voltae (Fru), contains the corresponding peptide sequence of VhuU as C-terminal parts of the large subunit (90). A respective engineered fusion protein consisting of VhuA and VhuU showed no significant difference in hydrogen uptake activity of the Vhu hydrogenase, but a 10-fold higher Km value for hydrogen compared to that of the wild-type enzyme. Thus, at low hydrogen concentrations, the wild-type enzyme Vhu, consisting of two subunits, might have an advantage in competing with other hydrogen-utilizing microorganisms in the environment (91, 92).

Several toxin-antitoxin systems are reported within the small proteins for Bacteria (reviewed in, e.g., references 18, 93, 94). In archaea, only one toxin-antitoxin system has been observed to date, the RelE-RelB toxin-antitoxin system II reported in M. jannaschii, Archaeoglobus fulgidus, and Pyrococcus horikoshii (95). The toxin-antitoxin systems consist of one small toxic protein (RelE, ranging from 85 to 92 aa) and the corresponding small antitoxin protein (RelB, ranging from 57 to 82 aa), which neutralizes the toxin on the protein level (95, 96). Both genes are organized in an operon induced by nutritional stress and represent homologs of the E. coli MazFE system. RelE encodes a ribosome-dependent RNase, which cleaves stop (UAG, UAA, and UGA) and sense codons (UCG and CAG), thus inhibiting protein synthesis and cell growth. However, as long as RelE and RelB are simultaneously present, the toxic effect of RelE is neutralized by RelB via protein-protein interaction that prevents binding of RelE to the ribosome (97, 98). Under certain conditions, RelB is degraded by the Lon protease, leading to rapid cell stasis or death (98).

Small proteins also play important roles in protein modification; one example is the process of “SAMPyling” other proteins. The ubiquitin-like proteins (Ubl), which do not have high sequence similarities to the eukaryotic ubiquitin (Ub) but do have a similar three-dimensional structure containing the β-grasp fold and a C-terminal glycine, are present in all three domains of life (99104). In archaea, so called small archaeal modifier proteins (SAMPs) have been identified (105110). Examples include the two small modifier proteins SAMP1 (87 aa) and SAMP2 (66 aa) from Haloferax volcanii, which have been shown to be involved in protein modification by covalent binding to lysine residues of the target proteins, subsequently leading to cleavage of the conjugates by HvJAMM1 (111113). Besides a second function in sulfur transfer similar to that of the bacterial Ubl, known functions like those for MoaD or ThiS were implicated. Thereby, SAMP1 is essential for the activity of molybdenum cofactor-dependent enzymes, and SAMP2 is required for tRNA thiosylation (112, 113).

The ability to encircle other proteins or to form clamps has been observed several times for small proteins. Examples include SecE (61 aa) in P. furiosus and SecG (53 aa) in M. jannaschii, both subunits of the SecYEG translocation channel, which is involved in protein transport. Whereas SecE is essential and acts as a clamp and opens or closes the transport channel, SecG functions as a plug to keep the channel closed, preventing small molecules passing the channel (114119). Another small protein involved in transport is CedA2 (52 aa) in Sulfolobus acidocaldarius. This small protein is a component of the Ced system, which is crucial for DNA import. Under UV stress, S. acidocaldarius induces the pilus-producing ups operon, resulting in a mating-pair formation in which the UV-damaged cell imports DNA from a nondamaged cell via the Ced complex (120).

Rubredoxins (45 to 55 aa) are nonheme iron proteins that are involved in the reduction of superoxide in some anaerobic bacteria or act as electron carriers in many biochemical processes (84). Homologues of rubredoxin have been found in different anaerobic archaea, such as P. furiosus, Pyrococcus abyssi, Methanothermobacter thermoautotrophicus, and Thermotoga maritima (121, 122). Although in vitro studies using a reconstituted system have indicated that rubredoxin from P. furiosus is involved in detoxification of reactive oxygen species using a reconstituted system by donating electrons for superoxide reductase (SOR) and rubrerythrin (123), the in vivo function of rubredoxin in P. furiosus is still not clear. A generated rubredoxin deletion strain did not show higher O2 sensitivity than that of the parental strain, implying that another redox protein is able to directly reduce SOR or flavodiiron protein A (FdpA) (124). P. furiosus rubredoxin is monomeric and might be the most thermostable protein, with a melting temperature of nearly 200°C (33, 121, 125, 126).

Just recently, the first biochemical and detailed functional characterizations of one M. mazei small protein were reported, elucidating the physiological role of the first archaeal small protein (52). The small protein (sP26) of 23 aa, including the noncanonical start (valine), was highly conserved within numerous Methanosarcina strains on the nucleotide and amino acid levels. The transcript levels of small peptide RNA 26 (spRNA26) were increased under nitrogen limitation conditions or under salt or oxygen stress. Under nitrogen limitation, when the 2-oxoglutarat level in the cell is high, glutamine synthetase GlnA1 forms dodecameric structures, resulting in activation. GlnA1 activity has been shown to be activated in the presence of sP26, most likely based on a stabilizing effect of sP26 on the dodecameric GlnA1 or by forming tighter dodecameric GlnA1 complexes when binding to sP26. A further stabilization occurred by binding of the PII protein GlnK1 to GlnA1, mediated by sp26. Due to the fact that homologues to sP26 have been identified in several Methanosarcina strains, it can be assumed that these are involved in nitrogen fixation in a similar way (52).

Small proteins interacting with DNA or proteins in Haloarchaea.

Quite a high number of small proteins have been identified in Haloarchaea. In an early study elucidating the low-molecular-weight proteome (<20 kDa) of Halobacterium salinarum, 45 small proteins were discovered. In this study, the small protein bacteriorhodopsin-regulating zinc finger protein (Brz-1; 60 aa) was identified (see supporting material for reference 31). The respective brz-1 gene, as well as another sORF (brp-1), is located in the intergenic region between the bop and brp operons, encoding the key component bacteriorhodopsin of the photosynthetic system of H. salinarum and a bacteriorhodopsin-related protein, respectively. Brz-1 represents a zinc finger-containing transcription factor and is crucial for expression of the photosynthetic system, including the bop cluster and the crtB1 gene, which encodes the first enzyme in the carotenoid biosynthesis pathway (31, 32).

Another small protein, dodecin (68 aa), has been identified in H. salinarum under stress conditions. Solving the structure by X-ray crystallography showed a riboflavin-binding site provided by three β-sheets and one α-helix (127, 128). Upon binding the riboflavin, dodecin forces a relaxation by dealkylating the riboflavin to 7,8-dimethylalloxazine (lumichrome; fast quenching), which results in reducing the risk for uncontrolled light-induced damage by riboflavin and simultaneously leads to the accumulation of the important precursor. Thus, dodecin buffers the riboflavin concentration inside the cell. When conditions are more favorable, dodecin is downregulated, freeing the riboflavin for flavin adenine dinucleotide (FAD) and flavinmononucleotid biosynthesis (129).

A more recent study focused on 16 genes encoding putative small zinc-finger proteins in H. volcanii (ranging from 50 to 68 aa). Since all respective chromosomal deletion mutants were successfully generated, it is assumed that none of those small proteins is essential. However, 12 mutants showed a clear phenotype related to motility under nonoptimal conditions. Six of those mutants (HVO_0649, HVO_0758, HVO_2142, HVO_2523, HVO_2901, and HVO_A0556) lost their swarming ability, whereas others (HVO_0416, HVO_0758, HVO_2142, HVO_2523, HVO_2901, and HVO_A0556) showed increased biofilm formation (3- to 9-fold increase) (59). Interestingly, the small protein HVO_2753, which is conserved within euryarchaeota, contains two zinc-binding motifs. The respective deletion strain lost its swarming ability and did not form proper biofilms (56).

The H. volcanii small protein HVO_2922 (60 aa) has been shown to be upregulated under iron limitation conditions and downregulated under high salt concentrations, suggesting that it is stress regulated. The NMR solution structure of the protein and mutant derivatives showed that the protein forms a symmetrical dimer that is probably essential for its functionality. HVO_2922 is conserved in Haloarchaea; based on the structure of the dimer this protein belongs to the UPF0339 family with unknown function (57).

Tool development and optimization of genome-wide identification of small proteins in Methanosarcina mazei.

In the methanogenic archeaon M. mazei, several sORFs have been discovered genome-wide using a differential transcriptome sequencing (RNA-seq) approach (34). In this study, 44 RNAs that each contain a putative sORF have been identified and designated small peptide RNAs (spRNAs), and their respective small proteins have been predicted. Several of these spRNAs have been demonstrated to be differentially transcribed in response to nitrogen availability. Today, based on several genome-wide RNA-seq studies of M. mazei grown under various growth and stress conditions (e.g., nitrogen starvation; oxygen, temperature, and salt stresses; and/or biofilm formation), a total of 1,442 small RNAs in M. mazei containing predicted sORFs (<70 aa) have been identified by manual analysis, as well as by automated bioinformatics tools such as ANNOgesic (130). In total, 1,330 of those comprised putative sORFs encoding small proteins ranging from 8 to 49 aa, the length distribution of which is depicted in Fig. 4A. Interestingly, the majority of the predicted small proteins start with a noncanonical, alternative start codon (70.4%) and fewer with the canonical methionine. These identified sORFs in general have to be verified to be translated by using, e.g., proteomics or ribosome profiling approaches. Since experimental verification of respective small proteins is entirely based on earlier identified sORFs, they are often also designated short open reading frame-encoded peptides (SEPs).

FIG 4.

FIG 4

Predicted very small proteins in M. mazei. (A) Bar plot representing the frequency of all predicted sORFs that are less than 50 aa in length (1,330 ORFs), color coded by green for canonical (394 ORFs) and blue for alternative start (936 ORFs). (B) Bar plot representing the frequency of all predicted very small proteins (gray) (1,330 sORFs) based on numerous transcriptome sequencing (RNA-seq) sets and all verified very small proteins (blue), depending on their length in amino acids. Plots were generated using the library ggplot2 in R (132).

The first experimental verification of small proteins from M. mazei was performed in 2015 (2). The soluble proteome was analyzed by LC-MS/MS using the plasma-free separation method to detect seven small proteins under strict criteria (FDR = 1%), experimentally verifying three small proteins predicted by Jäger et al. (34), namely sP44 (23 aa), sP41 (53 aa), and sP36 (61 aa). Based on the significant upregulation of spRNA36 and the respective small protein (sP36) under nitrogen starvation, a function of sP36 in the context of the nitrogen metabolism was proposed (2). In a following semi-top-down approach comparing 2D LC-MS and a gel-free LC approach, eight small proteins were verified with high confidence (FDR = 1%). Not using the strict criteria, an additional six small proteins were verified and identified with medium confidence (FDR = 5%), including sP26 (38). Within the scope of method optimization, an acetonitrile-based depletion method for the high-molecular-weight proteome (<15 kDa) was established and evaluated. Using this approach, three additional so far unknown small proteins were verified. While the depletion method removed approximately 70% of the high-molecular-weight proteome, it did not result in a significant higher detection rate for small proteins. However, depletion of the high-molecular-weight proteome resulted in significantly improved identification metrics for those small proteins that were detected (40). To evaluate the impact of the gel staining methods in a GeLC-MS, in which an SDS-PAGE is performed prior to LC-MS/MS, the low-molecular-weight proteome of M. mazei grown under nitrogen starvation conditions was analyzed. Within this study, an additional 24 predicted small proteins, ranging from 8 to 69 aa, were verified under strict criteria (39). In another recent study, a multidimensional top-down analysis was performed that enabled the detection of several small proteins and provided evidence for a number of posttranslational modifications (37). The utilization of a solid-phase extraction methodology, followed by strong cation exchange and then either top-down or bottom-up LC-MS allowed detection of 36 proteoforms belonging to 12 small proteins. The major advantage of this approach was the ability to identified intact proteoforms, allowing for the identification of N and C termini, as well as a number of biological posttranslational modifications (e.g., disulfide bridges). Importantly, this workflow enabled the characterization of five small proteins, of which one was newly verified. The extensive analysis also uncovered several interesting characteristics of the verified small proteins, e.g., alternate translation initiation sites, and argued for posttranslational processing of several small proteins (N- or C-terminal truncations). Interestingly, this approach also revealed the existence of N-terminal formylation (of small proteins) in archaea, which was not previously known, and thus further illustrates the diversity of modifications of small proteins (37) Very recently, a systematic evaluation of a multiprotease approach to improve both the sequence coverage and number of identified peptides for small proteins was performed. This comprehensive approach resulted in the identification of 91 proteins with at least two unique peptides and clearly improved the identification and molecular characterization of small proteins (46). Overall, with the different optimization steps described above, 58 small proteins smaller than 51 aa have been identified and experimentally verified with high confidence (FDR = 1%). The length distribution of these is depicted in Fig. 4B. In agreement with the predictions, the majority of them (70.4%) show a noncanonical start. A recent study concerning the average number of sORFs per genome that analyzed 109 bacterial genomes showed that approximately 16% ± 9% of total coding ORFs might be coding for small proteins (21). If this tendency is applicable for Archaea, one can predict 263 to 713 sORFs for M. mazei (compare to 3,490 large ORFs identified in M. mazei [131]).

CONCLUSION AND OUTLOOK

Most small proteins are single discoveries and only recently has the hidden world of small proteins present in archaeal genomes been recognized using genome-wide approaches. First biochemical characterizations indicate that the majority of identified small proteins have specific functions. As has been shown in bacteria, small archaeal proteins are often involved in survival under stressful conditions, under which it is important to adapt to various degrees. The establishment of effective identification and verification tools for small proteins in archaea (as well as in bacteria) has advanced in recent years, often through the combination of genome-wide approaches (RNA-seq, RiboSeq, and peptidomics). Besides, combining bottom-up and top-down proteomics enables identification of posttranslational modifications. Moreover, a better understanding of the molecular mechanisms involved in sORF evolution, their evolutionary rates, and selection pressure on codon usage will help to identify sORFs using computational tools. Consequently, future studies of the small proteome in archaea and the elucidation of their biological functions will provide novel insights into the full repertoire and function of the archaeal small proteome and substantially complement our understanding of small proteins in archaea. Accordingly, unveiling this new layer of biological information in archaea as well as in bacteria is challenging, but it has an enormous potential to discover surprising and unexpected results and thus might trigger a major change in our current thinking.

Biographies

Katrin Weidenbach performed her Ph.D. at Göttingen University (Germany). She is a senior postdoctoral researcher in the group of Ruth Schmitz, Institute for General Microbiology (IFAM) at Kiel University (Germany), focusing on small proteins in archaea.

Miriam Gutt is currently performing her Ph.D. on small archaeal proteins in the group of Ruth Schmitz.

Liam Cassidy acquired a Ph.D. from the Australian National University and is now a postdoctoral researcher in the Systematic Proteomics Workgroup of Andreas Tholey at the University of Kiel.

Cynthia Chibani performed her Ph.D. at Göttingen University. She is currently a postdoctoral researcher in the group of Ruth Schmitz, IFAM at Kiel University, mainly focusing on bioinformatic analysis of metagenomes and viromes.

Ruth A. Schmitz performed her Ph.D. in biochemistry and microbiology at Marburg University, worked as a postdoctoral researcher at the University of California, Berkeley (USA), and habilitated in microbiology and genetics at Göttingen University. Since 2004, she heads the IFAM at Kiel University.

Contributor Information

Ruth A. Schmitz, Email: rschmitz@ifam.uni-kiel.de.

Tina M. Henkin, Ohio State University

REFERENCES

  • 1.Basrai MA, Hieter P, Boeke JD. 1997. Small open reading frames: beautiful needles in the haystack. Genome Res 7:768–771. 10.1101/gr.7.8.768. [DOI] [PubMed] [Google Scholar]
  • 2.Prasse D, Thomsen J, De Santis R, Muntel J, Becher D, Schmitz RA. 2015. First description of small proteins encoded by spRNAs in Methanosarcina mazei strain Gö1. Biochimie 117:138–148. 10.1016/j.biochi.2015.04.007. [DOI] [PubMed] [Google Scholar]
  • 3.Storz G, Wolf YI, Ramamurthi KS. 2014. Small proteins can no longer be ignored. Annu Rev Biochem 83:753–777. 10.1146/annurev-biochem-070611-102400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Lovenberg W, Sobel BE. 1965. Rubredoxin: a new electron transfer protein from Clostridium pasteurianum. Proc Natl Acad Sci U S A 54:193–199. 10.1073/pnas.54.1.193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Hobbs EC, Yin X, Paul BJ, Astarita JL, Storz G. 2012. Conserved small protein associates with the multidrug efflux pump AcrB and differentially affects antibiotic resistance. Proc Natl Acad Sci U S A 109:16696–16701. 10.1073/pnas.1210093109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Orr MW, Mao Y, Storz G, Qian S-B. 2019. Alternative ORFs and small ORFs: shedding light on the dark proteome. Nucleic Acids Res 48:1029–1042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Alix E, Blanc-Potard A-B. 2009. Hydrophobic peptides: novel regulators within bacterial membrane. Mol Microbiol 72:5–11. 10.1111/j.1365-2958.2009.06626.x. [DOI] [PubMed] [Google Scholar]
  • 8.Su M, Ling Y, Yu J, Wu J, Xiao J. 2013. Small proteins: untapped area of potential biological importance. Front Genet 4:286. 10.3389/fgene.2013.00286. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hellens RP, Brown CM, Chisnall MAW, Waterhouse PM, Macknight RC. 2016. The emerging world of small ORFs. Trends Plant Sci 21:317–328. 10.1016/j.tplants.2015.11.005. [DOI] [PubMed] [Google Scholar]
  • 10.Plaza S, Menschaert G, Payre F. 2017. In search of lost small peptides. Annu Rev Cell Dev Biol 33:391–416. 10.1146/annurev-cellbio-100616-060516. [DOI] [PubMed] [Google Scholar]
  • 11.Khitun A, Slavoff SA. 2019. Proteomic detection and validation of translated small open reading frames. Curr Protoc Chem Biol 11:e77. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Makarova KS, Wolf YI, Koonin EV. 2019. Towards functional characterization of archaeal genomic dark matter. Biochem Soc Trans 47:389–398. 10.1042/BST20180560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Hemm MR, Weaver J, Storz G. 2020. Escherichia coli small proteome. EcoSal Plus 9. 10.1128/ecosalplus.ESP-0031-2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Steinberg R, Koch H-G. 2021. The largely unexplored biology of small proteins in pro- and eukaryotes. FEBS J 10.1111/febs.15845. [DOI] [PubMed] [Google Scholar]
  • 15.Andrews SJ, Rothnagel JA. 2014. Emerging evidence for functional peptides encoded by short open reading frames. Nat Rev Genet 15:193–204. 10.1038/nrg3520. [DOI] [PubMed] [Google Scholar]
  • 16.Chugunova A, Navalayeu T, Dontsova O, Sergiev P. 2018. Mining for small translated ORFs. J Proteome Res 17:1–11. 10.1021/acs.jproteome.7b00707. [DOI] [PubMed] [Google Scholar]
  • 17.Duval M, Cossart P. 2017. Small bacterial and phagic proteins: an updated view on a rapidly moving field. Curr Opin Microbiol 39:81–88. 10.1016/j.mib.2017.09.010. [DOI] [PubMed] [Google Scholar]
  • 18.Fozo EM, Hemm MR, Storz G. 2008. Small toxic proteins and the antisense RNAs that repress them. Microbiol Mol Biol Rev 72:579–589. 10.1128/MMBR.00025-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Warren AS, Archuleta J, Feng WC, Setubal JC. 2010. Missing genes in the annotation of prokaryotic genomes. BMC Bioinformatics 11:131. 10.1186/1471-2105-11-131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Samayoa J, Yildiz FH, Karplus K. 2011. Identification of prokaryotic small proteins using a comparative genomic approach. Bioinformatics 27:1765–1771. 10.1093/bioinformatics/btr275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Miravet-Verde S, Ferrar T, Espadas-García G, Mazzolini R, Gharrab A, Sabido E, Serrano L, Lluch-Senar M. 2019. Unraveling the hidden universe of small proteins in bacterial genomes. Mol Syst Biol 15:e8290. 10.15252/msb.20188290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Dujon B, Alexandraki D, André B, Ansorge W, Baladron V, Ballesta JP, Banrevi A, Bolle PA, Bolotin-Fukuhara M, Bossier P, Bou G, Boyer J, Bultrago MJ, Cheret G, Colleaux L, Dalgnan-Fornler B, del Rey F, Dlon C, Domdey H, Düsterhoft A, Düsterhus S, Entlan KD, Erfle H, Esteban PF, Feldmann H, Fernandes L, Robo GM, Fritz C, Fukuhara H, Gabel C, Gaillon L, Carcia-Cantalejo JM, Garcia-Ramirez JJ, Gent NE, Ghazvini M, Goffeau A, Gonzaléz A, Grothues D, Guerreiro P, Hegemann J, Hewitt N, Hilger F, Hollenberg CP, Horaitis O, Indge KJ, Jacquier A, James CM, Jauniaux C, Jimenez A, Keuchel H, et al. 1994. Complete DNA sequence of yeast chromosome XI. Nature 369:371–378. 10.1038/369371a0. [DOI] [PubMed] [Google Scholar]
  • 23.Goffeau A, Barrell BG, Bussey H, Davis RW, Dujon B, Feldmann H, Galibert F, Hoheisel JD, Jacq C, Johnston M, Louis EJ, Mewes HW, Murakami Y, Philippsen P, Tettelin H, Oliver SG. 1996. Life with 6000 genes. Science 274:546, 563–567. [DOI] [PubMed] [Google Scholar]
  • 24.Harrison PM, Kumar A, Lang N, Snyder M, Gerstein M. 2002. A question of size: the eukaryotic proteome and the problems in defining it. Nucleic Acids Res 30:1083–1090. 10.1093/nar/30.5.1083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Laemmli UK. 1970. Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature 227:680–685. 10.1038/227680a0. [DOI] [PubMed] [Google Scholar]
  • 26.Schägger H. 2006. Tricine-SDS-PAGE. Nat Protoc 1:16–22. 10.1038/nprot.2006.4. [DOI] [PubMed] [Google Scholar]
  • 27.Weaver J, Mohammad F, Buskirk AR, Storz G. 2019. Identifying small proteins by ribosome profiling with stalled initiation complexes. mBio 10:e02819-18. 10.1128/mBio.02819-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Cassidy L, Kaulich PT, Maaß S, Bartel J, Becher D, Tholey A. 2021. Bottom-up and top-down proteomic approaches for the identification, characterization, and quantification of the low molecular weight proteome with focus on short open reading frame-encoded peptides. Proteomics 10.1002/pmic.202100008. [DOI] [PubMed] [Google Scholar]
  • 29.Ma J, Ward CC, Jungreis I, Slavoff SA, Schwaid AG, Neveu J, Budnik BA, Kellis M, Saghatelian A. 2014. Discovery of human sORF-encoded polypeptides (SEPs) in cell lines and tissue. J Proteome Res 13:1757–1765. 10.1021/pr401280w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Thomm M, Stetter KO, Zillig W. 1982. Histone-like proteins in eu- and archaebacteria. Zentralbl Bakteriol Mikrobiol Hyg I Abt Orig C 3:128–139. [Google Scholar]
  • 31.Klein C, Aivaliotis M, Olsen JV, Falb M, Besir H, Scheffer B, Bisle B, Tebbe A, Konstantinidis K, Siedler F, Pfeiffer F, Mann M, Oesterhelt D. 2007. The low molecular weight proteome of Halobacterium salinarum. J Proteome Res 6:1510–1518. 10.1021/pr060634q. [DOI] [PubMed] [Google Scholar]
  • 32.Tarasov VY, Besir H, Schwaiger R, Klee K, Furtwängler K, Pfeiffer F, Oesterhelt D. 2008. A small protein from the bop-brp intergenic region of Halobacterium salinarum contains a zinc finger motif and regulates bop and crtB1 transcription. Mol Microbiol 67:772–780. 10.1111/j.1365-2958.2007.06081.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Blake PR, Park JB, Bryant FO, Aono S, Magnuson JK, Eccleston E, Howard JB, Summers MF, Adams MWW. 1991. Determinants of protein hyperthermostability: purification and amino acid sequence of rubredoxin from the hyperthermophilic archaebacterium Pyrococcus furiosus and secondary structure of the zinc adduct by NMR. Biochemistry 30:10885–10895. 10.1021/bi00109a012. [DOI] [PubMed] [Google Scholar]
  • 34.Jäger D, Sharma CM, Thomsen J, Ehlers C, Vogel J, Schmitz RA. 2009. Deep sequencing analysis of the Methanosarcina mazei Gö1 transcriptome in response to nitrogen availability. Proc Natl Acad Sci U S A 106:21878–21882. 10.1073/pnas.0909051106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Chen Z, Yu H, Li L, Hu S, Dong X. 2012. The genome and transcriptome of a newly described psychrophilic archaeon, Methanolobus psychrophilus R15, reveal its cold adaptive characteristics. Environ Microbiol Rep 4:633–641. 10.1111/j.1758-2229.2012.00389.x. [DOI] [PubMed] [Google Scholar]
  • 36.Zhang Y, Fonslow BR, Shan B, Baek MC, Yates JR, 3rd.. 2013. Protein analysis by shotgun/bottom-up proteomics. Chem Rev 113:2343–2394. 10.1021/cr3003533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Cassidy L, Helbig AO, Kaulich PT, Weidenbach K, Schmitz RA, Tholey A. 2021. Multidimensional separation schemes enhance the identification and molecular characterization of low molecular weight proteomes and short open reading frame-encoded peptides in top-down proteomics. J Proteomics 230:103988. 10.1016/j.jprot.2020.103988. [DOI] [PubMed] [Google Scholar]
  • 38.Cassidy L, Prasse D, Linke D, Schmitz RA, Tholey A. 2016. Combination of bottom-up 2D-LC-MS and semi-top-down GelFree-LC-MS Enhances coverage of proteome and low molecular weight short open reading frame encoded peptides of the archaeon Methanosarcina mazei. J Proteome Res 15:3773–3783. 10.1021/acs.jproteome.6b00569. [DOI] [PubMed] [Google Scholar]
  • 39.Kaulich PT, Cassidy L, Weidenbach K, Schmitz RA, Tholey A. 2020. Complementarity of different SDS-PAGE Gel staining methods for the identification of short open reading frame-encoded peptides. Proteomics 20:2000084. 10.1002/pmic.202000084. [DOI] [PubMed] [Google Scholar]
  • 40.Cassidy L, Kaulich PT, Tholey A. 2019. Depletion of High-molecular-mass proteins for the identification of small proteins and short open reading frame encoded peptides in cellular proteomes. J Proteome Res 18:1725–1734. 10.1021/acs.jproteome.8b00948. [DOI] [PubMed] [Google Scholar]
  • 41.Petruschke H, Anders J, Stadler PF, Jehmlich N, von Bergen M. 2020. Enrichment and identification of small proteins in a simplified human gut microbiome. J Proteomics 213:103604. 10.1016/j.jprot.2019.103604. [DOI] [PubMed] [Google Scholar]
  • 42.Bartel J, Varadarajan AR, Sura T, Ahrens CH, Maaß S, Becher D. 2020. Optimized proteomics workflow for the detection of small proteins. J Proteome Res 19:4004–4018. 10.1021/acs.jproteome.0c00286. [DOI] [PubMed] [Google Scholar]
  • 43.Greening DW, Simpson RJ. 2010. A centrifugal ultrafiltration strategy for isolating the low-molecular weight (≤ 25 K) component of human plasma proteome. J Proteomics 73:637–648. 10.1016/j.jprot.2009.09.013. [DOI] [PubMed] [Google Scholar]
  • 44.Harney DJ, Hutchison AT, Su Z, Hatchwell L, Heilbronn LK, Hocking S, James DE, Larance M. 2019. Small-protein enrichment assay enables the rapid, unbiased analysis of over 100 low abundance factors from human plasma. Mol Cell Proteomics 18:1899–1915. 10.1074/mcp.TIR119.001562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Cardon T, Hervé F, Delcourt V, Roucou X, Salzet M, Franck J, Fournier I. 2020. Optimized sample preparation workflow for improved identification of ghost proteins. Anal Chem 92:1122–1129. 10.1021/acs.analchem.9b04188. [DOI] [PubMed] [Google Scholar]
  • 46.Kaulich PT, Cassidy L, Bartel J, Schmitz RA, Tholey A. 2021. Multi-protease approach for the improved identification and molecular characterization of small proteins and short open reading frame-encoded peptides. J Proteome Res 20:2895–2903. 10.1021/acs.jproteome.1c00115. [DOI] [PubMed] [Google Scholar]
  • 47.Ingolia NT, Ghaemmaghami S, Newman JRS, Weissman JS. 2009. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science 324:218–223. 10.1126/science.1168978. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Neuhaus K, Landstorfer R, Simon S, Schober S, Wright PR, Smith C, Backofen R, Wecko R, Keim DA, Scherer S. 2017. Differentiation of ncRNAs from small mRNAs in Escherichia coli O157:H7 EDL933 (EHEC) by combined RNAseq and Riboseq—ryhB encodes the regulatory RNA RyhB and a peptide. RyhP BMC Genomics 18:216–216. 10.1186/s12864-017-3586-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Venturini E, Svensson SL, Maaß S, Gelhausen R, Eggenhofer F, Li L, Cain AK, Parkhill J, Becher D, Backofen R, Barquist L, Sharma CM, Westermann AJ, Vogel J. 2020. A global data-driven census of Salmonella small proteins and their potential functions in bacterial virulence. microLife 1:uqaa002. 10.1093/femsml/uqaa002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Gelsinger DR, Dallon E, Reddy R, Mohammad F, Buskirk AR, DiRuggiero J. 2020. Ribosome profiling in archaea reveals leaderless translation, novel translational initiation sites, and ribosome pausing at single codon resolution. Nucleic Acids Res 48:5201–5216. 10.1093/nar/gkaa304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Canestrari JG, Lasek-Nesselquist E, Upadhyay A, Rofaeil M, Champion MM, Wade JT, Derbyshire KM, Gray TA. 2020. Polycysteine-encoding leaderless short ORFs function as cysteine-responsive attenuators of operonic gene expression in mycobacteria. Mol Microbiol 114:93–108. 10.1111/mmi.14498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Gutt M, Jordan B, Weidenbach K, Gudzuhn M, Kiessling C, Cassidy L, Helbig A, Tholey A, Pyper DJ, Kubatova N, Schwalbe H, Schmitz RA. 2021. High complexity of glutamine synthetase regulation in Methanosarcina mazei: small protein 26 interacts and enhances glutamine synthetase activity. FEBS J 288:5350–5373. 10.1111/febs.15799. [DOI] [PubMed] [Google Scholar]
  • 53.Levin HL, Schachman HK. 1985. Regulation of aspartate transcarbamoylase synthesis in Escherichia coli: analysis of deletion mutations in the promoter region of the pyrBI operon. Proc Natl Acad Sci U S A 82:4643–4647. 10.1073/pnas.82.14.4643. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Park SY, Cromie MJ, Lee EJ, Groisman EA. 2010. A bacterial mRNA leader that employs different mechanisms to sense disparate intracellular signals. Cell 142:737–748. 10.1016/j.cell.2010.07.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Kubatova N, Pyper DJ, Jonker HRA, Saxena K, Remmel L, Richter C, Brantl S, Evguenieva-Hackenberg E, Hess WR, Klug G, Marchfelder A, Soppa J, Streit W, Mayzel M, Orekhov VY, Fuxreiter M, Schmitz RA, Schwalbe H. 2020. Rapid biophysical characterization and NMR spectroscopy structural analysis of small proteins from bacteria and archaea. ChemBioChem 21:1178–1187. 10.1002/cbic.201900677. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Zahn S, Kubatova N, Pyper DJ, Cassidy L, Saxena K, Tholey A, Schwalbe H, Soppa J. 2021. Biological functions, genetic and biochemical characterization, and NMR structure determination of the small zinc finger protein HVO_2753 from Haloferax volcanii. FEBS J 288:2042–2062. 10.1111/febs.15559. [DOI] [PubMed] [Google Scholar]
  • 57.Kubatova N, Jonker HRA, Saxena K, Richter C, Vogel V, Schreiber S, Marchfelder A, Schwalbe H. 2020. Solution structure and dynamics of the small protein HVO_2922 from Haloferax volcanii. Chembiochem 21:149–156. 10.1002/cbic.201900085. [DOI] [PubMed] [Google Scholar]
  • 58.Kremer W, Schuler B, Harrieder S, Geyer M, Gronwald W, Welker C, Jaenicke R, Kalbitzer HR. 2001. Solution NMR structure of the cold-shock protein from the hyperthermophilic bacterium Thermotoga maritima. Eur J Biochem 268:2527–2539. 10.1046/j.1432-1327.2001.02127.x. [DOI] [PubMed] [Google Scholar]
  • 59.Nagel C, Machulla A, Zahn S, Soppa J. 2019. Several one-domain zinc finger µ-proteins of Haloferax volcanii are important for stress adaptation, biofilm formation, and swarming. Genes (Basel) 10:361. 10.3390/genes10050361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Jevtić Ž, Stoll B, Pfeiffer F, Sharma K, Urlaub H, Marchfelder A, Lenz C. 2019. The response of Haloferax volcanii to salt and temperature stress: a proteome study by label-free mass spectrometry. Proteomics 19:e1800491. [DOI] [PubMed] [Google Scholar]
  • 61.Reeve JN. 2003. Archaeal chromatin and transcription. Mol Microbiol 48:587–598. 10.1046/j.1365-2958.2003.03439.x. [DOI] [PubMed] [Google Scholar]
  • 62.Sandman K, Reeve JN. 2006. Archaeal histones and the origin of the histone fold. Curr Opin Microbiol 9:520–525. 10.1016/j.mib.2006.08.003. [DOI] [PubMed] [Google Scholar]
  • 63.Mattiroli F, Bhattacharyya S, Dyer PN, White AE, Sandman K, Burkhart BW, Byrne KR, Lee T, Ahn NG, Santangelo TJ, Reeve JN, Luger K. 2017. Structure of histone-based chromatin in Archaea. Science 357:609–612. 10.1126/science.aaj1849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Stevens KM, Swadling JB, Hocher A, Bang C, Gribaldo S, Schmitz RA, Warnecke T. 2020. Histone variants in archaea and the evolution of combinatorial chromatin complexity. Proc Natl Acad Sci U S A 117:33384–33395. 10.1073/pnas.2007056117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Kalichuk V, Béhar G, Renodon-Cornière A, Danovski G, Obal G, Barbet J, Mouratou B, Pecorari F. 2016. The archaeal “7 kDa DNA-binding” proteins: extended characterization of an old gifted family. Sci Rep 6:37274. 10.1038/srep37274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Castellano S, Farina B, Faraone-Mennella MR. 2009. The ADP-ribosylation of Sulfolobus solfataricus Sso7 modulates protein/DNA interactions in vitro. FEBS Lett 583:1154–1158. 10.1016/j.febslet.2009.03.003. [DOI] [PubMed] [Google Scholar]
  • 67.Hsu CH, Wang AH. 2011. The DNA-recognition fold of Sso7c4 suggests a new member of SpoVT-AbrB superfamily from archaea. Nucleic Acids Res 39:6764–6774. 10.1093/nar/gkr283. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Lipps G. 2009. Molecular biology of the pRN1 plasmid from Sulfolobus islandicus. Biochem Soc Trans 37:42–45. 10.1042/BST0370042. [DOI] [PubMed] [Google Scholar]
  • 69.Lipps G, Stegert M, Krauss G. 2001. Thermostable and site-specific DNA binding of the gene product ORF56 from the Sulfolobus islandicus plasmid pRN1, a putative archael plasmid copy control protein. Nucleic Acids Res 29:904–913. 10.1093/nar/29.4.904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Zeeb M, Lipps G, Lilie H, Balbach J. 2004. Folding and association of an extremely stable dimeric protein from Sulfolobus islandicus. J Mol Biol 336:227–240. 10.1016/j.jmb.2003.12.003. [DOI] [PubMed] [Google Scholar]
  • 71.Berkner S, Grogan D, Albers SV, Lipps G. 2007. Small multicopy, non-integrative shuttle vectors based on the plasmid pRN1 for Sulfolobus acidocaldarius and Sulfolobus solfataricus, model organisms of the (cren-)archaea. Nucleic Acids Res 35:e88. 10.1093/nar/gkm449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Weininger U, Zeeb M, Neumann P, Löw C, Stubbs MT, Lipps G, Balbach J. 2009. Structure-based stability analysis of an extremely stable dimeric DNA binding protein from Sulfolobus islandicus. Biochemistry 48:10030–10037. 10.1021/bi900760n. [DOI] [PubMed] [Google Scholar]
  • 73.Yao NY, O’Donnell M. 2012. The RFC clamp loader: structure and function. Subcell Biochem 62:259–279. 10.1007/978-94-007-4572-8_14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Vivona JB, Kelman Z. 2003. The diverse spectrum of sliding clamp interacting proteins. FEBS Lett 546:167–172. 10.1016/s0014-5793(03)00622-7. [DOI] [PubMed] [Google Scholar]
  • 75.Moldovan GL, Pfander B, Jentsch S. 2007. PCNA, the maestro of the replication fork. Cell 129:665–679. 10.1016/j.cell.2007.05.003. [DOI] [PubMed] [Google Scholar]
  • 76.Li Z, Santangelo TJ, Cuboňová L, Reeve JN, Kelman Z. 2010. Affinity purification of an archaeal DNA replication protein network. mBio 1:e00221-10. 10.1128/mBio.00221-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Altieri AS, Ladner JE, Li Z, Robinson H, Sallman ZF, Marino JP, Kelman Z. 2016. A small protein inhibits proliferating cell nuclear antigen by breaking the DNA clamp. Nucleic Acids Res 44:10015–10015. 10.1093/nar/gkw824. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Klein BJ, Bose D, Baker KJ, Yusoff ZM, Zhang X, Murakami KS. 2011. RNA polymerase and transcription elongation factor Spt4/5 complex structure. Proc Natl Acad Sci U S A 108:546–550. 10.1073/pnas.1013828108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Hirtreiter A, Damsma GE, Cheung AC, Klose D, Grohmann D, Vojnic E, Martin AC, Cramer P, Werner F. 2010. Spt4/5 stimulates transcription elongation through the RNA polymerase clamp coiled-coil motif. Nucleic Acids Res 38:4040–4051. 10.1093/nar/gkq135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Landick R. 2006. The regulatory roles and mechanism of transcriptional pausing. Biochem Soc Trans 34:1062–1066. 10.1042/BST0341062. [DOI] [PubMed] [Google Scholar]
  • 81.Anantharaman V, Koonin EV, Aravind L. 2001. TRAM, a predicted RNA-binding domain, common to tRNA uracil methylation and adenine thiolation enzymes. FEMS Microbiology Lett 197:215–221. 10.1111/j.1574-6968.2001.tb10606.x. [DOI] [PubMed] [Google Scholar]
  • 82.Campanaro S, Williams TJ, Burg DW, De Francisci D, Treu L, Lauro FM, Cavicchioli R. 2011. Temperature-dependent global gene expression in the Antarctic archaeon Methanococcoides burtonii. Environ Microbiol 13:2018–2038. 10.1111/j.1462-2920.2010.02367.x. [DOI] [PubMed] [Google Scholar]
  • 83.Zhang B, Yue L, Zhou L, Qi L, Li J, Dong X. 2017. Conserved TRAM domain functions as an archaeal cold shock protein via RNA chaperone activity. Front Microbiol 8. 10.3389/fmicb.2017.01597. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Li Y, Liu PP, Ni X. 2019. Molecular evolution and functional analysis of rubredoxin-like proteins in plants. BioMed Res Int 2019:2932585–2932585. 10.1155/2019/2932585. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Duan J, Li L, Lu J, Wang W, Ye K. 2009. Structural mechanism of substrate RNA recruitment in H/ACA RNA-guided pseudouridine synthase. Mol Cell 34:427–439. 10.1016/j.molcel.2009.05.005. [DOI] [PubMed] [Google Scholar]
  • 86.Hamma T, Reichow SL, Varani G, Ferré-D'Amaré AR. 2005. The Cbf5-Nop10 complex is a molecular bracket that organizes box H/ACA RNPs. Nat Struct Mol Biol 12:1101–1107. 10.1038/nsmb1036. [DOI] [PubMed] [Google Scholar]
  • 87.Manival X, Charron C, Fourmann JB, Godard F, Charpentier B, Branlant C. 2006. Crystal structure determination and site-directed mutagenesis of the Pyrococcus abyssi aCBF5-aNOP10 complex reveal crucial roles of the C-terminal domains of both proteins in H/ACA sRNP activity. Nucleic Acids Res 34:826–839. 10.1093/nar/gkj482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Pfeiffer M, Bestgen H, Bürger A, Klein A. 1998. The vhuU gene encoding a small subunit of a selenium-containing [NiFe]-hydrogenase in Methanococcus voltae appears to be essential for the cell. Arch Microbiol 170:418–426. 10.1007/s002030050662. [DOI] [PubMed] [Google Scholar]
  • 89.Sorgenfrei O, Linder D, Karas M, Klein A. 1993. A novel very small subunit of a selenium containing [NiFe] hydrogenease of Methanococcus voltae is postranslationally processed by cleavage at a defined position. Eur J Biochem 213:1355–1358. 10.1111/j.1432-1033.1993.tb17888.x. [DOI] [PubMed] [Google Scholar]
  • 90.Sorgenfrei O, Müller S, Pfeiffer M, Sniezko I, Klein A. 1997. The [NiFe] hydrogenases of Methanococcus voltae: genes, enzymes and regulation. Arch Microbiol 167:189–195. 10.1007/s002030050434. [DOI] [PubMed] [Google Scholar]
  • 91.Pfeiffer M, Bingemann R, Klein A. 1998. Fusion of two subunits does not impair the function of a [NiFeSe]-hydrogenase in the archaeon Methanococcus voltae. Eur J Biochem 256:447–452. 10.1046/j.1432-1327.1998.2560447.x. [DOI] [PubMed] [Google Scholar]
  • 92.Bingemann R, Pierik AJ, Klein A. 2000. Influence of the fusion of two subunits of the F420-non-reducing hydrogenase of Methanococcus voltae on its biochemical properties. Arch Microbiol 174:375–378. 10.1007/s002030000213. [DOI] [PubMed] [Google Scholar]
  • 93.Harms A, Brodersen DE, Mitarai N, Gerdes K. 2018. Toxins, targets, and triggers: an overview of toxin-antitoxin biology. Mol Cell 70:768–784. 10.1016/j.molcel.2018.01.003. [DOI] [PubMed] [Google Scholar]
  • 94.Jurėnas D, Van Melderen L. 2020. The variety in the common theme of translation inhibition by type II toxin-antitoxin systems. Front Genet 11:262. 10.3389/fgene.2020.00262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Gerdes K. 2000. Toxin-antitoxin modules may regulate synthesis of macromolecules during nutritional stress. J Bacteriol 182:561–572. 10.1128/JB.182.3.561-572.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Galvani C, Terry J, Ishiguro EE. 2001. Purification of the RelB and RelE proteins ofEscherichia coli: RelE binds to RelB and to ribosomes. J Bacteriol 183:2700–2703. 10.1128/JB.183.8.2700-2703.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Francuski D, Saenger W. 2009. Crystal structure of the antitoxin-toxin protein complex RelB-RelE from Methanococcus jannaschii. J Mol Biol 393:898–908. 10.1016/j.jmb.2009.08.048. [DOI] [PubMed] [Google Scholar]
  • 98.Takagi H, Kakuta Y, Okada T, Yao M, Tanaka I, Kimura M. 2005. Crystal structure of archaeal toxin-antitoxin RelE-RelB complex with implications for toxin activity and antitoxin effects. Nat Struct Mol Biol 12:327–331. 10.1038/nsmb911. [DOI] [PubMed] [Google Scholar]
  • 99.Iyer LM, Burroughs AM, Aravind L. 2006. The prokaryotic antecedents of the ubiquitin-signaling system and the early evolution of ubiquitin-like β-grasp domains. Genome Biol 7:R60. 10.1186/gb-2006-7-7-r60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Hochstrasser M. 2000. Evolution and function of ubiquitin-like protein-conjugation systems. Nat Cell Biol 2:E153—E157. 10.1038/35019643. [DOI] [PubMed] [Google Scholar]
  • 101.Hochstrasser M. 2009. Origin and function of ubiquitin-like proteins. Nature 458:422–429. 10.1038/nature07958. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Hershko A, Ciechanover A. 1992. The ubiquitin system for protein degradation. Annu Rev Biochem 61:761–807. 10.1146/annurev.bi.61.070192.003553. [DOI] [PubMed] [Google Scholar]
  • 103.Kerscher O, Felberbaum R, Hochstrasser M. 2006. Modification of proteins by ubiquitin and ubiquitin-like proteins. Annu Rev Cell Dev Biol 22:159–180. 10.1146/annurev.cellbio.22.010605.093503. [DOI] [PubMed] [Google Scholar]
  • 104.Welchman RL, Gordon C, Mayer RJ. 2005. Ubiquitin and ubiquitin-like proteins as multifunctional signals. Nat Rev Mol Cell Biol 6:599–609. 10.1038/nrm1700. [DOI] [PubMed] [Google Scholar]
  • 105.Darwin KH, Hofmann K. 2010. SAMPyling proteins in archaea. Trends Biochem Sci 35:348–351. 10.1016/j.tibs.2010.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Humbard MA, Miranda HV, Lim JM, Krause DJ, Pritz JR, Zhou G, Chen S, Wells L, Maupin-Furlow JA. 2010. Ubiquitin-like small archaeal modifier proteins (SAMPs) in Haloferax volcanii. Nature 463:54–60. 10.1038/nature08659. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Li Y, Maciejewski MW, Martin J, Jin K, Zhang Y, Maupin-Furlow JA, Hao B. 2013. Crystal structure of the ubiquitin-like small archaeal modifier protein 2 from Haloferax volcanii. Protein Sci 22:1206–1217. 10.1002/pro.2305. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Ordóñez MV, Guillén J, Nercessian D, Villalaín J, Conde RD. 2011. Secondary structure determination by FTIR of an archaeal ubiquitin-like polypeptide from Natrialba magadii. Eur Biophys J 40:1101–1107. 10.1007/s00249-011-0719-y. [DOI] [PubMed] [Google Scholar]
  • 109.Ordóñez MV, Nercessian D, Conde RD. 2012. Nmag_2608, an extracellular ubiquitin-like domain-containing protein from the haloalkaliphilic archaeon Natrialba magadii. Extremophiles 16:437–446. 10.1007/s00792-012-0443-2. [DOI] [PubMed] [Google Scholar]
  • 110.Ranjan N, Damberger FF, Sutter M, Allain FH, Weber-Ban E. 2011. Solution structure and activation mechanism of ubiquitin-like small archaeal modifier proteins. J Mol Biol 405:1040–1055. 10.1016/j.jmb.2010.11.040. [DOI] [PubMed] [Google Scholar]
  • 111.Hepowit NL, Uthandi S, Miranda HV, Toniutti M, Prunetti L, Olivarez O, De Vera IM, Fanucci GE, Chen S, Maupin-Furlow JA. 2012. Archaeal JAB1/MPN/MOV34 metalloenzyme (HvJAMM1) cleaves ubiquitin-like small archaeal modifier proteins (SAMPs) from protein-conjugates. Mol Microbiol 86:971–987. 10.1111/mmi.12038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Miranda HV, Nembhard N, Su D, Hepowit N, Krause DJ, Pritz JR, Phillips C, Söll D, Maupin-Furlow JA. 2011. E1- and ubiquitin-like proteins provide a direct link between protein conjugation and sulfur transfer in archaea. Proc Natl Acad Sci U S A 108:4417–4422. 10.1073/pnas.1018151108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Miranda HV, Antelmann H, Hepowit N, Chavarria NE, Krause DJ, Pritz JR, Bäsell K, Becher D, Humbard MA, Brocchieri L, Maupin-Furlow JA. 2014. Archaeal ubiquitin-like SAMP3 is isopeptide-linked to proteins via a UbaA-dependent mechanism. Mol Cell Proteomics 13:220–239. 10.1074/mcp.M113.029652. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Egea PF, Stroud RM. 2010. Lateral opening of a translocon upon entry of protein suggests the mechanism of insertion into membranes. Proc Natl Acad Sci U S A 107:17182–17187. 10.1073/pnas.1012556107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Gumbart J, Trabuco LG, Schreiner E, Villa E, Schulten K. 2009. Regulation of the protein-conducting channel by a bound ribosome. Structure 17:1453–1464. 10.1016/j.str.2009.09.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Li W, Schulman S, Boyd D, Erlandson K, Beckwith J, Rapoport TA. 2007. The plug domain of the SecY protein stabilizes the closed state of the translocation channel and maintains a membrane seal. Mol Cell 26:511–521. 10.1016/j.molcel.2007.05.002. [DOI] [PubMed] [Google Scholar]
  • 117.Ménétret JF, Schaletzky J, Clemons WM, Jr, Osborne AR, Skånland SS, Denison C, Gygi SP, Kirkpatrick DS, Park E, Ludtke SJ, Rapoport TA, Akey CW. 2007. Ribosome binding of a single copy of the SecY complex: implications for protein translocation. Mol Cell 28:1083–1092. 10.1016/j.molcel.2007.10.034. [DOI] [PubMed] [Google Scholar]
  • 118.Park E, Ménétret J-F, Gumbart JC, Ludtke SJ, Li W, Whynot A, Rapoport TA, Akey CW. 2014. Structure of the SecY channel during initiation of protein translocation. Nature 506:102–106. 10.1038/nature12720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Van den Berg B, Clemons WM, Jr, Collinson I, Modis Y, Hartmann E, Harrison SC, Rapoport TA. 2004. X-ray structure of a protein-conducting channel. Nature 427:36–44. 10.1038/nature02218. [DOI] [PubMed] [Google Scholar]
  • 120.van Wolferen M, Wagner A, van der Does C, Albers S-V. 2016. The archaeal Ced system imports DNA. Proc Natl Acad Sci U S A 113:2496–2501. 10.1073/pnas.1513740113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Jenney FE, Jr, Adams MW. 2001. Rubredoxin from Pyrococcus furiosus. Methods Enzymol 334:45–55. 10.1016/s0076-6879(01)34457-9. [DOI] [PubMed] [Google Scholar]
  • 122.Nelson KE, Clayton RA, Gill SR, Gwinn ML, Dodson RJ, Haft DH, Hickey EK, Peterson JD, Nelson WC, Ketchum KA, McDonald L, Utterback TR, Malek JA, Linher KD, Garrett MM, Stewart AM, Cotton MD, Pratt MS, Phillips CA, Richardson D, Heidelberg J, Sutton GG, Fleischmann RD, Eisen JA, White O, Salzberg SL, Smith HO, Venter JC, Fraser CM. 1999. Evidence for lateral gene transfer between Archaea and bacteria from genome sequence of Thermotoga maritima. Nature 399:323–329. 10.1038/20601. [DOI] [PubMed] [Google Scholar]
  • 123.Grunden AM, Jenney FE, Jr, Ma K, Ji M, Weinberg MV, Adams MWW. 2005. In vitro reconstitution of an NADPH-dependent superoxide reduction pathway from Pyrococcus furiosus. Appl Environ Microbiol 71:1522–1530. 10.1128/AEM.71.3.1522-1530.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Thorgersen MP, Stirrett K, Scott RA, Adams MWW. 2012. Mechanism of oxygen detoxification by the surprisingly oxygen-tolerant hyperthermophilic archaeon, Pyrococcus furiosus. Proc Natl Acad Sci U S A 109:18547–18552. 10.1073/pnas.1208605109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Jenney FE, Jr, Verhagen MF, Cui X, Adams MW. 1999. Anaerobic microbes: oxygen detoxification without superoxide dismutase. Science 286:306–309. 10.1126/science.286.5438.306. [DOI] [PubMed] [Google Scholar]
  • 126.Weinberg MV, Jenney FE, Jr, Cui X, Adams MWW. 2004. Rubrerythrin from the hyperthermophilic archaeon Pyrococcus furiosus is a rubredoxin-dependent, iron-containing peroxidase. J Bacteriol 186:7888–7895. 10.1128/JB.186.23.7888-7895.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Bieger B, Essen LO, Oesterhelt D. 2003. Crystal structure of halophilic dodecin: a novel, dodecameric flavin binding protein from Halobacterium salinarum. Structure 11:375–385. 10.1016/S0969-2126(03)00048-0. [DOI] [PubMed] [Google Scholar]
  • 128.Grininger M, Staudt H, Johansson P, Wachtveitl J, Oesterhelt D. 2009. Dodecin is the key player in flavin homeostasis of archaea. J Biol Chem 284:13068–13076. 10.1074/jbc.M808063200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129.Grininger M, Zeth K, Oesterhelt D. 2006. Dodecins: a family of lumichrome binding proteins. J Mol Biol 357:842–857. 10.1016/j.jmb.2005.12.072. [DOI] [PubMed] [Google Scholar]
  • 130.Yu S-H, Vogel J, Förstner KU. 2018. ANNOgesic: a Swiss army knife for the RNA-seq based annotation of bacterial/archaeal genomes. GigaScience 7:giy096. 10.1093/gigascience/giy096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 131.Deppenmeier U, Johann A, Hartsch T, Merkl R, Schmitz RA, Martinez-Arias R, Henne A, Wiezer A, Bäumer S, Jacobi C, Brüggemann H, Lienard T, Christmann A, Bömeke M, Steckel S, Bhattacharyya A, Lykidis A, Overbeek R, Klenk HP, Gunsalus RP, Fritz HJ, Gottschalk G. 2002. The genome of Methanosarcina mazei: evidence for lateral gene transfer between bacteria and archaea. J Mol Microbiol Biotechnol 4:453–461. [PubMed] [Google Scholar]
  • 132.Wickham H. 2011. ggplot2. WIREs Comput Stat 3:180–185. 10.1002/wics.147. [DOI] [Google Scholar]
  • 133.Guo M, Xu F, Yamada J, Egelhofer T, Gao Y, Hartzog GA, Teng M, Niu L. 2008. Core structure of the yeast Spt4-Spt5 complex: a conserved module for regulation of transcription elongation. Structure 16:1649–1658. 10.1016/j.str.2008.08.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.Yamaguchi Y, Park JH, Inouye M. 2011. Toxin-antitoxin systems in bacteria and archaea. Annu Rev Genet 45:61–79. 10.1146/annurev-genet-110410-132412. [DOI] [PubMed] [Google Scholar]
  • 135.Chavarria NE, Hwang S, Cao S, Fu X, Holman M, Elbanna D, Rodriguez S, Arrington D, Englert M, Uthandi S, Söll D, Maupin-Furlow JA. 2014. Archaeal Tuc1/Ncs6 homolog required for wobble uridine tRNA thiolation is associated with ubiquitin-proteasome, translation, and RNA processing system homologs. PLoS One 9:e99104. 10.1371/journal.pone.0099104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 136.Jeong YJ, Jeong B-C, Song HK. 2011. Crystal structure of ubiquitin-like small archaeal modifier protein 1 (SAMP1) from Haloferax volcanii. Biochem Biophys Res Commun 405:112–117. 10.1016/j.bbrc.2011.01.004. [DOI] [PubMed] [Google Scholar]
  • 137.Almrud JJ, Dasgupta R, Czerwinski RM, Kern AD, Hackert ML, Whitman CP. 2010. Kinetic and structural characterization of DmpI from Helicobacter pylori and Archaeoglobus fulgidus, two 4-oxalocrotonate tautomerase family members. Bioorg Chem 38:252–259. 10.1016/j.bioorg.2010.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Journal of Bacteriology are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES