Abstract
The expression of the neutral protease gene (npr) from the thermophilic Bacillus sp. BT1 strain was studied in its natural host and in mesophilic Bacillus subtilis. In the thermophilic BT1 strain, the transcription of the protease gene is initiated from its own promoter, just 5′ to the gene. In contrast, in heterologous B. subtilis this thermophilic npr promoter does not function, and expression of the npr gene results from transcription originating upstream of an adjacent gene, open reading frame X (ORF X). A functional promoter was identified 5′ to ORF X that is required for efficient expression of the npr gene in Bacillus subtilis as verified by primer extension, reverse transcription-PCR, and 5′ rapid amplification of cDNA ends experiments. These data suggest that transcriptional signals used in thermophilic Bacillus sp. BT1 strain are different from those used in B. subtilis.
Many species of the genus Bacillus produce a variety of thermostable extracellular enzymes, some of which are industrially important. Among these enzymes, the neutral proteases (Npr) have been extensively studied not only for industrial production but also for the elucidation of mechanisms involved in thermostability of enzymes (6, 7, 16). Several npr genes from thermophilic bacilli have been cloned and expressed in Bacillus subtilis (4, 8, 12, 15) to study the gene and its product in a well-characterized host. However, relatively little is known about the regulation of the expression and promoter utilization of these thermophilic genes in B. subtilis and in natural hosts as well.
The thermostable neutral protease-encoding gene (npr) from a thermophilic strain of Bacillus sp. BT1 was cloned and sequenced (17). The gene was subcloned in B. subtilis on the 4.3-kb Sau3AI fragment and subsequently confined to the shorter 2.2-kb EcoRI-Sau3AI fragment. To further enhance the protease production in B. subtilis the npr gene was placed into the high-copy-number plasmid pGDV1. When cloned into pGDV1, the 4.3-kb fragment (pGBR5) produced about 15-fold more protease than did cloning of the 2.2-kb fragment in pGDV1 (pGBR50) in the protease-deficient B. subtilis DB117, indicating that important information for the expression of the npr gene resided in the 2.1-kb region upstream of the npr gene (17).
To characterize this region several plasmid derivatives of pGBR5 were constructed by generating deletions from the 5′ direction on the 4.3-kb Sau3AI fragment (Fig. 1). The proteolytic activity encoded by plasmid derivatives was determined by using Azocoll (Sigma) as a substrate (1). One unit of proteolytic activity was arbitrarily defined as the amount of enzyme causing an increase in A520 of 0.5 per 30 min. Deletions up to first BsaBI site from Sau3AI end did not affect the protease production (pGBR80 and pGBR75), whereas the deletion of Sau3AI-MscI fragment abolished the enhanced protease production (pGBR70). Therefore, the nucleotide sequence of the 1.5-kb BsaBI-EcoRI fragment was determined and coupled with the already reported sequence of the 2,231-bp EcoRI-Sau3AI fragment containing the npr gene (GenBank accession number U07824 [17]). Within this region, only one large open reading frame (ORF) composed of 1,245 bp was found, and this was designated ORF X (Fig. 1). The ORF X sequence continues through the EcoRI site and ends 112 bp upstream from the npr gene. A potential Shine-Dalgarno (SD) sequence, AGATGGGAGGTGA, extensively complementary (ΔG = −23.4 kcal/mol) to the 3′ terminus of B. subtilis 16S RNA (10) was found upstream from the probable translation start codon ATG (Fig. 2). When comparing the ORF X sequence with database entries, no significant homology to other known genes was found.
FIG. 1.
Localization of the ORF X and npr genes on the cloned 4.3-kb Sau3AI fragment. The position and direction of transcription of the genes are indicated by large arrows. The structure of pGBR5 derivatives is shown below the physical map of the cloned fragment. Black lines indicate the DNA from Bacillus sp. BT1 strain; the open boxes represent deletions of the specific fragments. The protease production encoded by the corresponding plasmid is indicated on the right side.
FIG. 2.
The promoter region of the ORF X gene. The presented region corresponds to nt 481 to 600 in the deposited ORF X sequence. The ORF X coding region starts at the possible initiation codon ATG as marked by the short arrow under the sequence. The suggested SD and promoter (−10 and −35 regions) sequences are underlined. The black arrow above the sequence denotes the doublet of ORF X transcriptional start sites determined in this study.
To analyze the possible role of the ORF X or its gene product in the protease production, plasmids carrying deletions extending into the ORF X gene were constructed from pGBR5 (Fig. 1). Deletion of the major part of the ORF X, which, in addition, changed the reading frame of ORF X (620-bp MscI-EcoRI fragment; pGBR65) had no effect on the enhanced production of the protease, indicating that the potential protein encoded by ORF X is not necessary for activation. However, deletion of the 5′-terminal region of the ORF X (458-bp EarI fragment; pGBR60) probably containing the ORF X regulatory sequences reduced the protease production to the level produced by DB117(pGBR50).
To characterize the expression of the protease gene in B. subtilis more precisely, the transcription start sites of the npr and ORF X genes were mapped by primer extension. Total RNA was isolated from DB117(pGBR5) cells grown in Luria-Bertani medium to early stationary phase using the RNeasy purification kit (Qiagen). The primers NPREX (5′-CGAACGGCCAAGCCATCAATC-3′) and ORFEX (5′-GGCAATATGGCACAGTTCTCG-3′) complementary to 5′ end of the npr (from nucleotides [nt] 443 to 423 in the published sequence) and ORF X (from nt 652 to 632) genes, respectively, were end labeled with [γ-32P]ATP (Amersham) and extended by avian myeloblastosis virus (AMV) reverse transcriptase (Promega) using total RNA extracted from DB117(pGBR5) as a template. Hybridization and extension procedures were carried out essentially as described by Sambrook et al. (13). The synthesized cDNA was precipitated with ethanol, dissolved in gel loading buffer, and analyzed on a 6% polyacrylamide-urea sequencing gel. No apparent signal was detected using the npr-specific primer, whereas as shown in Fig. 3, two cDNA products were obtained using ORF X-specific primer. The obtained products correspond to initiation either at an A or G residue separated by 16 or 15 bp from the presumed ORF X start codon (Fig. 2). The initiation sites are preceded by a plausible −10 region TATAAT, which perfectly matches the consensus for the major B. subtilis ςA factor (10). In addition, an AGGATA sequence 17 bp upstream probably represents the −35 region, with three matches out of six with ςA consensus. Also, the sequence ATATA found at positions −18 to −14 (relative to the transcription initiation site) resembles the less-specific sequence prototype PuTPuTG, proposed for the −18 to −14 positions of B. subtilis strong promoters (6, 10). Also, the presence of an AT-rich sequence (87.5 mol%) positioned between −50 and −35 bases indicates that the thermophilic ORF X promoter represents a functional promoter showing all the necessary features to be efficiently utilized by ςA-RNA polymerase of mesophilic B. subtilis (2, 11). Further confirmation of the results obtained from primer extension experiments was done by testing the promoter activities of the ORF X and npr genes using lacZ transcriptional fusions. The 5′-flanking regions of the npr and ORF X genes were inserted upstream of the lacZ in the vector pBTK2 to construct lacZ fusions. As the integration vector pBTK2 (D. van Sinderen, unpublished data) carries the promoterless spoVG-lacZ indicator gene fusion flanked by 5′ and 3′ parts of the B. subtilis amy gene, the fusions were stably integrated via double recombination with the amy gene into the chromosome of B. subtilis DB117. To construct the npr-lacZ fusion, the 489-bp EcoRI-TthI fragment, including the promoter region and the first 115 bp of the npr, was gel purified, filled in with Klenow DNA polymerase, and ligated to linearized plasmid pBTK2. For that purpose, pBTK2 was digested with EcoRI and the protruding ends were filled in. To construct an ORF X-lacZ fusion, a 458-bp EarI fragment encompassing the regulatory region and the first 65 bp of ORF X was purified, filled in with Klenow DNA polymerase, and ligated with pBTK2 obtained as described above. The ligation mixes were used to transform competent cells of Escherichia coli DH5α (5) to ampicillin and kanamycin resistance. Several clones were screened for plasmid constructs carrying the inserted ORF X or npr promoter DNA in the proper orientation to the lacZ reporter gene. The correct plasmid constructs were linearized with ScaI enzyme and transformed into B. subtilis DB117 to allow the single-copy integration of transcriptional fusions into the chromosome (14). Kanamycin-resistant clones were tested for the amylase-deficient phenotype by checking halo formation on agar plates containing 1% starch. Such clones carried a single copy of the specific fusion as verified by Southern hybridization. The expression of the lacZ fusions was monitored by assaying β-galactosidase activity of cells during the growth in Luria-Bertani medium at 37°C as described by Miller (9). The ORF X-lacZ fusion integrant strain expressed about 130 Miller units (U) of β-galactosidase, and, as expected for the ςA-dependent promoter, the ORF X-directed β-galactosidase activity was found to be induced during the exponential phase of growth. Apparently, the promoter of the thermophilic gene ORF X displays a transcriptional activity in B. subtilis. In contrast, the activity of npr-lacZ fusion was only about 2 U of β-galactosidase, which was in fact, in the range of the background activity produced by the strain DB117 (1 to 2 U) (data not shown), indicating that the DNA upstream from npr gene did not exhibit promoter activity. This finding is consistent with the negative result of the npr promoter mapping by primer extension. The protease production of pGBR50 carrier could be then explained as background expression resulting from nonspecific transcription, probably originating in high-copy vector sequences.
FIG. 3.
Identification of the ORF X transcription start sites by primer extension analysis. The primer extension product (the leftmost lane) was analyzed by electrophoresis along with a dideoxy sequencing ladder (in order A, C, G, and T) obtained with the same primer used for primer extension. The sequence of the coding strand is shown with the transcriptional starts (∗) and the −10 and SD sequences (vertical lines).
These results are suggestive of an operon structure for the ORF X and npr genes. Therefore, this possibility was verified by identification of the very 5′ end of the npr-specific mRNA in B. subtilis using the RT-PCR and 5′RACE methods with aid of the 5′/3′RACE kit (Boehringer Mannheim). In both cases, total RNA isolated from strain DB117(pGBR5) was used together with the NPREX primer for the synthesis of the first-strand cDNA. The resulting single-stranded DNA was first amplified in a reverse transcription (RT)-PCR experiment utilizing the NPREX primer and ORF X-specific ORFW primer 5′-CGGCATTGAATTCTTGAA-3′, hybridizing with the 1543 to 1560 region of the ORF X sequence. The product of the PCR had the expected size of 450 bp (data not shown), indicating that ORF X and npr genes are part of the same operon. In the rapid amplification of cDNA ends (RACE) experiment, the obtained npr-specific single-stranded cDNA was amplified according to the manufacturer's protocol. Terminal transferase was used to add a homopolymeric A tail to the 3′ end of the cDNA, and the tailed cDNA was amplified by PCR using the ORF X-specific primer XREV (5′-GCCTTGTGGTCATATTTTC-3′; nt 876 to 858) and the oligo(dT) anchor primer to form the second strand of DNA. The obtained PCR product was further amplified by a second PCR using a nested, ORF X-specific primer, ORFEX, and the PCR anchor primer to increase the product specificity. The specific amplified product of size determined by the primers (136 bp) was isolated and subcloned for sequencing. From the sequence analysis of the RACE product (Fig. 4A) an A residue could be identified as the start of transcription, thus confirming the transcription initiation site of the ORF X gene. This confirmed that the npr transcript is initiated at the ORF X promoter in B. subtilis. On the basis of this data, the promoter of the npr gene is probably not active or not efficiently utilized in B. subtilis. However, we are aware of the fact that in the original strain the expression of the npr gene might be regulated by a different mechanism. Therefore, we have searched for the npr promoter also in the thermophilic strain BT1 using RT-PCR and 5′ RACE methods. The RT-PCR was performed under the same PCR conditions and with the same primers as in the case of RNA preparation from B. subtilis (see above); however, as a template the total RNA of the BT1 strain was used. Interestingly, in contrast to the result with RNA from B. subtilis, no product was obtained with RNA isolated from the BT1 strain. This indicated that in the thermophilic strain the ORF X gene is not transcribed together with npr gene and that the npr gene might be expressed from its own promoter. Accordingly, the start point of transcription of the protease gene was determined by 5′ RACE. Total RNA of the BT1 strain and the npr-specific primer NPRACE2 (nt 848 to 830) were used for the first-strand cDNA synthesis. Tailed cDNA was then amplified by two following PCR experiments using the nested npr-specific primers NPRACE3 (nt 647 to 638) and NPREV (nt 496 to 476). Sequencing of the specific PCR product revealed a transcription start point (Fig. 4B), which identified a −10 region of the npr promoter, TAGAAT. These experiments established that in the natural thermophilic host the transcription of npr gene is initiated at its own promoter. Furthermore, we have also confirmed that ORF X and npr genes do not comprise an operon in BT1 strain. Evidently, the read-through transcription from the ORF X promoter observed in B. subtilis does not represent the natural pattern of expression of the npr gene. The explanation for inefficiency of the npr promoter in B. subtilis could come from its own structure. The identified −10 region is located just 8 bp downstream of the hairpin-like structure present in the intergenic region between ORF X and npr genes (Fig. 5). Consequently, it is rational to suggest that the putative −35 region is a part of this structure. However, we could not identify any appropriately spaced −35-like sequence within the hairpin. As the best match to the conserved −35 sequence of a ςA-type promoter, only a TAGAAA hexamer could be found. This region, however, is separated from the −10 region by 20 bp, instead of typical 17 or 18 bp. Presumably, such a promoter requires an additional regulatory factor to be recognized by ςA-associated RNA polymerase. Nevertheless, the lack of activity of the npr promoter in B. subtilis might be due to sequestering of the −35 region in the hairpin, which might be absent at 60°C (the cultivation temperature of thermophilic strain BT1), but present at 37°C (ΔG = −29.4 kcal/mol), thus making the promoter inaccessible for RNA polymerase in B. subtilis. To test this hypothesis, the symmetry in the GC-rich region creating the hairpin stem was reduced to destabilize the hairpin structure, and, therefore, four of seven C residues involved in the stem were replaced by site-directed mutagenesis (Fig. 5), leaving the region surrounding the −35 position unchanged. However, the destabilization of the hairpin by site-directed mutations did not cause significant improvement of the protease production in B. subtilis (data not shown). The identification of the npr promoter in the thermophilic BT1 strain might also suggest that this promoter is recognized by a positive transcription factor (activator) or by a novel holoenzyme form associated with a ς factor distinct from those available in B. subtilis. The protease genes of thermophilic origin that have been successfully cloned and characterized in B. subtilis were presumed to be expressed from specific promoters found immediately 5′ to protease genes. Apparently, our expression study does not prove this assumption, and on the contrary, it clearly demonstrates that transcriptional signals used in the mesophilic and thermophilic bacilli might be different. We assume that this observed transcriptional diversity may also be found with heterologous expression of other thermophilic genes.
FIG. 4.
Sequence analysis of 5′ RACE products. The very 5′ ends of npr-specific transcripts were obtained using total RNA from B. subtilis DB117(pGBR5) (A) and total RNA from thermophilic Bacillus sp. BT1 strain (B). The fluorogram images of the sequence ladder were created using Vistra DNA Sequencer 725 software (Amersham). The sequences displayed are complementary to that determined by automated DNA sequencing; the corresponding transcriptional starts are marked with an asterisk.
FIG. 5.
Hairpin-like structure in the promoter region of the npr gene. The transcriptional start is marked with an asterisk; the −10 and SD sequences are underlined. The coding sequence of the npr gene is shown in bold letters. The base substitutions introduced by the mutagenesis are shown by arrows and letters.
Nucleotide sequence accession number.
The nucleotide sequence of ORF X has been submitted to the GenBank Data Library under the accession number U23444.
Acknowledgments
We are grateful to Roy H. Doi for critical reading of the manuscript and helpful discussion. B.V. is very grateful to the members of Department of Genetics in Haren for encouragement throughout this work.
This work was carried out under an East European Fellowship and Short Term Fellowship from EMBO awarded to B.V. and was partially supported by postdoctoral grant C5020702 from the Grant Agency of the Czech Academy of Sciences.
REFERENCES
- 1.Chavira R, Jr, Burnett T J, Hageman J H. Assaying proteinases with azocoll. Anal Biochem. 1984;136:446–450. doi: 10.1016/0003-2697(84)90242-2. [DOI] [PubMed] [Google Scholar]
- 2.Doi R H. Genetic engineering in Bacillus subtilis. In: Russel G E, editor. Biotechnology and genetic engineering reviews. Vol. 2. Newcastle Upon Tyne, United Kingdom: Intercept; 1984. pp. 121–153. [DOI] [PubMed] [Google Scholar]
- 3.Eijsink V G H, Veltman O R, Aukema W, Vriend G, Venema G. Structural determinants of the stability of thermolysin-like proteinase. Nat Struct Biol. 1995;2:374–379. doi: 10.1038/nsb0595-374. [DOI] [PubMed] [Google Scholar]
- 4.Fujii M, Takagi M, Imanaka T, Aiba S. Molecular cloning of a thermostable neutral protease gene from Bacillus stearothermophilus in a vector plasmid and its expression in Bacillus stearothermophilus and Bacillus subtilis. J Bacteriol. 1983;154:831–837. doi: 10.1128/jb.154.2.831-837.1983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hanahan D. Studies on transformation of Escherichia coli with plasmids. J Mol Biol. 1983;166:557–580. doi: 10.1016/s0022-2836(83)80284-8. [DOI] [PubMed] [Google Scholar]
- 6.Helmann J D. Compilation and analysis of Bacillus subtilis ςA-dependent promoter sequences: evidence for extended contact between RNA polymerase and upstream promoter DNA. Nucleic Acids Res. 1995;23:2351–2360. doi: 10.1093/nar/23.13.2351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Imanaka T, Shibazaki M, Takagi M. A new way of enhancing the thermostability of proteases. Nature. 1986;324:695–697. doi: 10.1038/324695a0. [DOI] [PubMed] [Google Scholar]
- 8.Kubo M, Imanaka T. Cloning and nucleotide sequence of the highly thermostable neutral protease gene from B. stearothermophilus. J Gen Microbiol. 1988;134:1883–1892. doi: 10.1099/00221287-134-7-1883. [DOI] [PubMed] [Google Scholar]
- 9.Miller J H. Experiments in molecular genetics. Cold Spring Harbor, N.Y: Cold Spring Harbor Laboratory Press; 1972. [Google Scholar]
- 10.Moran C P, Jr, Lang N, LeGrice S F J, Lee G, Stephens M, Sonenshein A L, Pero J, Losick R. Nucleotide sequences that signal the initiation of transcription and translation in Bacillus subtilis. Mol Gen Genet. 1982;186:339–346. doi: 10.1007/BF00729452. [DOI] [PubMed] [Google Scholar]
- 11.Mountain A. Gene expression systems for Bacillus subtilis. In: Harwood C R, editor. Biotechnology handbooks, vol. 2. Bacillus. New York, N.Y: Plenum Press; 1989. pp. 73–114. [Google Scholar]
- 12.Nishiya Y, Imanaka T. Cloning and nucleotide sequences of the Bacillus stearothermophilus neutral protease gene and its transcriptional activator gene. J Bacteriol. 1990;172:4861–4869. doi: 10.1128/jb.172.9.4861-4869.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sambrook J, Fritsch E F, Maniatis T. Molecular cloning: a laboratory manual. 2nd ed. Cold Spring Harbor, N.Y: Cold Spring Harbor Laboratory; 1989. [Google Scholar]
- 14.Shimotsu H, Henner D. Construction of a single copy integration vector and its use in analysis of regulation of the trp operon of Bacillus subtilis. Gene. 1986;33:103–119. doi: 10.1016/0378-1119(86)90011-9. [DOI] [PubMed] [Google Scholar]
- 15.van den Burg B, Enequist H G, van der Haar M E, Eijsink V G H, Stulp B K, Venema G. A highly thermostable neutral protease from Bacillus caldolyticus: cloning and expression of the gene in Bacillus subtilis and characterization of the gene product. J Bacteriol. 1991;173:4107–4115. doi: 10.1128/jb.173.13.4107-4115.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.van den Burg B, Eijsink V G H, Veltman O R, Venema G, Vriend G. Evolution reversed: engineering an enzyme to resist boiling. Proc Natl Acad Sci USA. 1998;95:2056–2060. doi: 10.1073/pnas.95.5.2056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Večerek B, Kyslík P. Cloning and sequencing of the neutral protease-encoding gene from a thermophilic strain of Bacillus sp. Gene. 1995;158:147–148. doi: 10.1016/0378-1119(95)00167-5. [DOI] [PubMed] [Google Scholar]





