Abstract
Bacterial integration host factors (IHFs) play central roles in the cellular processes of recombination, DNA replication, transcription, and bacterial pathogenesis. We describe here a novel mycobacterial IHF (mIHF) of Mycobacterium smegmatis and Mycobacterium tuberculosis that stimulates integration of mycobacteriophage L5. mIHF is the product of a single gene and is unrelated at the sequence level to other integration host factors. By itself, mIHF does not bind preferentially to attP DNA, although it significantly alters the pattern of integrase (Int) binding, promoting the formation of specific integrase–mIHF–attP intasome complexes.
The mycobacteria include the important human pathogens Mycobacterium tuberculosis and Mycobacterium leprae, the causative agents of tuberculosis and leprosy, respectively (1). These pathogens are characterized by unusual cell walls and extremely slow growth rates (1). It is not clear how the cellular processes of DNA replication, cell division, and gene expression affect the rate of mycobacterial growth, or what changes in these events occur during bacterial infection. In other bacterial systems, small heat-stable DNA-binding proteins such as integration host factor (IHF) and HU play central roles in regulating these processes. We describe here the mycobacterial integration host factor (mIHF), a novel DNA-binding protein required for phage L5 integration.
Phage L5, a temperate phage of the mycobacteria, integrates site-specifically into the genomes of Mycobacterium smegmatis, M. tuberculosis, and bacille Calmette–Guérin (2–6). Integration occurs by integrase (Int)-mediated site-specific recombination between the phage attachment site (attP) and the bacterial attachment site (attB) (3, 6). Although L5 Int is only distantly related to λ Int (3), both proteins are composed of two domains, each of which recognizes a different type of sequence within attP; the smaller N-terminal domains bind to arm-type sites, and the C-terminal domains bind to core-type sites and contain the catalytic residues (5, 33, 34). In both systems, attB is small (25–30 bp) relative to attP (≈250 bp), and strand exchange involves specific cleavages of attP and attB DNA within a common core (7–10).
Four well-characterized phage systems (Escherichia coli phages λ and P2, Salmonella typhimurium phage P22, and the Haemophilus influenzae phage HP1) require a factor encoded by their bacterial hosts for integrative recombination (11–14). This IHF is a small, heterodimeric, sequence-specific DNA-binding protein (15) that is involved in numerous disparate cellular processes including bacterial pathogenesis (15–18). In the λ system, IHF binds to three sites within attP (19, 20) and stimulates integrative recombination through its ability to introduce specific bends at each site (21, 22); these bends promote the formation of intramolecular protein bridges, in which the two domains of Int are simultaneously bound to core- and arm-type sites (23–25).
Although DNA-bending proteins that bind nonspecifically to DNA (such as HU, HMG1, HMG2, and the histone dimer H2A-H2B) can substitute for IHF to form intasomes, they do not stimulate integrative recombination of phage λ (26, 27). The failure of these nonspecific DNA-binding proteins to support integration appears to result from their inability to introduce bends of the required magnitude and direction at all three IHF binding sites simultaneously (27, 28). The requirements for λ excisive recombination are less stringent, and the nonspecific DNA-binding proteins can substitute for IHF (26, 27). Similarly, L5 Int-mediated integrative recombination displays a strong requirement for a host factor that is present in extracts of M. smegmatis (6) and bacille Calmette–Guérin (data not shown). The need for a mycobacterial extract appears to be quite specific: although the stimulating activity shares with IHF and HU the property of being heat stable, E. coli extracts, IHF, or HU do not stimulate L5 integration in vitro (6). In this report, we show that mIHF is composed of a single, small, heat-stable polypeptide that binds to DNA without specificity for the attP site. mIHF is unrelated to previously described DNA-binding proteins and appears to stimulate recombination by binding cooperatively with L5 integrase to attP, forming specific intasomal complexes that Int alone is unable to form. Finally, mIHF is highly conserved between the fast-growing M. smegmatis and the slow-growing pathogen, M. tuberculosis.
MATERIALS AND METHODS
Plasmid Constructions.
The M. smegmatis mIHF gene was isolated from a cosmid library of M. smegmatis DNA (kindly provided by Bill Jacobs, Jr., Yeshiva University, New York) using the degenerate oligonucleotides 5′-CCc/gCAGGTc/gACc/gGACGAGCAGCGt/c/g/aGCt/c/g/aGCt/c/g/aGC and 5′-TCGGCIGAGCTc/gAAGGACCGICTc/gAAGCGIGGIGGIACc/gAACCT (where I is inosine, and positions where base mixtures were used are shown in lowercase letters) that correspond to regions of the N-terminal amino acid sequence of mIHF. DNA fragments from positive cosmid clones were subcloned, and a 1054-bp segment was sequenced using appropriate oligonucleotide primers and single-stranded DNA templates (29); the DNA sequences of both strands were determined. The mIHF overexpression plasmid, pMP21, was generated by PCR amplification of the mIHF gene and insertion into the T7 expression vector, pET21a (Novagen). The predicted protein product is identical to that of the protein isolated from M. smegmatis. Sequence analyses were performed using the University of Wisconsin Genetics Computer Group programs including fasta and blast (30–32).
Southern Hybridization.
Approximately 3 μg of M. smegmatis chromosomal DNA was digested with 20 units of the appropriate restriction enzyme overnight at 37°C and electrophoresed through a 0.7% agarose gel. DNA was transferred to a GeneScreenPlus membrane (NEN), probed with a 350-bp 32P-labeled PCR-generated mIHF DNA fragment, washed, and exposed to film.
Protein Purification.
mIHF protein was purified from M. smegmatis as follows. Cells from a 35-liter culture of M. smegmatis mc2155 were pelleted, resuspended in 200 ml of cold TED buffer (20 mM Tris, pH 7.5/10 mM EDTA/1 mM DTT), sonicated, and clarified by centrifugation. The supernatant was extracted in batch by addition of carboxymethyl-Sepharose, which was then collected by centrifugation and extracted with 0.5M NaCl TED. Proteins were precipitated by addition of ammonium sulfate, collected by centrifugation, and resuspended in TED. Following dialysis, the sample was loaded onto an Econo-Pac heparin cartridge (Bio-Rad) connected to an fast protein liquid chromatography system (Pharmacia), and proteins were eluted with a 400-1000 mM NaCl gradient. Active fractions were identified by in vitro recombination, pooled, and loaded onto an Econo-Pac S cartridge (Bio-Rad). Proteins were eluted with a 0–1000 mM NaCl gradient, and active fractions were identified using recombination assays. Purification of mIHF from E. coli strain BL21DE3pLys (Novagen) carrying plasmid pMP21, induced by addition of 0.5 mM isopropyl β-d-thiogalactoside, was accomplished by a similar protocol. Cells from a 14-liter culture were harvested by centrifugation and frozen; thawed pellets were resuspended in 325 ml cold TED and clarified by centrifugation. Following precipitation and removal of nucleic acids with 0.5% polyethyleneimine, proteins were precipitated with ammonium sulfate and mIHF purified by chromatography over BioCAD POROS 20 Heparin and carboxymethyl columns (PerSeptive Biosystems, Framingham, MA); active fractions were identified using recombination assays.
In Vitro Recombination.
Recombination reactions were performed as described (6) and contained ≈0.3 pmol supercoiled pMH39 DNA (which contains attP), either a 3′ 32P-radiolabeled 0.6-kb linear attB DNA (obtained by digestion of plasmid pMH57 with EcoRI and HindIII) or a 3.9-kb unlabeled linear attB DNA (obtained by digestion of pMH57 with HindIII), 2 μl of an extract containing L5 Int, and 0.1- to 1.0-μl protein fractions. Reactions were incubated for 3 hr at room temperature, stopped by addition of 2 μl of 1% SDS, and electrophoresed on a 0.8% agarose gel. Recombinant products were identified by ethidium bromide staining or autoradiography.
Electrophoretic Mobility Shift Assay.
Conditions for DNA-binding were similar to those used for recombination (6). DNAs used were as follows: a 32P-labeled 0.6-kb attP DNA fragment derived by digestion of pMH39 with EcoRI and HindIII (6); a 32P-labeled 450-bp Asp-718–HindIII fragment from pMP3 (containing a segment of mycobacterial DNA that does not include attP or attB); a 32P-labeled 364-bp attP DNA fragment from plasmid pCPΔR11 (34). Unless otherwise noted, all binding reactions contained 1 μg of salmon sperm DNA. Following addition of proteins, reactions were incubated on ice for 15–30 min and electrophoresed through native 5% polyacrylamide gels in 1× TBE (100 mM Tris/84 mM borate/1 mM EDTA).
DNase I Footprinting.
Protein–DNA complexes were formed in binding reactions essentially identical to those used for recombination and electrophoretic mobility shift assays and digested with an appropriate dilution of DNase I at room temperature for 1 min (29). The products were phenol extracted, ethanol precipitated, and analyzed on a 6% sequencing gel in 1× TBE buffer.
Immunoblot of Protein–DNA Complexes.
Rabbit anti-mIHF antiserum was prepared by Pocono Rabbit Farms (Canadensis, PA) using mIHF purified from E. coli. Electrophoretic mobility shift assays were performed as described above, except that ≈0.5 μg of nonradiolabeled pCPΔR11 DNA (34) digested with EcoRI (New England Biolabs) and BamHI (New England Biolabs) were included. After electrophoresis, the wet gel was exposed to film for 1.5 hr and the proteins were electroblotted to polyvinylidene difluoride (Bio-Rad) for 2.5 hr at 150 mA. The filter was blocked in PBST (1.5 mM NaH2PO4/8.1 mM Na2HPO4/145 mM NaCl/0.05% Tween 20) with 4% milk for 1 hr and incubated with rabbit anti-mIHF antiserum diluted 1:1000 in PBST with 4% milk for 2.5 hr. The filter was washed in PBST and incubated in horseradish peroxidase-conjugated goat anti-rabbit IgG (Bio-Rad) diluted 1:5000 in PBST with 4% milk for 1 hr and washed again in PBST. A final wash was performed in PBS (1.5 mM NaH2PO4/8.1 mM Na2HPO4/145 mM NaCl), and chemiluminescence was performed with luminol (DuPont) and detected by autoradiography. Autoradiography of the filter for the same period of time in the absence of luminol demonstrated that radioactive DNA did not contribute to the signal.
RESULTS AND DISCUSSION
It has been shown previously that crude extracts of M. smegmatis provide an activity that is required for L5 Int-mediated recombination in vitro (6). The protein responsible for this activity was purified from M. smegmatis by column chromatography (Fig. 1a), using in vitro recombination to assay for activity (Fig. 1b) (6). Fractions from the final chromatographic separation that strongly stimulate recombination appear to contain a single, small polypeptide approximately 13 kDa in size; N-terminal sequence analysis of this polypeptide produced a unique sequence of 38 amino acids (N-ALPQLTDEQRAAALEKAAAARRARAELKDRLKRGGTNL; see Fig. 2a). These observations suggest that the mycobacterial protein that stimulates L5 integrative recombination is composed of a single polypeptide species. We have named this protein mycobacterial Integration Host Factor (mIHF).
Degenerate oligonucleotide probes designed from the N-terminal amino acid sequence were used to clone the mIHF gene of M. smegmatis. Southern hybridization studies show that there is only a single copy of the mIHF gene in the M. smegmatis genome (Fig. 2a). A segment of ≈1 kb of M. smegmatis DNA was sequenced, within which a single ORF encoding a putative 105-amino acid protein was identified. Residues 2–39 of the predicted amino acid sequence correspond precisely to the empirically determined amino acid sequence, indicating that this ORF is the mIHF gene (Fig. 2b).
A search of the sequence data bases identified a closely related gene of M. tuberculosis. Alignment of the M. smegmatis and M. tuberculosis DNA sequences shows that the mIHF genes are very closely related, although the flanking regions are not (Fig. 2c). The mIHF protein sequences are almost identical, with only 6 differences (of a total of 105 residues), 3 of which are at the extreme C-termini (Fig. 2d; by contrast, the DnaA proteins of M. smegmatis and M. tuberculosis share only 81% identical residues). The high degree of similarity of the mIHF genes suggests that the proteins perform important functions in the mycobacteria. However, the mIHFs appear to represent a novel class of proteins because no other obvious similarities to known sequences were found, and specific sequence comparisons indicated that the mIHFs are not members of either the HU/IHF or H-NS families of proteins. In addition, no obvious similarities to HMG proteins, histones, or other DNA-binding proteins were found (data not shown).
The M. smegmatis mIHF gene was inserted into an E. coli expression vector, and the mIHF protein was overexpressed and purified (Fig. 3a). The mIHF protein produced in E. coli stimulates recombination (Fig. 3b) and has a similar specific activity to the protein isolated from M. smegmatis; ≈0.2 pmol of mIHF from either source fully stimulates recombination of ≈0.05 pmol attP plasmid DNA (data not shown). These observations confirm that the 13-kDa protein isolated from M. smegmatis (Fig. 1) is the stimulating factor and show that the product of only a single gene is required for this activity (in contrast to heterodimeric E. coli IHF and HU) (15).
Purified mIHF does not appear to bind specifically to attP DNA in vitro (Fig. 4a). At high concentrations mIHF binds to DNA, but mIHF–DNA complexes are formed equivalently with both attP and a non-attP DNA fragment when little or no salmon sperm carrier DNA is present (Fig. 4a). In other experiments, mIHF exhibited no preference for binding to attP DNA over more than a dozen DNA fragments produced by digestion of plasmid pMH94 DNA (3) with BstYI and StyI even over a range of salt conditions (data not shown). In contrast, L5 Int does bind specifically to attP DNA but does not produce a single complex with well-defined electrophoretic mobility; instead, Int-attP complexes remain at the origin of electrophoresis or produce a smear (Fig. 4b). The nature of these complexes is not known, but we speculate that they are complicated networks formed by simultaneous binding of the two domains of Int to different attP DNA molecules. When both Int and mIHF are present with attP DNA, however, a complex with well-defined mobility is observed (Fig. 4c). Moreover, this complex is formed at concentrations of mIHF that are orders of magnitude lower than those needed to produce the nonspecific shift seen in Fig. 4a. Thus, mIHF acts to stimulate the formation of a specific intasome complex.
DNase I footprinting experiments show that the presence of both Int and mIHF results in protection of parts of attP DNA not seen with either protein alone (Fig. 4d). In particular, most of the 70-bp region between the core and the P4 arm-type site is protected, and there may be some additional protection between the core and the P3 arm-type site (Fig. 4d). These protections could result from direct contact of mIHF with the DNA, or from an altered pattern of Int-binding in an mIHF-dependent fashion, but clearly do not result from independent binding of mIHF. The absence of any obvious protection by mIHF alone is consistent with its apparent lack of preference for attP DNA in the gel-shift assays but we cannot rule out the possibility that mIHF binds to attP DNA in regions where there is a paucity of DNase cleavage sites and that these interactions are unstable in the gel conditions used in Fig. 4a.
To address the question of whether mIHF is a constituent of the Int-attP complexes seen in Fig. 4 c and d, we used anti-mIHF serum to immunoblot complexes separated by native gel electrophoresis (Fig. 5). These results demonstrate that mIHF is indeed an integral component of this complex (Fig. 5). We favor the parsimonial explanation that mIHF is in direct contact with the DNA, and as such provides the protection from DNase I seen in Fig. 4d. However, we cannot rule out the possibility that mIHF is only in contact with Int, such that the mIHF-dependent DNase I protection is a consequence of altered Int binding. Thus, mIHF could act by binding and stabilizing transient DNA bends, or through direct contact with Int (these are not mutually exclusive possibilities). We note, however, that mIHF is able to promote the formation of lambda attL intasome complexes, which require stabilization of lambda Int-mediated protein bridges (A. Segall, M.L.P., H. Nash, and G.F.H., unpublished observations). Nevertheless, integrative recombination of both L5 and λ is strongly dependent on the cognate integration host factor, which cannot be replaced by other DNA-binding proteins (6, 27).
We have described here a novel DNA-binding protein that is utilized by phage L5 as an integration host factor. The role of mIHF in L5 integrative recombination appears to be similar to that of E. coli IHF in λ integration: to promote the formation of specific Int–attP DNA higher-order structures that recombine with attB DNA. However, mIHF differs significantly from the IHF proteins of E. coli and other bacteria in that it stimulates recombination and intasome formation without sequence-specific recognition of attP DNA. Further studies will elucidate the role of mIHF in the regulation of the central physiological processes of the pathogenic mycobacteria.
Acknowledgments
We thank J. Hempel for determination of amino acid sequences and composition of mIHF. We also thank A. Segall, H. Nash, L. Barsom, G. Sarkis, and C. Peña for helpful comments. This work was supported by grants from the National Institutes of Health.
Footnotes
References
- 1.Bloom B, editor. Tuberculosis. Washington, DC: Am. Soc. for Microbiol.; 1994. [Google Scholar]
- 2.Snapper S, Lugosi L, Jekkel A, Melton R, Keiser T, Bloom B R, Jacobs W R., Jr Proc Natl Acad Sci USA. 1988;85:6987–6991. doi: 10.1073/pnas.85.18.6987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Lee M H, Pascopella L, Jacobs W R, Jr, Hatfull G F. Proc Natl Acad Sci USA. 1991;88:3111–3115. doi: 10.1073/pnas.88.8.3111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Hatfull G F, Sarkis G. Mol Microbiol. 1993;7:395–405. doi: 10.1111/j.1365-2958.1993.tb01131.x. [DOI] [PubMed] [Google Scholar]
- 5.Hatfull G F. ASM News. 1994;60:255–260. [Google Scholar]
- 6.Lee M H, Hatfull G F. J Bacteriol. 1993;175:6838–6841. doi: 10.1128/jb.175.21.6836-6841.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Mizuuchi M, Mizuuchi K. Proc Natl Acad Sci USA. 1980;77:3220–3224. doi: 10.1073/pnas.77.6.3220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mizuuchi M, Mizuuchi K. Nucleic Acids Res. 1985;13:1193–1208. doi: 10.1093/nar/13.4.1193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Mizuuchi K, Weisberg R, Enquist L, Mizuuchi M, Buraczynska M, Foeller C, Hsu P L, Ross W, Landy A. Cold Spring Harbor Symp Quant Biol. 1981;45:429–437. doi: 10.1101/sqb.1981.045.01.057. [DOI] [PubMed] [Google Scholar]
- 10.Peña C E A, Stoner J, Hatfull G F. J Bacteriol. 1996;178:5533–5536. doi: 10.1128/jb.178.18.5533-5536.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Nash H A, Robertson C A. J Biol Chem. 1981;256:9246–9253. [PubMed] [Google Scholar]
- 12.Hakimi Astumian J H, Waldman A S, Scocca J J. J Bacteriol. 1988;171:1747–1750. doi: 10.1128/jb.171.3.1747-1750.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Yu A, Haggard-Ljungquist E. J Bacteriol. 1993;175:1239–1249. doi: 10.1128/jb.175.5.1239-1249.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Smith-Mungo L, Chan I T, Landy A. J Biol Chem. 1994;269:20798–20805. [PubMed] [Google Scholar]
- 15.Nash H. Trends Biochem Sci. 1990;15:222–227. doi: 10.1016/0968-0004(90)90034-9. [DOI] [PubMed] [Google Scholar]
- 16.Oberto J, Drlica K, Rouviere-Janiv J. Biochimie. 1994;76:901–908. doi: 10.1016/0300-9084(94)90014-0. [DOI] [PubMed] [Google Scholar]
- 17.Friedman D I. Cell. 1988;55:545–554. doi: 10.1016/0092-8674(88)90213-9. [DOI] [PubMed] [Google Scholar]
- 18.Mahan M, Slauch J, Mekalanos J. Science. 1993;259:686–688. doi: 10.1126/science.8430319. [DOI] [PubMed] [Google Scholar]
- 19.Craig N L, Nash H A. Cell. 1984;39:707–716. doi: 10.1016/0092-8674(84)90478-1. [DOI] [PubMed] [Google Scholar]
- 20.Gardner J F, Nash H A. J Mol Biol. 1986;191:181–189. doi: 10.1016/0022-2836(86)90255-x. [DOI] [PubMed] [Google Scholar]
- 21.Robertson C A, Nash H A. J Biol Chem. 1988;263:3554–3557. [PubMed] [Google Scholar]
- 22.Goodman S D, Nash H A. Nature (London) 1989;341:251–254. doi: 10.1038/341251a0. [DOI] [PubMed] [Google Scholar]
- 23.Moitoso de Vargas L, Kim S, Landy A. Science. 1989;244:1457–1461. doi: 10.1126/science.2544029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kim S, Moitoso de Vargas L, Nunes-Düby S, Landy A. Cell. 1990;63:773–781. doi: 10.1016/0092-8674(90)90143-3. [DOI] [PubMed] [Google Scholar]
- 25.Kim S, Landy A. Science. 1992;256:198–203. doi: 10.1126/science.1533056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Goodman S D, Nicholson S C, Nash H A. Proc Natl Acad Sci USA. 1992;89:11910–11914. doi: 10.1073/pnas.89.24.11910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Segall A M, Nash H A. EMBO J. 1993;12:4567–4576. doi: 10.1002/j.1460-2075.1993.tb06145.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Nunes-Düby S E, Smith-Mungo L I, Landy A. J Mol Biol. 1995;253:228–242. doi: 10.1006/jmbi.1995.0548. [DOI] [PubMed] [Google Scholar]
- 29.Sambrook J, Fritsch E F, Maniatis T. Molecular Cloning: A Laboratory Manual. Plainview, NY: Cold Spring Harbor Lab. Press; 1989. [Google Scholar]
- 30.Devereux J, Haeberli P, Smithies O. Nucleic Acids Res. 1984;12:387–395. doi: 10.1093/nar/12.1part1.387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Pearson W R, Lipman D J. Proc Natl Acad Sci USA. 1988;85:2444–2448. doi: 10.1073/pnas.85.8.2444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Altschul S F, Gish W, Miller W, Myers E W, Lipman D J. J Mol Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 33.Moitoso de Vargas L, Pargellis C A, Hasan N, Bushman E, Landy A. Cell. 1988;54:923–929. doi: 10.1016/0092-8674(88)90107-9. [DOI] [PubMed] [Google Scholar]
- 34.Peña, C. E. A., Lee, M. H., Pedulla, M. L. & Hatfull, G. F. (1997) J. Mol. Biol., in press. [DOI] [PubMed]