As we find out more about the archaebacteria our sense of their strangeness increases, but its explanation lies in a shared ancestry, not in individually evolved idiosyncrasies. Carl Woese (1)
The more one reads about Archaea, the more one sees a whole new world, revealing itself. What are these mysterious organisms, how did they come into existence, and why did it take us so long to even recognize their distinctness among living creatures? Where does the watershed lie between these microbes and the rest of the living world? What about the dichotomous division between prokaryotes and eukaryotes? The answers to many of these questions are found in the enlightening accounts of Woese (e.g., refs. 1 and 2). Indeed, from the viewpoint of a systematist, animals and plants are rich in complex morphological detail, which can serve as the basis for their systematics; however, not much phylogenetic information can be derived from the simple morphologies and physiologies of the bacteria. Only the revolutionary capability to sequence nucleic acids turned bacterial systematics into a tangible enterprise.
Molecular comparisons, originally of rRNA sequences, followed by analysis of several protein families (discussed in ref. 3) prompted a revolutionary change in our view of the evolutionary relationship among living organisms. The three-domain concept was forwarded (4): life on Earth comprises three domains, Bacteria, Archaea, and Eucarya, each of which contains several kingdoms (Fig. 1). The proposed universal phylogenetic tree places the Archaea in a clearly distinct realm of organisms, differing in fundamental ways from the Bacteria and Eucarya. Although from a cytological point of view Archaea are prokaryotic (they lack nuclei, cytoskeleton, and organelles, see Table 1), at the molecular level they represent a complex mosaic of features of either prokaryotic or eukaryotic nature, as well as their own unique features. This mosaic is itself partitioned into two aspects: metabolism and information processing. Molecular analyses, culminating in the sequencing of the complete genomes of Methanococcus jannaschii (5) and Methanobacterium thermoautotrophicum strain ΔH (6), provide a clear general view of this structural dichotomy. Although the metabolic facets of these organisms are closely related to those seen in Bacteria, the molecules, and hence probably the mechanisms, involved in information processing seem to be recognizably more similar to those in Eucarya (for description of some molecular features of Archaea, in comparison with Bacteria and Eucarya, see Table 1).
Table 1.
Property | Bacteria | Archaea | Eucarya |
---|---|---|---|
Cytological features | |||
Nucleus | No | No | Yes |
Cytoskeleton | No | No | Yes |
Organelles (mitochondria, chloroplasts, Golgi apparatus) | No | No | Yes |
Molecular features | |||
DNA topology | Negatively supercoiled | Relaxed or positively supercoiled (in hyperthermophilic Archaea that contain reverse gyrase) | Negatively supercoiled |
Promoter structure | Two conserved boxes at −10 (TATAAT) and −35 (TTGACA) from transcription start site | TATA box and/or initiator element | TATA box and/or initiator element |
RNA polymerase | One type; relatively simple subunit composition; binds directly to promoter (can be footprinted) | One type; complex subunit structure (subunit pattern, genes, and serological properties similar to eucaryal RNA polymerase II); can be footprinted, but still requires basal transcription factors for promoter recognition | Three types; complex subunit compositions; cannot be footprinted; require basal transcription factors for promoter recognition/binding |
Basal transcription factors | No | TBP, TFIIB, and TIIS homologs of eucaryal RNA polymerase II-associated factors described thus far | TBP, TAFs, TFIIA, TFIIB, TFIIE, TFIIF, TFIIH required for RNA polymerase II initiation; P-TEFb, TFIIS, TFIIF, elongin, and ELL required for elongation |
Poly(A) tails in RNA | Short | Short (avg. 12 bases in length) | Long |
Chromatin | No | ? | Yes |
If Archaea and Eucarya have a common evolutionary ancestor, it must have had some of the molecular features of the information-processing apparatus seen in a modified form in both domains. If this is the case, one may ask how the genome is organized. Eucarya package and regulate (in part) the activity of their genomes by organizing them into chromatin, a nucleoprotein fiber built of nucleosomal particles separated by linker DNA (7, 8). Do Archaea have a structural counterpart of chromatin? It is this important question that has been experimentally addressed in the paper by Pereira et al. (9), published in this issue.
The Archaea “chromatin” field developed along two lines of independent research: the description of histone-like proteins and their interaction with DNA in vitro, and electron microscopy (EM) visualization of nucleoprotein fibers spread out of cells. A small, basic, histone-like protein, HTa, that can organize DNA into nucleosome-like structures (NLS) was identified in Thermoplasma acidophilum (10, 11). The same work presented EM images of native nucleoprotein spreads that contained globular particles 5–6 nm in diameter along the DNA fibers (see below). High-quality, fairly convincing “chromatin” fiber images from Halobacterium salinarium were reported later (12, 13) (unfortunately, no relevant biochemical data are available on this species). Finally, many studies, mainly from Reeve’s laboratory, have been reported on the isolation, sequences, and DNA-binding properties of a family of HL proteins now known as the HMf family of archaeal histones (for reviews see refs. 14 and 15).
The Histone Proteins of Archaea
The histone family in Archaea, named after its first described members, the two closely related HL proteins from Methanothermus fervidus (16), now contains 18 members (refs. 5, 14, and 15; J. N. Reeve, personal communication). They are strongly conserved in sequence among themselves; moreover, and of particular importance, they have strong sequence similarities with the folded regions of eucaryal core histones. In fact, each consensus sequence for a eucaryal core histone is more similar to the HMf histone sequences than to the other core histone sequences (15), putting the two groups of proteins into a distinct group, separate from all other DNA-binding proteins.
The resemblance of the two histone groups is further substantiated from secondary structure predictions (15, 17, 18) and analysis (19). Both protein classes possess the “histone fold,” originally described by Arents et al. (20) in all four core histones. The fold consists of three α-helices interconnected by loop/β-strand segments (Fig. 2A). The histone fold allows the histone monomers to dimerize by the antiparallel pairing of their long central α-helices, with the formation of the so-called handshake motif. NMR solution studies of recombinant HMfB (19) demonstrated that the histone fold in the core histones is superimposable with that in HMfB; moreover, two HMfB monomers interact with each other to form the handshake motif characteristic of core histone heterodimers (Fig. 2B).
HMf proteins exist as dimers in solution, as demonstrated by gel filtration and chemical cross-linking (15). Both homo- and heterodimers can be formed in vitro, and mixtures of the three possible dimers, (HMfA)2, (HMfB)2, and HMfA–HMfB, have been directly isolated from cells. Because the accumulation of the two monomers is dependent on growth phase, and they have somewhat different DNA-binding features, it has been suggested that the cell combines the relative abundance of different dimers with their different binding properties to differentially regulate the structure of the M. fervidus chromosome during growth (15). Finally, and presumably of physiological relevance, when cross-linking is performed on DNA-bound proteins, tetramers can be formed, in addition to dimers.
Formation of NLS in Vitro
The first NLS formed by the interaction of an archaeal HL protein (HTa) with naked DNA were reported back in 1980 (10). The relationship of these particles to those studied later in Reeve’s laboratory is unclear partly because HTa does not seem to belong to the HMf family (15), although it does possess a limited but recognizable homology to eucaryal core histones (11); rather, it is perceived as a distant relative of the HU-family of bacterial HL proteins (15).
Studies involving bona fide members of the HMf family showed that these proteins, on binding to linear DNA molecules increased their electrophoretic mobility in agarose gels, suggesting some sort of DNA compaction. EM visualization of the complexes revealed NLS (e.g., ref. 16); the length of the constrained DNA was estimated to be around 90–150 bp. In addition, micrococcal nuclease (MNase) digestion produced a short ladder of fragments, multiples of ≈60 bp; a particle of ≈60 bp was the main product of more extensive digestion (21). The MNase digestion pattern was interpreted, by analogy with similar patterns of eucaryal chromatin, as resulting from preferential cleavage of naked DNA between NLS, with the DNA within the particles protected by virtue of its association with a tetramer of HMf. The MNase digestion results, however, raise several questions. Why is the new value of 60 bp so different from that previously estimated (90–150 bp; see above)? Why is the fragment so small? Does it reflect an internal cleavage at a specific exposed site within a larger particle, similarly to the situation observed with mononucleosomes reconstituted with (H3–H4)2 tetramers only (22)? What does the observed ladder actually mean—that a small number of small particles of 60 bp are very closely packed with no linker DNA, or alternatively, that larger closely packed particles are also extensively internally cleaved? Finally, if we hypothesize that DNA is wrapped on the outside of a tetrameric HMf core, then the DNA will have to be severely bent, much more so than in the canonical eucaryal nucleosome (23). It is also not clear why fragments longer than 180 bp (“nucleosome trimers”) were never observed, even though the 200-fold molar excess of the protein over DNA (50-fold molar excess of histone tetramers) would certainly be expected to allow the formation of long ladders, especially on the pUC19 DNA.
Other published data may have a bearing on the role of HMf family in organization of archaeal DNA. The proteins preferentially bind to intrinsically curved DNA (24); however, when a set of cloned DNA fragments originating from MNase-resistant portions of reconstituted complexes were analyzed, no strong correlation between “nucleosome” positions and DNA curvature was noted (21). Another intriguing feature of the HMf-DNA complexes is that the DNA seems to be wrapped around the protein core in a right-handed superhelix (25), contrary to the sense of wrapping in eucaryal nucleosomes. This finding was recently explained by analogy with eucaryal nucleosomal particles, containing only the central (H3–H4)2 tetramer. The DNA in such particles can apparently undergo a major, reversible switch from a left-handed superhelix to a right-handed one, and this switch is mediated by a similar switch in the topology of the tetramer itself involving a rotation of the two H3–H4 dimers about their H3–H3 interface (26). The relationship of the observation of Musgrave et al. (25) to the in vivo situation remains to be established.
Evidence for Particulate Deoxyribonucleoprotein (DNP) Structures in Situ
As mentioned above, EM images of spreads of DNA fibers revealed NLS in T. acidophilum (10) and H. salinarium (12, 13). The work of Pereira et al. (9) adds M. thermoautotrophicum to this list. In addition, it directly demonstrates by immuno-EM that the bead-like structures contain HMt. Again, as in the in vitro experiments described above, MNase digestion (of formaldehyde-fixed protoplasts) produces a short ladder of multiples of 60 bp, going up to a barely distinguishable trimer. [Qualitatively similar results were obtained earlier on T. acidophilum nucleoprotein, with the formation of a stable 40-bp fragment and its dimer (10).] Additional fragments of unknown nature can also be seen in the pattern, especially at positions between those of the monomer and dimer bands. Importantly, the monomeric DNP complexes obtained by MNase cleavage were biochemically isolated and shown to contain monomers, dimers, and tetramers of HMf (these experiments were performed with M. fervidus).
Two other important observations are reported in ref. 9. First, it was estimated, from measurements of the amount of HMf in the cell, that there is enough of the protein (one tetramer per 67 bp of DNA) to compact the entire genome. How much of the genome is actually packed in chromatin remains to be established. Second, DNA from cross-linked complexes purified by immunoprecipitation with anti-HMf antibodies was probed for the presence of several genes. In every case, with one exception, the specific gene sequences were present in the complexes, albeit at considerably different levels. Understanding the implications of these results would require additional studies. At present the results suggest that differential organization of different genes in chromatin may constitute one level of their transcriptional regulation, as is the case for eucaryal chromatin.
Missing Pieces of the Puzzle
From the above, it seems clear that archaeal DNA may be organized in an archaeal version of eucaryal chromatin. The data, although appearing at an ever-increasing pace, only set the stage for much more research. What is the actual structure of the archaeal particles? How long is the DNA constrained by the protein core, and what is the sense of the DNA superhelix? How are the two partners, DNA and proteins, interacting? How are the particles interconnected to form a fiber and what is its structure? How does the chromatin fiber interact with the complex replication and transcriptional machineries to allow (regulated) processing of the genetic information? The answers to all these intriguing questions will require the concerted efforts of researchers from many different disciplines. The picture that will emerge as a result of this effort will undoubtedly contribute to our understanding of this fascinating domain of life, and of life in general.
Acknowledgments
I thank Dr. John Reeve for last-minute hectic discussions and for communicating unpublished data and data in press. I also acknowledge Dr. M. Summers for providing Fig. 2, Dr. S. H. Leuba for useful discussions, and Dr. K. E. van Holde for critical reading.
References
- 1.Woese C R. Microbiol Rev. 1987;51:221–271. doi: 10.1128/mr.51.2.221-271.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Olsen G J, Woese C R. Cell. 1997;89:991–994. doi: 10.1016/s0092-8674(00)80284-6. [DOI] [PubMed] [Google Scholar]
- 3.Doolittle R F. Proc Natl Acad Sci USA. 1995;92:2421–2423. doi: 10.1073/pnas.92.7.2421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Woese C R, Kandler O, Wheelis M L. Proc Natl Acad Sci USA. 1990;87:4576–4579. doi: 10.1073/pnas.87.12.4576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Bult C J, White O, Olsen G J, Zhou L, Fleischmann R D, et al. Science. 1996;273:1058–1073. doi: 10.1126/science.273.5278.1058. [DOI] [PubMed] [Google Scholar]
- 6.Smith, D. R., Doucette-Stamm, L. A., DeLoughery, C., Lee, H., Dubois, J., et al. (1997) J. Bacteriol., in press. [DOI] [PMC free article] [PubMed]
- 7.van Holde K E. Chromatin. New York: Springer; 1988. [Google Scholar]
- 8.Tsanev R, Russev G, Pashev I, Zlatanova J. Replication and Transcription of Chromatin. Boca Raton, FL: CRC; 1992. [Google Scholar]
- 9.Pereira S L, Grayling R A, Lurz R, Reeve J N. Proc Natl Acad Sci USA. 1997;94:12633–12637. doi: 10.1073/pnas.94.23.12633. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Searcy D G, Stein D B. Biochim Biophys Acta. 1980;609:180–195. doi: 10.1016/0005-2787(80)90211-7. [DOI] [PubMed] [Google Scholar]
- 11.Delange R J, Williams L C, Searcy D G. J Biol Chem. 1981;256:905–911. [PubMed] [Google Scholar]
- 12.Shioda M, Sugimori K, Shiroya T, Takayanagi S. J Bacteriol. 1989;171:4514–4517. doi: 10.1128/jb.171.8.4514-4517.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Takayanagi S, Morimura S, Kusaoke H, Yokoyama Y, Kano K, Shioda M. J Bacteriol. 1992;174:7207–7216. doi: 10.1128/jb.174.22.7207-7216.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Grayling R A, Sandman K, Reeve J N. FEMS Microbiol Rev. 1996;18:203–213. doi: 10.1111/j.1574-6976.1996.tb00237.x. [DOI] [PubMed] [Google Scholar]
- 15.Grayling R A, Sandman K, Reeve J N. Adv Protein Chem. 1996;48:437–467. doi: 10.1016/s0065-3233(08)60368-x. [DOI] [PubMed] [Google Scholar]
- 16.Sandman K, Krzycki J A, Dobrinski B, Lurz R, Reeve J N. Proc Natl Acad Sci USA. 1990;87:5788–5791. doi: 10.1073/pnas.87.15.5788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Arents G, Moudrianakis E N. Proc Natl Acad Sci USA. 1995;92:11170–11174. doi: 10.1073/pnas.92.24.11170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Baxevanis A D, Arents G, Moudrianakis E N, Landsman D. Nucleic Acids Res. 1995;14:2685–2691. doi: 10.1093/nar/23.14.2685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Starich M R, Sandman K, Reeve J N, Summers M F. J Mol Biol. 1996;255:187–203. doi: 10.1006/jmbi.1996.0016. [DOI] [PubMed] [Google Scholar]
- 20.Arents G, Burlingame R W, Wang B-C, Love W E, Moudrianakis E N. Proc Natl Acad Sci USA. 1991;88:10148–10152. doi: 10.1073/pnas.88.22.10148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Grayling R A, Bailey K A, Reeve J N. Extremophiles. 1997;1:79–88. doi: 10.1007/s007920050018. [DOI] [PubMed] [Google Scholar]
- 22.Dong F, van Holde K E. Proc Natl Acad Sci USA. 1991;88:10596–10600. doi: 10.1073/pnas.88.23.10596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Morse R H, Simpson R T. Cell. 1988;54:285–287. doi: 10.1016/0092-8674(88)90190-0. [DOI] [PubMed] [Google Scholar]
- 24.Howard M T, Sandman K, Reeve J N, Griffith J D. J Bacteriol. 1992;174:7864–7867. doi: 10.1128/jb.174.23.7864-7867.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Musgrave D R, Sandman K M, Reeve J N. Proc Natl Acad Sci USA. 1991;88:10397–10401. doi: 10.1073/pnas.88.23.10397. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hamiche A, Carot V, Alilat M, De Lucia F, O’Donohue M-F, Révet B, Prunell A. Proc Natl Acad Sci USA. 1996;93:7588–7593. doi: 10.1073/pnas.93.15.7588. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Reeve J N. Annu Rev Microbiol. 1992;46:165–191. doi: 10.1146/annurev.mi.46.100192.001121. [DOI] [PubMed] [Google Scholar]