Primary production on Earth is dependent on autotrophic carbon fixation, which leads to the incorporation of carbon dioxide into biomass. Multiple metabolic pathways have been described for autotrophic carbon fixation, but most autotrophic organisms were assumed to have the genes for only one of these pathways. Our finding of a cultivable bacterium with two carbon fixation pathways in its genome, the rTCA and the CBB cycle, opens the possibility to study the potential benefits of having these two pathways and the interplay between them. Additionally, this will allow the investigation of the unusual and potentially very efficient mechanism of electron flow that could drive the rTCA cycle in these autotrophs. Such studies will deepen our understanding of carbon fixation pathways and could provide new avenues for optimizing carbon fixation in biotechnological applications.
KEYWORDS: carbon dioxide assimilation, carbon metabolism, electron transport, lithoautotrophic metabolism, symbiosis
ABSTRACT
Very few bacteria are able to fix carbon via both the reverse tricarboxylic acid (rTCA) and the Calvin-Benson-Bassham (CBB) cycles, such as symbiotic, sulfur-oxidizing bacteria that are the sole carbon source for the marine tubeworm Riftia pachyptila, the fastest-growing invertebrate. To date, the coexistence of these two carbon fixation pathways had not been found in a cultured bacterium and could thus not be studied in detail. Moreover, it was not clear if these two pathways were encoded in the same symbiont individual, or if two symbiont populations, each with one of the pathways, coexisted within tubeworms. With comparative genomics, we show that Thioflavicoccus mobilis, a cultured, free-living gammaproteobacterial sulfur oxidizer, possesses the genes for both carbon fixation pathways. Here, we also show that both the CBB and rTCA pathways are likely encoded in the genome of the sulfur-oxidizing symbiont of the tubeworm Escarpia laminata from deep-sea asphalt volcanoes in the Gulf of Mexico. Finally, we provide genomic and transcriptomic data suggesting a potential electron flow toward the rTCA cycle carboxylase 2-oxoglutarate:ferredoxin oxidoreductase, via a rare variant of NADH dehydrogenase/heterodisulfide reductase in the E. laminata symbiont. This electron-bifurcating complex, together with NAD(P)+ transhydrogenase and Na+ translocating Rnf membrane complexes, may improve the efficiency of the rTCA cycle in both the symbiotic and the free-living sulfur oxidizer.
IMPORTANCE Primary production on Earth is dependent on autotrophic carbon fixation, which leads to the incorporation of carbon dioxide into biomass. Multiple metabolic pathways have been described for autotrophic carbon fixation, but most autotrophic organisms were assumed to have the genes for only one of these pathways. Our finding of a cultivable bacterium with two carbon fixation pathways in its genome, the rTCA and the CBB cycle, opens the possibility to study the potential benefits of having these two pathways and the interplay between them. Additionally, this will allow the investigation of the unusual and potentially very efficient mechanism of electron flow that could drive the rTCA cycle in these autotrophs. Such studies will deepen our understanding of carbon fixation pathways and could provide new avenues for optimizing carbon fixation in biotechnological applications.
OBSERVATION
Primary production by autotrophic organisms drives the global carbon cycle. Currently, seven naturally occurring pathways for inorganic carbon fixation are known in autotrophic organisms (1, 2). The dominant carbon fixation pathway used by plants, algae, and many bacteria is the Calvin-Benson-Bassham (CBB) cycle. The six alternative pathways include among others the reverse tricarboxylic acid (rTCA) cycle and the recently discovered reversed oxidative TCA cycle (roTCA) (1, 3, 4). Only a few autotrophic bacteria have more than one carbon fixation pathway (5). These bacteria include a closely related group of sulfur-oxidizing symbionts of marine tubeworms such as Riftia, Escarpia, Tevnia, and Lamellibrachia, which have and express both the oxygen-sensitive rTCA and the oxygen-tolerant CBB cycle (6–11). The only known free-living bacteria that may have all the genes for both cycles are the large sulfur bacteria Beggiatoa and Thiomargarita spp. (12–14). The CBB cycle in the symbionts and the large sulfur bacteria is potentially more energy efficient than the classical version of the CBB cycle based on the replacement of the fructose-1,6-bisphosphatase with a pyrophosphate-dependent enzyme (12, 13, 15, 16). In addition, it is likely that the interplay between the CBB and rTCA cycle under fluctuating redox conditions contributes to the high efficiency of carbon fixation in tubeworm symbioses (6, 7, 17) and consequently to the extremely high growth rates of tubeworms, which grow faster than any other known invertebrate (18).
Given that tubeworm symbionts and large sulfur bacteria could not yet be cultivated, it was not possible to investigate the cooccurrence of their two carbon fixation cycles in detail. In this study, we sequenced a high-quality genome (99.5% completeness as estimated by CheckM) and transcriptome of the symbiont from the tubeworm Escarpia laminata and compared its genome to those of other tubeworm symbionts and free-living microbes. These comparisons revealed the cooccurrence of the complete set of genes for the CBB and rTCA cycles in a cultured bacterium. This discovery will enable future studies of the biochemical and physiological mechanisms that enable the interplay between these two carbon fixation pathways.
Cooccurrence of rTCA cycle genes with RuBisCO in symbiotic and free-living Gammaproteobacteria.
Genes for enzymes that are specific to the rTCA pathway, that is, the type II ATP citrate lyase (ACL, aclAB genes), 2-oxoglutarate:ferredoxin oxidoreductase (OGOR, korABCD genes), and a putative fumarate reductase (tfrAB genes, homologs of genes encoding a thiol:fumarate reductase from Methanobacterium thermoautotrophicum [19]), were assumed to occur in only a few symbiotic Gammaproteobacteria. We discovered, using comparative genomics, that these rTCA cycle enzymes also occur in some Chromatiaceae, including the cultivated sulfur oxidizer Thioflavicoccus mobilis and a gammaproteobacterial metagenome-assembled genome (MAG) from a subsurface aquifer (Gammaproteobacterium RIFOXYD12_FULL_61_37) (20) (Fig. 1). The ACLs of tubeworm symbionts and T. mobilis were likely acquired via horizontal gene transfer from other bacterial clades, because the phylogeny of their aclA genes is not congruent with their placement in a phylogenomic tree (Fig. 1; see also Fig. S1 in the supplemental material) (16). The tubeworm symbionts and Thioflavicoccus also encode either form I or II RuBisCO or both (Text S1, Note 1).
Presence of the rTCA and CBB pathways in the genome of a single bacterium.
Due to the fragmented nature of the previously available genomes of tubeworm symbionts, past studies could not determine whether the genes for both pathways are present in a single genome or if the two pathways are distributed in a strain-specific manner, i.e., only one of the two pathways is present in the genome of a single cell (4). Here, we provide two lines of evidence that the two pathways can cooccur in the genome of a single organism. First, sequencing coverage for the genes of both pathways in the E. laminata symbiont was similar to that of single-copy marker genes (Table S1). Since genes that are strain specific are expected to have lower coverage than the rest of the genome (21), the similar coverage of genes encoding the two pathways and single-copy genes suggests that in the E. laminata symbiont both pathways are present in all cells. Second, in the closed genome of the cultured T. mobilis, the genes encoding the rTCA and the CBB cycle cooccur, providing evidence that these genes coexist in a single genome.
Our transcriptomic analyses of E. laminata tubeworm symbionts revealed high expression levels of both the rTCA and the CBB cycle genes (Fig. 2; Table S1). This observation is consistent with previous proteomic analyses of the Riftia symbiont (the metabolism of the symbionts from these two tubeworms is highly similar) (6, 7). The high expression levels of genes from the rTCA and the CBB cycle suggest that both pathways play an important metabolic role in these symbionts. It is, however, not clear whether these cycles function simultaneously within single symbiont cells or are differentially expressed within the symbiont population (4).
The rTCA gene clusters are conserved among the tubeworm symbionts and some Chromatiaceae bacteria.
In the tubeworm symbionts, the cultivated T. mobilis, and the gammaproteobacterial MAG from a subsurface aquifer, there was a considerable level of conservation of the rTCA gene clusters, at the sequence and synteny levels (Fig. 2). The aclAB genes that encode the two subunits of the ACL were accompanied by those that encode bidirectional TCA cycle enzymes, including acn (aconitase), idh (isocitrate dehydrogenase), and mdh (malate dehydrogenase). The other rTCA-specific genes korABCD (four-subunit OGOR) and tfrAB (putative thiol:fumarate reductase) were also present in the rTCA gene cluster. Similar to the ACL, the four-subunit OGOR and the thiol:fumarate reductase are very rare among Gammaproteobacteria and were probably acquired via a single horizontal gene transfer event from a distant bacterial clade (Fig. 1; Fig. S2 and S3). A dimeric OGOR (korAB genes), more common than the four-subunit enzyme among gammaproteobacterial autotrophs, yet absent in T. mobilis, was located elsewhere in the genome of the E. laminata symbiont. The korAB genes were colocalized with genes that encode other well-expressed TCA cycle enzymes (Text S1, Note 2; Fig. S4). These well-expressed genes included the citrate synthase (gltA) gene, which could indicate its use in the catabolic oxidative TCA cycle. Alternatively, strong expression of citrate synthase could also indicate autotrophic CO2 fixation via the recently discovered ACL-independent reverse oxidative TCA (roTCA) cycle (1, 22).
An array of genes that encode several electron-translocating complexes were integrated into the rTCA cycle gene clusters. These complexes included an electron-bifurcating NADH dehydrogenase/heterodisulfide reductase complex (flxABCD-hdrABC genes [Text S1, Note 3] [23]), an NAD(P)+ transhydrogenase (24) and Na+-translocating Rnf membrane complex (pntAB and rnfABCDGE genes [Text S1, Note 4] [25]). Most interestingly, the conserved interspersing of the korABCD and tfrAB genes with the flxABCD-hdrABC genes hints at the possibility that these proteins form a complex that efficiently shuttles electrons directly to the OGOR and the thiol:fumarate reductase (Text S1, Note 3, and Fig. S5). If this is the case, the carbon fixation efficiency of the rTCA cycle would be most likely considerably higher than the canonical rTCA cycle.
Conclusions.
Until now, the only bacteria known to possess both the CBB and rTCA pathways were sulfur-oxidizing, tubeworm symbionts, and possibly also large sulfur bacteria, all of which are currently not amenable to cultivation-based studies. Experimental studies are now feasible in the cultivable T. mobilis, in which the genes for the CBB and rTCA cycles coexist. Such studies would reveal if these pathways are expressed under different physicochemical conditions and potentially allow the biotechnological optimization of efficiency and yield in production processes that rely on autotrophic carbon fixers. To our knowledge, the use of organisms with multiple carbon fixation pathways has not been used as a design principle for these applications.
Methods. (i) Comparative genomics and transcriptomics.
Publicly available genomes from the NCBI and JGI-IMG collections, as well as de novo-assembled genomes of Escarpia laminata symbionts (estimated completeness 99.5%), were used for genomic comparison (see Text S1). To verify presence/absence of target gene homologs in sequenced organisms, we used NCBI’s BLAST against the nucleotide collection and nonredundant protein database (26). E. laminata symbiont genomes were used as a template for genome-centered transcriptomics.
(ii) Phylogenetic and phylogenomic analyses.
Phylogenomic treeing was performed using scripts available at phylogenomics-tools (https://doi.org/10.5281/zenodo.46122). Twenty-three marker proteins that are universally conserved across the bacterial domain were extracted from genomes using the AMPHORA2 pipeline (27). Twenty-three single-copy markers were used for alignment with MUSCLE (28). The marker alignments were concatenated into a single partitioned alignment, and poorly aligned regions were removed. Functional protein sequences were aligned with MAFFT (29). Maximum likelihood trees were calculated with IQ-TREE (30) and MEGA7 (31), using the best-fitting model.
Data availability.
Sequences are available under the BioProject accession number PRJNA471406.
ACKNOWLEDGMENTS
We thank all individuals who helped during the R/V Meteor research cruise M114, including onboard technical and scientific personnel, the captain and crew, and the ROV MARUM-Quest team. We thank the Max Planck-Genome-Centre Cologne (http://mpgc.mpipz.mpg.de/home/) for generating the metagenomic and the metatranscriptomic data used in this study. We thank Julie Reveillaud for providing genomes of Mid-Cayman Rise tubeworm symbionts and Matthias Winkel, Marc Mußmann, and Jake V. Bailey for helpful and supportive comments on a bioRxiv preprint version of the manuscript. We are grateful to the Mexican authorities for granting permission to conduct this research in the southern Gulf of Mexico (permission of DGOPA: 02540/14 from 5 November 2014).
The Campeche Knoll cruise was funded by the German Research Foundation (DFG-Deutsche Forschungsgemeinschaft). This study was funded by the Max Planck Society, the MARUM DFG-Research Center/Excellence Cluster “The Ocean in the Earth System” at the University of Bremen, an ERC Advanced Grant (BathyBiome, 340535) and a Gordon and Betty Moore Foundation Marine Microbial Initiative Investigator Award to N.D. (grant GBMF3811), and the NC State Chancellor’s Faculty Excellence Program Cluster on Microbiomes and Complex Microbial Communities (M.K.).
M.R.-B., N.D., and M.K. conceived the study. M.R.-B. and M.K. analyzed the samples. M.R.-B., N.D., and M.K. wrote the manuscript.
The authors declare no conflict of interest.
REFERENCES
- 1.Mall A, Sobotta J, Huber C, Tschirner C, Kowarschik S, Bačnik K, Mergelsberg M, Boll M, Hügler M, Eisenreich W, Berg IA. 2018. Reversibility of citrate synthase allows autotrophic growth of a thermophilic bacterium. Science 359:563–567. doi: 10.1126/science.aao2410. [DOI] [PubMed] [Google Scholar]
- 2.Figueroa IA, Barnum TP, Somasekhar PY, Carlström CI, Engelbrektson AL, Coates JD. 2018. Metagenomics-guided analysis of microbial chemolithoautotrophic phosphite oxidation yields evidence of a seventh natural CO2 fixation pathway. Proc Natl Acad Sci U S A 115:E92–E101. doi: 10.1073/pnas.1715549114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Erb TJ. 2011. Carboxylases in natural and synthetic microbial pathways. Appl Environ Microbiol 77:8466–8477. doi: 10.1128/AEM.05702-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Hügler M, Sievert SM. 2011. Beyond the Calvin cycle: autotrophic carbon fixation in the ocean. Annu Rev Mar Sci 3:261–289. doi: 10.1146/annurev-marine-120709-142712. [DOI] [PubMed] [Google Scholar]
- 5.Thiel V, Hügler M, Ward DM, Bryant DA. 2017. The dark side of the Mushroom Spring microbial mat: life in the shadow of chlorophototrophs. II. Metabolic functions of abundant community members predicted from metagenomic analyses. Front Microbiol 8:943. doi: 10.3389/fmicb.2017.00943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Gardebrecht A, Markert S, Sievert SM, Felbeck H, Thürmer A, Albrecht D, Wollherr A, Kabisch J, Le Bris N, Lehmann R, Daniel R, Liesegang H, Hecker M, Schweder T. 2012. Physiological homogeneity among the endosymbionts of Riftia pachyptila and Tevnia jerichonana revealed by proteogenomics. ISME J 6:766–776. doi: 10.1038/ismej.2011.137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Markert S, Arndt C, Felbeck H, Becher D, Sievert SM, Hugler M, Albrecht D, Robidart J, Bench S, Feldman RA, Hecker M, Schweder T. 2007. Physiological proteomics of the uncultured endosymbiont of Riftia pachyptila. Science 315:247–250. doi: 10.1126/science.1132913. [DOI] [PubMed] [Google Scholar]
- 8.Markert S, Gardebrecht A, Felbeck H, Sievert SM, Klose J, Becher D, Albrecht D, Thürmer A, Daniel R, Kleiner M, Hecker M, Schweder T. 2011. Status quo in physiological proteomics of the uncultured Riftia pachyptila endosymbiont. Proteomics 11:3106–3117. doi: 10.1002/pmic.201100059. [DOI] [PubMed] [Google Scholar]
- 9.Robidart JC, Bench SR, Feldman RA, Novoradovsky A, Podell SB, Gaasterland T, Allen EE, Felbeck H. 2008. Metabolic versatility of the Riftia pachyptila endosymbiont revealed through metagenomics. Environ Microbiol 10:727–737. doi: 10.1111/j.1462-2920.2007.01496.x. [DOI] [PubMed] [Google Scholar]
- 10.Reveillaud J, Anderson R, Reves-Sohn S, Cavanaugh C, Huber JA. 2018. Metagenomic investigation of vestimentiferan tubeworm endosymbionts from Mid-Cayman Rise reveals new insights into metabolism and diversity. Microbiome 6:19. doi: 10.1186/s40168-018-0411-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Thiel V, Hügler M, Blümel M, Baumann HI, Gärtner A, Schmaljohann R, Strauss H, Garbe-Schönberg D, Petersen S, Cowart DA, Fisher CR, Imhoff JF. 2012. Widespread occurrence of two carbon fixation pathways in tubeworm endosymbionts: lessons from hydrothermal vent associated tubeworms from the Mediterranean Sea. Front Microbiol 3:423. doi: 10.3389/fmicb.2012.00423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Winkel M, Carvalho V, Woyke T, Richter M, Schulz-Vogt HN, Flood B, Bailey J, Mußmann M. 2016. Single-cell sequencing of Thiomargarita reveals genomic flexibility for adaptation to dynamic redox conditions. Front Microbiol 7:964. doi: 10.3389/fmicb.2016.00964. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.MacGregor BJ, Biddle JF, Harbort C, Matthysse AG, Teske A. 2013. Sulfide oxidation, nitrate respiration, carbon acquisition, and electron transport pathways suggested by the draft genome of a single orange Guaymas Basin Beggiatoa (Cand. Maribeggiatoa) sp. filament. Mar Genomics 11:53–65. doi: 10.1016/j.margen.2013.08.001. [DOI] [PubMed] [Google Scholar]
- 14.Flood BE, Fliss P, Jones DS, Dick GJ, Jain S, Kaster A-K, Winkel M, Mußmann M, Bailey J. 2016. Single-cell (meta-)genomics of a dimorphic Candidatus Thiomargarita nelsonii reveals genomic plasticity. Front Microbiol 7:603. doi: 10.3389/fmicb.2016.00603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kleiner M, Wentrup C, Lott C, Teeling H, Wetzel S, Young J, Chang Y-J, Shah M, VerBerkmoes NC, Zarzycki J, Fuchs G, Markert S, Hempel K, Voigt B, Becher D, Liebeke M, Lalk M, Albrecht D, Hecker M, Schweder T, Dubilier N. 2012. Metaproteomics of a gutless marine worm and its symbiotic microbial community reveal unusual pathways for carbon and energy use. Proc Natl Acad Sci U S A 109:E1173–E1182. doi: 10.1073/pnas.1121198109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Kleiner M, Petersen JM, Dubilier N. 2012. Convergent and divergent evolution of metabolism in sulfur-oxidizing symbionts and the role of horizontal gene transfer. Curr Opin Microbiol 15:621–631. doi: 10.1016/j.mib.2012.09.003. [DOI] [PubMed] [Google Scholar]
- 17.Klatt JM, Polerecky L. 2015. Assessment of the stoichiometry and efficiency of CO2 fixation coupled to reduced sulfur oxidation. Front Microbiol 6:484. doi: 10.3389/fmicb.2015.00484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Bright M, Klose J, Nussbaumer AD. 2013. Giant tubeworms. Curr Biol 23:R224–R225. doi: 10.1016/j.cub.2013.01.039. [DOI] [PubMed] [Google Scholar]
- 19.Heim S, Kunkel A, Thauer RK, Hedderich R. 1998. Thiol:fumarate reductase (Tfr) from Methanobacterium thermoautotrophicum: identification of the catalytic sites for fumarate reduction and thiol oxidation. Eur J Biochem 253:292–299. doi: 10.1046/j.1432-1327.1998.2530292.x. [DOI] [PubMed] [Google Scholar]
- 20.Anantharaman K, Brown CT, Hug LA, Sharon I, Castelle CJ, Probst AJ, Thomas BC, Singh A, Wilkins MJ, Karaoz U, Brodie EL, Williams KH, Hubbard SS, Banfield JF. 2016. Thousands of microbial genomes shed light on interconnected biogeochemical processes in an aquifer system. Nat Commun 7:13219. doi: 10.1038/ncomms13219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Pasić L, Rodriguez-Mueller B, Martin-Cuadrado A-B, Mira A, Rohwer F, Rodriguez-Valera F. 2009. Metagenomic islands of hyperhalophiles: the case of Salinibacter ruber. BMC Genomics 10:570. doi: 10.1186/1471-2164-10-570. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Nunoura T, Chikaraishi Y, Izaki R, Suwa T, Sato T, Harada T, Mori K, Kato Y, Miyazaki M, Shimamura S, Yanagawa K, Shuto A, Ohkouchi N, Fujita N, Takaki Y, Atomi H, Takai K. 2018. A primordial and reversible TCA cycle in a facultatively chemolithoautotrophic thermophile. Science 359:559–563. doi: 10.1126/science.aao3407. [DOI] [PubMed] [Google Scholar]
- 23.Ramos AR, Grein F, Oliveira GP, Venceslau SS, Keller KL, Wall JD, Pereira IAC. 2015. The FlxABCD-HdrABC proteins correspond to a novel NADH dehydrogenase/heterodisulfide reductase widespread in anaerobic bacteria and involved in ethanol metabolism in Desulfovibrio vulgaris Hildenborough. Environ Microbiol 17:2288–2305. doi: 10.1111/1462-2920.12689. [DOI] [PubMed] [Google Scholar]
- 24.Spaans SK, Weusthuis RA, van der Oost J, Kengen SWM. 2015. NADPH-generating systems in bacteria and archaea. Front Microbiol 6:742. doi: 10.3389/fmicb.2015.00742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Biegel E, Muller V. 2010. Bacterial Na+-translocating ferredoxin:NAD+ oxidoreductase. Proc Natl Acad Sci U S A 107:18138–18142. doi: 10.1073/pnas.1010318107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Johnson M, Zaretskaya I, Raytselis Y, Merezhuk Y, McGinnis S, Madden TL. 2008. NCBI BLAST: a better web interface. Nucleic Acids Res 36:W5–W9. doi: 10.1093/nar/gkn201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Wu M, Scott AJ. 2012. Phylogenomic analysis of bacterial and archaeal sequences with AMPHORA2. Bioinformatics 28:1033–1034. doi: 10.1093/bioinformatics/bts079. [DOI] [PubMed] [Google Scholar]
- 28.Edgar RC. 2004. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Nguyen LT, Schmidt HA, Von Haeseler A, Minh BQ. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol 32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Kumar S, Stecher G, Tamura K. 2016. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Mol Biol Evol 33:1870–1874. doi: 10.1093/molbev/msw054. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Sequences are available under the BioProject accession number PRJNA471406.