Complete Genome Sequences of Four Soil-Derived Isolates for Studying Synthetic Bacterial Community Assembly

Carlos N Lozano-Andrade; Mikael Lenz Strube; Ákos T Kovács

doi:10.1128/MRA.00848-21

. 2021 Nov 18;10(46):e00848-21. doi: 10.1128/MRA.00848-21

Complete Genome Sequences of Four Soil-Derived Isolates for Studying Synthetic Bacterial Community Assembly

Carlos N Lozano-Andrade ^a, Mikael Lenz Strube ^b, Ákos T Kovács ^a,^✉

Editor: Leighton Pritchard^c

PMCID: PMC8601137 PMID: 34792377

ABSTRACT

Here, we report the complete genome sequences of four bacterial soil isolates, Chryseobacterium sp., Stenotrophomonas indicatrix, Pedobacter sp., and Rhodococcus globerulus. These isolates can be used for studying microbial interactions and community assembly in vitro.

ANNOUNCEMENT

Soil microbes play diverse and often pivotal roles in ecosystem services, driving biogeochemical cycles, plant growth, and life above and below ground (1, 2).

These activities are performed by complex communities composed of several interacting species, rather than single species. The high complexity of soil microbial communities poses great difficulty for experimentally testing ecological hypotheses, such as microbe-plant interactions or the underlying mechanisms of community assembly. Therefore, experimentally tractable and manipulatable synthetic communities are needed as a model for addressing fundamental questions in microbial ecology (3). We experimented with a four-member synthetic bacterial community to study its assembly and functionality. The isolates were obtained from 1 g soil sample (Dyrehaven, Denmark; coordinates, 55.788800, 12.558300), using serial dilutions plated in 0.1× TSA. Here, we announce the complete genome sequences of Chryseobacterium sp. strain D764, Stenotrophomonas indicatrix D763, Pedobacter sp. strain D749, and Rhodococcus globerulus D757.

For genome sequencing, the strains were grown overnight in LB at 24°C, and genomic DNA (gDNA) was extracted using the GeneMATRIX bacterial and yeast genomic DNA purification kit (EURx, Gdansk, Poland). Two separate DNA extractions were conducted for each sequencing technology, yielding at least 1 μg of gDNA, quantified using Qubit. For Illumina sequencing, a library was prepared using the NEBNext DNA library kit (New England BioLabs, USA).The gDNA was randomly fragmented to 350 bp, end polished, A-tailed, ligated with adapters, and PCR enriched. Then, paired-end reads were generated on the NovaSeq 600 platform with 2 × 150-bp reads. For Nanopore sequencing, a ligation sequencing kit (SQK-LSK109) was used with the native barcoding expansion 1-12 kit (EXP-NBD104), following the manufacturer’s instructions. The libraries were sequenced using an R9.4.1 flow cell on a MinION device running a 48-h sequencing cycle. The reads were base called and demultiplexed using Guppy v.3.1.5 (ONT).

For de novo assembly, the Illumina and Nanopore reads were adapter and quality trimmed using AdapterRemoval v.2.3.1 (4) and Porechop v.0.2.4 (5), respectively. Subsequently, the trimmed reads from both platforms were hybrid assembled using Unicycler v.0.4.8 (6). The complete, circular, and rotated chromosome of each strain produced using Unicycler was evaluated using Bandage v.0.8.1 (7) and BUSCO v.4.1.4 (8) to evaluate the core gene content and CheckM v.1.0.8 for the completeness and contamination levels (9). The chromosomes were annotated using the NCBI Prokaryotic Genome Annotation Pipeline. The species phylogeny was analyzed using autoMLST (10) and TYGS (11). Default parameters were used for all software. Pedobacter sp. D749 and Chryseobacterium sp. D764 had <95% average nucleotide identity (ANI) compared to the genomes of the type strains and thus could represent novel species.

Data availability.

The raw sequencing data have been deposited at the NCBI Sequence Read Archive under accession number PRJNA743326 (SRX11393888 and SRX11393892 for strain D749, SRX11393889 and SRX11393893 for strain D757, SRX11393890 and SRX11393894 for strain D763, and SRX11393891 and SRX11393895 for strain D764 for the Illumina and Nanopore reads, respectively), and the genome assemblies have been deposited in GenBank under BioProject accession number PRJNA743326. Detailed information for each strain is listed in Table 1.

TABLE 1.

Strain names, accession numbers, and genome characteristics of the soil isolates used in this study

Strain^a	GenBank accession no.	No. of assembled reads		Genome assembly size (bp)	Avg read length for Nanopore reads (bp)	Maximum read length for Nanopore reads (bp)	G+C content (%)	No. of CDS^b	No. of rRNAs	No. of tRNAs	Completeness (%)^c	Contamination (%)^c	Complete BUSCO core genes (%)^d	Topology
Strain^a	GenBank accession no.	Illumina	Nanopore	Genome assembly size (bp)	Avg read length for Nanopore reads (bp)	Maximum read length for Nanopore reads (bp)	G+C content (%)	No. of CDS^b	No. of rRNAs	No. of tRNAs	Completeness (%)^c	Contamination (%)^c	Complete BUSCO core genes (%)^d	Topology
Pedobacter sp. D749	CP079218.1	3,337,001	8,681	5,843,246	10,467.7	84,368	38.4	4,895	15	54	98.09	0.19	98.5	Circular
Rhodococcus globerulus D757	CP079698.1	4,662,334	7,757	6,739,623	6,215.7	28,384	61.7	6,091	15	52	99.56	0.89	99	Circular
Stenotrophomonas indicatrix D763	CP079106.1	4,015,635	7,238	4,615,841	13,210.5	78,776	66.3	4,108	13	70	100	0.09	99.9	Circular
Chryseobacterium sp. D764	CP079219.1	3,464,854	7,638	4,921,682	21,343.3	92,855	36.2	4,343	18	85	100	0.25	97.4	Circular

Open in a new tab

Species delineation was performed using TYGS and autoMLST. For TYGS, the digital DNA-DNA hybridization (dDDH) threshold value was >70%.

CDS, coding DNA sequences.

Estimated using CheckM v.1.0.8.

Complete and single-copy benchmarking universal single-copy orthologs (BUSCOs).

ACKNOWLEDGMENT

This project was supported by the Danish National Research Foundation (DNRF137) for the Center for Microbial Secondary Metabolites.

Contributor Information

Ákos T. Kovács, Email: atkovacs@dtu.dk.

Leighton Pritchard, SIPBS, University of Strathclyde.

REFERENCES

1.Philippot L, Raaijmakers JM, Lemanceau P, Van Der Putten WH. 2013. Going back to the roots: the microbial ecology of the rhizosphere. Nat Rev Microbiol 11:789–799. doi: 10.1038/nrmicro3109. [DOI] [PubMed] [Google Scholar]
2.Bahram M, Hildebrand F, Forslund SK, Anderson JL, Soudzilovskaia NA, Bodegom PM, Bengtsson-Palme J, Anslan S, Coelho LP, Harend H, Huerta-Cepas J, Medema MH, Maltz MR, Mundra S, Olsson PA, Pent M, Põlme S, Sunagawa S, Ryberg M, Tedersoo L, Bork P. 2018. Structure and function of the global topsoil microbiome. Nature 560:233–237. doi: 10.1038/s41586-018-0386-6. [DOI] [PubMed] [Google Scholar]
3.Vorholt JA, Vogel C, Carlström CI, Müller DB. 2017. Establishing causality: opportunities of synthetic communities for plant microbiome research. Cell Host Microbe 22:142–155. doi: 10.1016/j.chom.2017.07.004. [DOI] [PubMed] [Google Scholar]
4.Schubert M, Lindgreen S, Orlando L. 2016. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Res Notes 9:88. doi: 10.1186/s13104-016-1900-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Completing bacterial genome assemblies with multiplex MinION sequencing. Microb Genom 3:e000132. doi: 10.1099/mgen.0.000132. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595. doi: 10.1371/journal.pcbi.1005595. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Wick RR, Schultz MB, Zobel J, Holt KE. 2015. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics 31:3350–3352. doi: 10.1093/bioinformatics/btv383. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Waterhouse RM, Seppey M, Simao FA, Manni M, Ioannidis P, Klioutchnikov G, Kriventseva EV, Zdobnov EM. 2018. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol Biol Evol 35:543–548. doi: 10.1093/molbev/msx319. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Alanjary M, Steinke K, Ziemert N. 2019. AutoMLST: an automated Web server for generating multi-locus species trees highlighting natural product potential. Nucleic Acids Res 47:W276–W282. doi: 10.1093/nar/gkz282. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Meier-Kolthoff JP, Göker M. 2019. TYGS is an automated high-throughput platform for state-of-the-art genome-based taxonomy. Nat Commun 10:2182. doi: 10.1038/s41467-019-10210-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

TABLE 1.

Strain names, accession numbers, and genome characteristics of the soil isolates used in this study

Strain^a	GenBank accession no.	No. of assembled reads		Genome assembly size (bp)	Avg read length for Nanopore reads (bp)	Maximum read length for Nanopore reads (bp)	G+C content (%)	No. of CDS^b	No. of rRNAs	No. of tRNAs	Completeness (%)^c	Contamination (%)^c	Complete BUSCO core genes (%)^d	Topology
Strain^a	GenBank accession no.	Illumina	Nanopore	Genome assembly size (bp)	Avg read length for Nanopore reads (bp)	Maximum read length for Nanopore reads (bp)	G+C content (%)	No. of CDS^b	No. of rRNAs	No. of tRNAs	Completeness (%)^c	Contamination (%)^c	Complete BUSCO core genes (%)^d	Topology
Pedobacter sp. D749	CP079218.1	3,337,001	8,681	5,843,246	10,467.7	84,368	38.4	4,895	15	54	98.09	0.19	98.5	Circular
Rhodococcus globerulus D757	CP079698.1	4,662,334	7,757	6,739,623	6,215.7	28,384	61.7	6,091	15	52	99.56	0.89	99	Circular
Stenotrophomonas indicatrix D763	CP079106.1	4,015,635	7,238	4,615,841	13,210.5	78,776	66.3	4,108	13	70	100	0.09	99.9	Circular
Chryseobacterium sp. D764	CP079219.1	3,464,854	7,638	4,921,682	21,343.3	92,855	36.2	4,343	18	85	100	0.25	97.4	Circular

Open in a new tab

Species delineation was performed using TYGS and autoMLST. For TYGS, the digital DNA-DNA hybridization (dDDH) threshold value was >70%.

CDS, coding DNA sequences.

Estimated using CheckM v.1.0.8.

Complete and single-copy benchmarking universal single-copy orthologs (BUSCOs).

[B1] 1.Philippot L, Raaijmakers JM, Lemanceau P, Van Der Putten WH. 2013. Going back to the roots: the microbial ecology of the rhizosphere. Nat Rev Microbiol 11:789–799. doi: 10.1038/nrmicro3109. [DOI] [PubMed] [Google Scholar]

[B2] 2.Bahram M, Hildebrand F, Forslund SK, Anderson JL, Soudzilovskaia NA, Bodegom PM, Bengtsson-Palme J, Anslan S, Coelho LP, Harend H, Huerta-Cepas J, Medema MH, Maltz MR, Mundra S, Olsson PA, Pent M, Põlme S, Sunagawa S, Ryberg M, Tedersoo L, Bork P. 2018. Structure and function of the global topsoil microbiome. Nature 560:233–237. doi: 10.1038/s41586-018-0386-6. [DOI] [PubMed] [Google Scholar]

[B3] 3.Vorholt JA, Vogel C, Carlström CI, Müller DB. 2017. Establishing causality: opportunities of synthetic communities for plant microbiome research. Cell Host Microbe 22:142–155. doi: 10.1016/j.chom.2017.07.004. [DOI] [PubMed] [Google Scholar]

[B4] 4.Schubert M, Lindgreen S, Orlando L. 2016. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Res Notes 9:88. doi: 10.1186/s13104-016-1900-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B5] 5.Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Completing bacterial genome assemblies with multiplex MinION sequencing. Microb Genom 3:e000132. doi: 10.1099/mgen.0.000132. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B6] 6.Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595. doi: 10.1371/journal.pcbi.1005595. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B7] 7.Wick RR, Schultz MB, Zobel J, Holt KE. 2015. Bandage: interactive visualization of de novo genome assemblies. Bioinformatics 31:3350–3352. doi: 10.1093/bioinformatics/btv383. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B8] 8.Waterhouse RM, Seppey M, Simao FA, Manni M, Ioannidis P, Klioutchnikov G, Kriventseva EV, Zdobnov EM. 2018. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol Biol Evol 35:543–548. doi: 10.1093/molbev/msx319. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B9] 9.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B10] 10.Alanjary M, Steinke K, Ziemert N. 2019. AutoMLST: an automated Web server for generating multi-locus species trees highlighting natural product potential. Nucleic Acids Res 47:W276–W282. doi: 10.1093/nar/gkz282. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] 11.Meier-Kolthoff JP, Göker M. 2019. TYGS is an automated high-throughput platform for state-of-the-art genome-based taxonomy. Nat Commun 10:2182. doi: 10.1038/s41467-019-10210-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Complete Genome Sequences of Four Soil-Derived Isolates for Studying Synthetic Bacterial Community Assembly

Carlos N Lozano-Andrade

Mikael Lenz Strube

Ákos T Kovács

Roles

ABSTRACT

ANNOUNCEMENT

Data availability.

TABLE 1.

ACKNOWLEDGMENT

Contributor Information

REFERENCES

Associated Data

Data Availability Statement

TABLE 1.

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Complete Genome Sequences of Four Soil-Derived Isolates for Studying Synthetic Bacterial Community Assembly

Carlos N Lozano-Andrade

Mikael Lenz Strube

Ákos T Kovács

Roles

ABSTRACT

ANNOUNCEMENT

Data availability.

TABLE 1.

ACKNOWLEDGMENT

Contributor Information

REFERENCES

Associated Data

Data Availability Statement

TABLE 1.

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases