Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2023 Feb 13;12(3):e01015-22. doi: 10.1128/mra.01015-22

Metagenomes from Soils along an Agricultural Transect in Ulster County, New York

Carolina Oliveira de Santana a, Pieter Spealman b, David Gresham b, Gabriel G Perron a,b,c,
Editor: Leighton Pritchardd
PMCID: PMC10019288  PMID: 36779724

ABSTRACT

Many modern farming practices negatively impact ecosystems on the local and global scales. Here, we assessed the taxonomic structures of 48 soil microbial communities along an agricultural transect using 16S rRNA and internal transcribed spacer (ITS) amplicon sequencing. We further characterized the functional structures of a subsample of 12 microbiomes using whole-genome sequencing.

ANNOUNCEMENT

Microbial communities in agricultural soil play important ecological and economic roles (1, 2). Many modern agricultural practices negatively impact soils and soil microbiomes, degrading nutrients and reducing diversity (3). The use of antimicrobials in agriculture further alters microbial communities in soil and surrounding environments (4). Given increasing pressures on the global food system, understanding how soil microbiomes respond to agricultural practices is critical. Here, we present metagenomic sequences from soil samples collected from an agricultural transect encompassing different proximities to human activity.

Samples were collected from four sites along a transect of previously farmed soil, perpendicular to Esopus Creek, New York (Fig. 1A), in a gradient from most affected by human activity to least, as follows: F8-W, a forested strip between the riverbank and a dirt road; F8-0 (3.05 m), F8-300 (91.44 m), and F8-600 (182.88 m), fallow farm sites. At each site, we collected three 10-g topsoil samples on 6 June, 19 June, and 3 July 2019 using sterile techniques. Samples were transported to the laboratory on ice in a dark cooler and frozen for at least 24 h.

FIG 1.

FIG 1

(A) Schematic diagram showing the distance from the water line (white circle) of each site along the F8 transect from which triplicate samples were taken in 2019. (B) Schematic diagram of the sampling, library construction, and sequencing of each sample. (C) Pie charts showing the relative proportions of eukaryotes, prokaryotes, and archaea. (D) Percent abundances of the top five most abundant KO orthologs.

For each sample (Fig. 1B), we extracted DNA from 1.8 g of soil using the Quick-DNA fecal/soil microbe miniprep kit (Zymo), and this DNA was used for 16S rRNA and internal transcribed spacer (ITS) amplicon sequencing; for samples collected on 3 July, these extractions were also used for whole-genome sequencing (WGS). All 16S rRNA amplicon sequencing amplified the V4 region using the 515F and 806R primers (5). ITS amplicon sequencing used the ITS1f and ITS2 primers (5). PCR products for the 16S rRNA and ITS amplicons were pooled separately and purified on a 2% agarose gel using a Qiagen gel extraction kit. The excised regions were ~385 bp for the 16S rRNA amplicons and 200 to 600 bp for the ITS amplicons. WGS library preparation was performed using the Nextera XT DNA library preparation kit (Illumina). All purified libraries were quality checked using an Agilent 2100 BioAnalyzer and a high-sensitivity DNA kit (Agilent) and stored at −20°C until sequencing. Amplicon libraries were sequenced at Wright Labs (Huntingdon, PA) using Illumina MiSeq v2 paired-end sequencing (2 × 250-bp reads) with 20% PhiX spike-in. WGS libraries were sequenced using an Illumina NextSeq 2000 system (2 × 150-bp paired-end reads) with default parameters.

For 16S rRNA and ITS analyses, the QIIME2 pipeline was used with default parameters except for DADA2 (16S rRNA, denoise-paired, –p-trim-left-f 0 –p-trim-left-r 0 –p-trunc-len-f 250 –p-trunc-len-r 250; ITS, denoised-single, –p-trunc-len 150). For WGS, raw-read processing, assembly, and analysis were performed using the MGnify v5.0 pipeline (6) with default parameters for adapter trimming, quality filtering, and subsequent analysis. The results for each sample are shown in Table 1. We found that the majority of small subunit (SSU)- and large subunit (LSU)-containing contigs identified by MGnify belonged to bacteria (82%), eukarya (14%), and archaea (4%) (Fig. 1C). From 149,688 contigs, MGnify identified 208,556 coding sequences associated with 6,039 genome properties (7). Transposases and putative transposases (0.8% of KEGG Orthology [KO] orthologs) accounted for the majority of high-abundance KO orthologs in each sample (11.5% of all KO orthologs assigned) (Fig. 1D).

TABLE 1.

Summary of data

Samplea SRA or MGnify accession no. Mean read length (bp) Total no. of read pairs submitted Total no. of read pairs Total no. of contigs N50 (bp)
WGS
 F8W_T25 ERR9752702 151 13,493,283
 F8W_T26 ERR9752724 151 11,497,693
 F8W_T27 ERR9752734 151 9,693,650
 F80_T28 ERR9752742 151 9,510,055
 F80_T29 ERR9752750 151 11,153,342
 F80_T30 ERR9752757 151 10,188,435
 F8300_T31 ERR9752762 151 13,358,533
 F8300_T32 ERR9752768 151 12,361,583
 F8300_T33 ERR9752775 151 11,262,599
 F8600_T34 ERR9752780 151 11,369,775
 F8600_T35 ERR9752794 151 14,442,272
 F8600_T36 ERR9760450 151 16,556,960
ITS
 F80_1_0607 ERR10168568 151 1,444,523
 F80_1_0619 ERR10168570 151 1,444,589
 F80_1_0703 ERR10168632 151 2,185,005
 F80_2_0607 ERR10168636 151 2,328,428
 F80_2_0619 ERR10168647 151 1,453,311
 F80_2_0703 ERR10168649 151 1,011,053
 F80_3_0607 ERR10168652 151 948,188
 F80_3_0619 ERR10168654 151 1,571,348
 F80_3_0703 ERR10168656 151 975,492
 F8300_1_0607 ERR10168659 151 1,095,855
 F8300_1_0619 ERR10168660 151 787,551
 F8300_1_0703 ERR10168662 151 1,072,100
 F8300_2_0607 ERR10168666 151 1,319,891
 F8300_2_0619 ERR10168672 151 1,446,988
 F8300_2_0703 ERR10168675 151 1,152,882
 F8300_3_0607 ERR10168677 151 1,414,719
 F8300_3_0619 ERR10168680 151 23
 F8300_3_0703 ERR10168682 151 1,489,768
 F8600_1_0607 ERR10168684 151 1,456,243
 F8600_1_0619 ERR10168688 151 3,790,963
 F8600_1_0703 ERR10168745 151 1,336,475
 F8600_2_0607 ERR10168750 151 1,154,444
 F8600_2_0619 ERR10168753 151 506,739
 F8600_2_0703 ERR10168760 151 1,315,871
 F8600_3_0607 ERR10168776 151 1,582,067
 F8600_3_0619 ERR10168779 151 1,398,289
 F8600_3_0703 ERR10639905 151 47
 F8W_1_0607 ERR10213569 151 2,199,554
 F8W_1_0619 ERR10213571 151 873,471
 F8W_1_0703 ERR10213561 151 1,256,782
 F8W_2_0607 ERR10213574 151 656,150
 F8W_2_0619 ERR10213577 151 1,178,439
 F8W_2_0703 ERR10213582 151 759,588
 F8W_3_0607 ERR10213585 151 1,327,567
 F8W_3_0619 ERR10213587 151 1,281,001
 F8W_3_0703 ERR10213589 151 926,657
16S rRNA
 F80_1_2019_06_07_16S ERR10169571 250.7 30,535
 F80_1_2019_06_19_16S ERR10169574 250.6 36,286
 F80_1_2019_07_03_16S ERR10169578 250.6 21,911
 F80_2_2019_06_07_16S ERR10169580 250.7 7,794
 F80_2_2019_06_19_16S ERR10169582 250.6 24,368
 F80_2_2019_07_03_16S ERR10169585 250.6 18,269
 F80_3_2019_06_07_16S ERR10169587 250.6 30,048
 F80_3_2019_06_19_16S ERR10169589 250.6 34,117
 F80_3_2019_07_03_16S ERR10169592 250.6 20,410
 F8300_1_2019_06_07_16S ERR10169803 250.6 33,208
 F8300_1_2019_06_19_16S ERR10169804 250.6 44,023
 F8300_1_2019_07_03_16S ERR10169815 250.6 23,870
 F8300_2_2019_06_07_16S ERR10169821 250.6 28,959
 F8300_2_2019_06_19_16S ERR10169825 250.5 36,641
 F8300_2_2019_07_03_16S ERR10169841 250.7 17,673
 F8300_3_2019_06_07_16S ERR10169845 250.6 27,000
 F8300_3_2019_06_19_16S ERR10169847 250.6 38,461
 F8300_3_2019_07_03_16S ERR10169849 250.6 30,309
 F8600_1_2019_06_07_16S ERR10169850 250.6 28,737
 F8600_1_2019_06_19_16S ERR10169852 250.6 21,212
 F8600_1_2019_07_03_16S ERR10169854 250.6 42,005
 F8600_2_2019_06_07_16S ERR10169855 250.7 29,635
 F8600_2_2019_06_19_16S ERR10169856 250.6 34,151
 F8600_2_2019_07_03_16S ERR10169859 250.6 40,674
 F8600_3_2019_06_07_16S ERR10169861 250.6 50,432
 F8600_3_2019_06_19_16S ERR10169885 250.6 37,360
 F8600_3_2019_07_03_16S ERR10169864 250.7 33,704
 F8W_1_0607_16S ERR10213607 250.6 46,515
 F8W_1_0619_16S ERR10213609 250.6 48,041
 F8W_1_0703_16S ERR10213610 250.6 16,155
 F8W_2_0607_16S ERR10213612 250.4 288
 F8W_2_0619_16S ERR10213613 250.6 44,701
 F8W_2_0703_16S ERR10213614 250.7 15,809
 F8W_3_0607_16S ERR10213616 250.6 34,981
 F8W_3_0619_16S ERR10213618 250.6 54,720
 F8W_3_0703_16S ERR10213620 250.6 39,692
MGnify
 F80_T28 MGYA00607100 9,016,418 9,977 562
 F80_T29 MGYA00607095 10,558,938 833 560
 F80_T30 MGYA00607099 9,646,580 7,393 554
 F8300_T31 MGYA00607093 12,653,552 17,047 566
 F8300_T32 MGYA00606633 11,703,364 14,294 555
 F8300_T33 MGYA00607091 10,659,098 10,304 555
 F8600_T34 MGYA00607098 10,767,204 11,398 558
 F8600_T35 MGYA00607096 13,663,903 13,799 557
 F8600_T36 MGYA00607092 15,683,527 20,013 564
 F8W_T25 MGYA00607094 12,781,962 22,814 572
 F8W_T26 MGYA00607097 10,887,794 7,033 551
 F8W_T27 MGYA00607090 9,183,848 7,283 556
a

Sequencing run statistics are available for WGS, ITS amplicon sequencing, and 16S rRNA V4 amplicon sequencing, including the sample name (for WGS, site_id; for ITS, site_replicate_date; for 16S, site_replicate_date_16S), read accession number, mean read length, and total reads or read pairs submitted. WGS raw reads were used to perform metagenomic analyses using MGnify, which are accessible using the MGnify accession numbers.

Data availability.

All data are available through EMBL/EBI BioProject accession number PRJEB52998 and the MGnify identification number MGYS00006044. Individual accession numbers for each sample are available on Table 1.

ACKNOWLEDGMENTS

This work was funded by the Applied Farmscape Ecology Research Collaborative of the Hudson Valley FarmHub (Hurley, NY, USA) and the Bard Summer Research Institute of Bard College.

We thank Conrad Vispo, Anne Bloomfield, Tejaswee Neupane, Hannah Herrick, Maram Al Zayyad, Benjamin Bryant, Christopher Benincasa, and Maureen Schulz-O’Callaghan for their assistance in the field and in the laboratory.

Contributor Information

Gabriel G. Perron, Email: gperron@bard.edu.

Leighton Pritchard, University of Strathclyde.

REFERENCES

  • 1.Kim N, Zabaloy MC, Guan K, Villamil MB. 2020. Do cover crops benefit soil microbiome? A meta-analysis of current research. Soil Biol Biochem 142:107701. doi: 10.1016/j.soilbio.2019.107701. [DOI] [Google Scholar]
  • 2.Singh BK, Trivedi P, Egidi E, Macdonald CA, Delgado-Baquerizo M. 2020. Crop microbiome and sustainable agriculture. Nat Rev Microbiol 18:601–602. doi: 10.1038/s41579-020-00446-y. [DOI] [PubMed] [Google Scholar]
  • 3.Santos LF, Olivares FL. 2021. Plant microbiome structure and benefits for sustainable agriculture. Curr Plant Biol 26:100198. doi: 10.1016/j.cpb.2021.100198. [DOI] [Google Scholar]
  • 4.Ventola CL. 2015. The antibiotic resistance crisis: part 1: causes and threats. P T 40:277–283. [PMC free article] [PubMed] [Google Scholar]
  • 5.Bokulich NA, Mills DA. 2013. Improved selection of internal transcribed spacer-specific primers enables quantitative, ultra-high-throughput profiling of fungal communities. Appl Environ Microbiol 79:2519–2526. doi: 10.1128/AEM.03870-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Mitchell AL, Almeida A, Beracochea M, Boland M, Burgin J, Cochrane G, Crusoe MR, Kale V, Potter SC, Richardson LJ, Sakharova E, Scheremetjew M, Korobeynikov A, Shlemov A, Kunyavskaya O, Lapidus A, Finn RD. 2020. MGnify: the microbiome analysis resource in 2020. Nucleic Acids Res 48:D570–D578. doi: 10.1093/nar/gkz1035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Richardson LJ, Rawlings ND, Salazar GA, Almeida A, Haft DR, Ducq G, Sutton GG, Finn RD. 2019. Genome properties in 2019: a new companion database to InterPro for the inference of complete functional attributes. Nucleic Acids Res 47:D564–D572. doi: 10.1093/nar/gky1013. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All data are available through EMBL/EBI BioProject accession number PRJEB52998 and the MGnify identification number MGYS00006044. Individual accession numbers for each sample are available on Table 1.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES