2018 May 28;6:e4925. doi: 10.7717/peerj.4925

This is an open access article, free of all copyright, made available under the Creative Commons Public Domain Dedication. This work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.

ITS1 and ITS2 sequences were extracted from the UNITE v7.2 general fasta release database using “AMPtk database.” Identical sequences were collapsed (dereplication) and the remaining sequences were clustered using UPARSE (“cluster_otus”) to generate the total number of UPARSE OTUs expected for the (A) ITS1 and (B) ITS2 regions. The data were then processed to five different lengths (150, 200, 250, 300, and 350 bp) and clustered (UPARSE “cluster_otus”) using (i) default UPARSE truncation (longer sequences are truncated and shorter sequences are discarded), (ii) padding with ambiguous bases (longer sequences are truncated and shorter sequences are padded with N’s to the length threshold), and (iii) full-length sequences (longer sequences are truncated and shorter sequences are retained if the reverse primer is found). Pre-processing with full-length or padded sequences outperforms default UPARSE truncation.
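
The three pre-processing strategies compared in the caption can be sketched in a few lines of Python. This is only an illustration of the length-handling rules as described above, not the AMPtk or UPARSE implementation. The function name `preprocess`, the mode labels, and the `primer_found` flag are hypothetical, and primer detection itself is assumed to happen elsewhere.

```python
from typing import Optional


def preprocess(seq: str, length: int, mode: str,
               primer_found: bool = False) -> Optional[str]:
    """Apply one of three length-handling rules to a single read.

    Hypothetical helper for illustration only. Returns the processed
    sequence, or None if the read would be discarded.
    """
    if mode not in ("truncate", "pad", "full_length"):
        raise ValueError(f"unknown mode: {mode}")

    # All three strategies truncate reads that reach the threshold.
    if len(seq) >= length:
        return seq[:length]

    # The strategies differ only in how shorter reads are handled.
    if mode == "truncate":
        # (i) default truncation: shorter reads are discarded
        return None
    if mode == "pad":
        # (ii) padding: fill with ambiguous bases up to the threshold
        return seq + "N" * (length - len(seq))
    # (iii) full-length: keep the short read as-is, but only when the
    # reverse primer was found (assumed to be checked by the caller)
    return seq if primer_found else None


if __name__ == "__main__":
    short_read = "ACGTACGT"
    for mode in ("truncate", "pad", "full_length"):
        print(mode, preprocess(short_read, 12, mode, primer_found=True))
```

Under these rules, only default truncation loses every read shorter than the threshold, which is consistent with the caption's observation that it recovers fewer of the expected OTUs.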