2022 Jan 13;11(1):e01031-21. doi: 10.1128/mra.01031-21

Copyright © 2022 Ji et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG 1. Overview of the MagCluster workflow. (a) Genomes are annotated using Prokka with a mandatory reference file of magnetosome proteins supplied via --proteins. (b) Putative MGCs or MGC-containing contigs are retrieved by the MGC_Screen module from the GenBank files generated by the annotation module. First, contigs are filtered by contig length (--contiglength) and by the minimum number of magnetosome genes per contig (--threshold). Then, each genomic region containing at least that number of magnetosome genes is checked against the length limit set by --windowsize. Finally, contigs that pass all restrictions are regarded as putative MGC-containing contigs. (b1) Contigs shorter than 2,000 bp (by default) are discarded. (b2) Magnetosome genes are identified through a text-mining strategy using the keyword “magnetosome” in protein names, and contigs containing fewer than 3 (by default) magnetosome genes are discarded. (b3) Putative MGCs are screened under a 10,000-bp (by default) window, and the minimum number of magnetosome genes (3 by default) in each window is rechecked. (c) Putative MGCs are compared and visualized using clinker. MAGs, metagenome-assembled genomes; SAGs, single amplified genomes.
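
The three screening steps in panel b (b1 to b3) can be illustrated with a minimal Python sketch. This is not MagCluster's own code: the `Gene` and `Contig` classes and the `screen_contig` function are hypothetical names introduced here, the keyword match is assumed to be case-insensitive, and a window is assumed to start at the first base of a magnetosome gene and to count only genes lying entirely inside it. The default values mirror those given in the caption.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Gene:
    start: int    # first base of the gene on the contig
    end: int      # last base of the gene on the contig
    product: str  # protein name taken from the annotation


@dataclass
class Contig:
    name: str
    length: int   # contig length in bp
    genes: List[Gene]


def is_magnetosome(gene: Gene) -> bool:
    # Text-mining step: keyword lookup in the protein name.
    # Case-insensitive matching is an assumption of this sketch.
    return "magnetosome" in gene.product.lower()


def screen_contig(contig: Contig,
                  contiglength: int = 2000,
                  threshold: int = 3,
                  windowsize: int = 10000) -> bool:
    """Return True if the contig passes all three filters (hypothetical helper)."""
    # (b1) Discard contigs shorter than the minimum length.
    if contig.length < contiglength:
        return False

    # (b2) Discard contigs with too few magnetosome genes overall.
    hits = [g for g in contig.genes if is_magnetosome(g)]
    if len(hits) < threshold:
        return False

    # (b3) Recheck the gene count inside a window anchored at each hit.
    for anchor in hits:
        window_end = anchor.start + windowsize
        inside = sum(1 for g in hits
                     if g.start >= anchor.start and g.end <= window_end)
        if inside >= threshold:
            return True
    return False


clustered = Contig("contig_1", 15000, [
    Gene(100, 900, "magnetosome protein MamA"),
    Gene(1000, 2000, "magnetosome protein MamB"),
    Gene(2500, 3500, "Magnetosome protein MamK"),
    Gene(5000, 6000, "hypothetical protein"),
])
scattered = Contig("contig_2", 50000, [
    Gene(100, 900, "magnetosome protein MamA"),
    Gene(20000, 21000, "magnetosome protein MamB"),
    Gene(40000, 41000, "magnetosome protein MamK"),
])
print(screen_contig(clustered), screen_contig(scattered))
```

In this sketch, `contig_1` passes because its three magnetosome genes fall within a single 10,000-bp window, whereas `contig_2` meets the per-contig gene count in b2 but fails the window recheck in b3 because its genes are too far apart.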