Skip to main content
PLOS One logoLink to PLOS One
. 2022 Nov 22;17(11):e0277819. doi: 10.1371/journal.pone.0277819

Role of DNA modifications in Mycoplasma gallisepticum

Tatiana A Semashko 1,2,*, Alexander A Arzamasov 1, Daria V Evsyutina 1,2, Irina A Garanina 1, Daria S Matyushkina 1,2, Valentina G Ladygina 1, Olga V Pobeguts 1, Gleb Y Fisunov 1,2, Vadim M Govorun 2
Editor: Mary Bryk3
PMCID: PMC9681074  PMID: 36413541

Abstract

The epigenetics of bacteria, and bacteria with a reduced genome in particular, is of great interest, but is still poorly understood. Mycoplasma gallisepticum, a representative of the class Mollicutes, is an excellent model of a minimal cell because of its reduced genome size, lack of a cell wall, and primitive cell organization. In this study we investigated DNA modifications of the model object Mycoplasma gallisepticum and their roles. We identified DNA modifications and methylation motifs in M. gallisepticum S6 at the genome level using single molecule real time (SMRT) sequencing. Only the ANCNNNNCCT methylation motif was found in the M. gallisepticum S6 genome. The studied bacteria have one functional system for DNA modifications, the Type I restriction-modification (RM) system, MgaS6I. We characterized its activity, affinity, protection and epigenetic functions. We demonstrated the protective effects of this RM system. A common epigenetic signal for bacteria is the m6A modification we found, which can cause changes in DNA-protein interactions and affect the cell phenotype. Native methylation sites are underrepresented in promoter regions and located only near the -35 box of the promoter, which does not have a significant effect on gene expression in mycoplasmas. To study the epigenetics effect of m6A for genome-reduced bacteria, we constructed a series of M. gallisepticum strains expressing EGFP under promoters with the methylation motifs in their different elements. We demonstrated that m6A modifications of the promoter located only in the -10-box affected gene expression and downregulated the expression of the corresponding gene.

Background

Mycoplasma gallisepticum belongs to the class Mollicutes, which is characterized by reduced genome size, lack of a cell wall, and simplified organization of cells. Owing to these properties, it serves as a convenient model for minimal cells [14]. Despite its simplicity, the overall regulation mechanisms in M. gallisepticum are poorly understood, and DNA modifications add another layer of complexity to the same.

The most frequent modifications of genomic DNA in prokaryotes include N6-methyladenine (m6A), N4-methylcytosine (m4C), and, more rarely, 5-methylcytosine (m5C) [5, 6]. These modifications are introduced by methyltransferases (MTases), which are generally components of the restriction-modification (RM) systems in bacteria. RM systems can be classified into four types. Type I RM systems form a multi-subunit complex consisting of MTase, restriction, and sequence-recognition subunits. Type II RM systems include two separate enzymes for modification and restriction. Type III RM systems form a complex between MTase and restrictase. Type IV RM systems consist of sole restrictases that, unlike the previous types, cleave modified DNA [7].

The primary function of RM systems is defense against exogenous DNA, such as phages and plasmids [7]. However, in prokaryotic organisms, orphan MTases of Type II RM systems without cognate restriction enzymes can mediate epigenetic regulation of gene expression through DNA modification. Examples of orphan m6A-MTases include DNA adenine MTase (Dam) in Gammaproteobacteria, cell cycle-regulated MTase (CcrM) in Alphaproteobacteria, and orphan m5C-MTase DNA cytosine MTase (Dcm) in Escherichia coli. Dam is known to participate in methylation-dependent bacterial gene silencing, DNA replication, and DNA mismatch repair, while CcrM participates in the regulation of the cell cycle [8]. Even though Dcm methylation is conserved, its role remains unclear [9]. A large number of other functional orphan MTases have been identified by means of high-throughput search for epigenetic modifications in prokaryotes [6]. In addition, complete RM systems can carry out both defense and regulatory functions. For example, it has been shown in Helicobacter pylori that methylation guided by the Type I RM system affects its gene expression [10] and switching to the variable Type III RM system may affect virulence [11]. For Streptococcus pneumoniae, switching between asymptomatic and invasive phenotypes is mediated by rearrangements in the Type I RM system [12].

RM systems in mycoplasmas are still under research. For example, a new Type II RM system for Ureaplasma parvum, isolated from human placenta, has recently been described [13]. Phase-variable systems are also present in mycoplasmas, in addition to classical RM systems. For instance, there are phase-variable forms of the Type I RM system in Mycoplasma pulmonis, where the polymorphic hsdS genes produced by gene rearrangement encode a family of functional S subunits with differing DNA sequence specificities [14]. A Type III RM system from Mycoplasma mycoides subsp. capri has been described, where MTase gene expression has a phase-variable activity [15]. DNA modifications in Mycoplasma genitalium and Mycoplasma pneumoniae have been characterized at the whole-genome level, and enzymes performing these modifications have been identified [16]. In a previous study, we characterized a transcription factor that controls the expression of the Type I RM system MgaS6I in M. gallisepticum S6 [17].

In this study, we performed whole-genome identification of M. gallisepticum S6 DNA modifications, analyzed the distribution of its motifs, and identified the enzymes responsible for the modifications. In addition, we also investigated the protective and epigenetic effects of the modifications on mycoplasma.

Results

Identification of genomic DNA modifications and distribution of their motifs along the genome

The whole-genome strand-specific detection of DNA modifications in M. gallisepticum S6 was studied using single-molecule real-time (SMRT) sequencing [18]. Currently, this technique is a major method for the detection of bacterial DNA modifications [6, 19]. SMRT sequencing allows for the identification of a number of modifications, one of which, m6A, has been predicted in M. gallisepticum. Mycoplasma genome was sequenced with 90× coverage per strand and with an average read length of 8 kb [S1 Table in S1 File, column wild-type (WT)]. The m6A modification was the only one identified with high quality. Modified positions with low-quality identifications also presented several motifs. Considering that these motifs were significantly degenerate and only a small fraction of them were modified in the genomic DNA, we concluded that they represented noise and only one actual DNA modification motif existed in the S6 strain. The predicted high-quality DNA modification motif was ANCNNNNCCT/AGGNNNNGNT (with the modified nucleotides indicated in bold). Methylation occurs at both the adenines of the double-stranded motif, in the direct and complementary strands. The characteristics of the modification motifs are shown in Table 1.

Table 1. Characteristics of the modification motif ANCNNNNCCT.

Mean Score–mean Modification QV of instances of this motif that were detected as modified; Mean IPD Ratio–mean interpulse duration (IPD) ratio of instances of this motif that were detected as modified; Mean Coverage–mean coverage of instances of this motif that were detected as modified.

Motif Position Modification Type Fraction Number of Detected Motifs Number of Motifs in Genome Group of Motif Partner of Motif Mean Score Mean IPD Ratio Mean Coverage
ANCNNNNCCT 0 m6A 1 1312 1312 ANCNNNNCCT/AGGNNNNGNT AGGNNNNGNT 139.5 5.96 89.8
AGGNNNNGNT 0 m6A 0.97 1273 1312 ANCNNNNCCT/AGGNNNNGNT ANCNNNNCCT 126.2 4.98 89.9

In total, 1312 sites of the modification motif were identified in the genome, with a medium density of approximately 1 site per 1 kb (Fig 1, rings B and C). There were several extended regions without modification sites, which were up to 6.7 kb long. Mobile genetic elements were predominant within the unmethylated regions. The coding sequences located within respective regions, apart from transposases, did not belong to any common functional group. However, the regions with the highest density of methylated sites corresponded to the gene clusters involved in virulence (S2 Table in S1 File).

Fig 1. Distribution of DNA modification sites along the genome of M. gallisepticum S6.

Fig 1

A–Annotated features: RM systems are in orange, vlhA clusters in green, the CRISPR system in grey, the virulence cluster in blue, and mobile elements in black. B and С –Whole-genome representation of the DNA modification motif ANCNNNNCCT for plus and minus strand, respectively. The lines indicate positions of sites, and the colors from red to blue represent percentage of methylation in each site. Percentage of methylation was calculated using the build-in SMRT protocol, as the fraction of reads aligning to the position that has a modified base. D–Density of methylation sites. Maximum density of methylation is highlighted in orange, while minimum is highlighted in green. E–GC content. Maximum GC content is highlighted in red, while minimum is highlighted in blue. F–Hypo-methylated sites are in green, hemi-methylated sites in orange and blue (for plus and minus strands, respectively); proteins with differential abundance (as assessed using 2D-DIGE) between WT and ΔS.MgaS6I strains are in grey.

Majority of the identified sites were fully modified at both the strands, with a median fraction of methylated adenines of 0.94 (Fig 1, ring B; Table 1). Nevertheless, there was a small fraction of hemi- and hypo-methylated sites. Only 39 sites were hemi-methylated. Since we performed a high-coverage SMRT sequence, we estimated the proportion of methylated adenines in each of the 1312 methylation sites, for adenines in both the strands. Fig 2 shows the number of sites at different methylation levels. Sites located in different parts of the genome relative to the origin and midpoint of replication, either on the plus or minus strand, were divided into different groups (Fig 2A). This corresponds to the maternal and daughter DNA strands during bacterial division, as well as, the leading and lagging strands during replication. We also compared the efficiency of methylation for both the strands, ANCNNNNCCT/AGGNNNNGNT, in the methylation motif (Fig 2B). The position of under-methylated adenines in the hemi-methylated sites did not depend on the location in the genome relative to the replichore (Fisher’s exact test two-tailed P-value>0.05). Interestingly, all under-methylated adenines were located on the AGGNNNNGNT strand of the motif (Fig 2, S3 Table in S1 File). There were only six hypo-methylated sites with a fraction of both modified adenines less than 0.75 (S4 Table in S1 File). Neither hypo- nor hemi-methylated sites featured additional sequence motifs, as seen in the fully methylated sites.

Fig 2. Methylated fraction of each methylation site.

Fig 2

A–Number of sites with different fractions of methylation. Colors represent sites located in one of the four parts of the genome: between the origin and midpoint of the chromosome, on the plus or minus strand. The panel on the right schematically shows the methylation sites that were counted in each group. B–Number of sites with different fractions of methylation. In this figure, the location of sites in the genome has not been considered. Colors represent the sites on the ANCNNNNCCT or AGGNNNNGNT strands in the double-stranded methylation motif. The panel on the right schematically shows the methylation sites that were counted in each group.

RM systems in M. gallisepticum and their functionality

Among all known enzymes capable of modifying DNA, M. gallisepticum S6 has two annotated type I RM systems, both of which are predicted to methylate the N6-position in the adenine. The first system shares homology with the RM systems found in other strains of M. gallisepticum. Frameshift mutations have disrupted the genes encoding subunits of MTase HsdM and the DNA sequence-recognizing protein HsdS (specificity subunit). Mutation in the MTase is found only in the S6 strain, while disruption of the S-subunit is shared by all strains of M. gallisepticum we analyzed. The other RM system (MgaS6I) is intact, potentially functional, and unique for the S6 strain. MgaS6I is present in the S6 strain and none of the others, its representation in the genomic context of different strains is shown in S1 Fig. In addition to the proteins forming the RM system complex, it has a regulatory C-subunit and an HTH-family transcription factor, whose activity was demonstrated in our previous work [17] (Fig 3). Despite the frameshifts discussed above, transcripts and proteins of both the RM systems exist on the transcriptome and proteome levels of the S6 strain [3, 20].

Fig 3. Restriction-modification systems of M. gallisepticum S6.

Fig 3

Arrows represent the ORFs; grey, blue, green, and orange colors indicate genes hsdC (controller protein), hsdM (methyltransferase), hsdS (specificity subunit), and hsdR (restriction subunit), respectively. Ori–origin of replication. Faded colors represent disrupted genes.

To assign the identified methylated sites to the activity of a particular RM system, we obtained a knockout mutant with a disrupted sequence-recognition subunit S.MgaS6I (GCW_02360) of the unique RM system (ΔS.MgaS6I) using transposon mutagenesis. The Type I RM system works as a multi-subunit complex where the S subunit recognizes and binds to the DNA sequence, following which the MTase and restrictase are able to work. Therefore, we assumed that in mutants with both the S subunits disrupted, the DNA modification systems would be non-functional. Whole-genome DNA modifications in WT M. gallisepticum were compared with the modifications in ΔS.MgaS6I (S1 Table in S1 File). After de novo genome assembly, we did not observe any significant differences between the strains, except for the transposon insertion site (S7 Table in S1 File). The m6A modification and methylated ANCNNNNCCT motif were identified only in the WT strain, and the ΔS.MgaS6I strain lacked them. This indicates that the RM system common for M. gallisepticum strains is non-functional, at least in the S6 strain. The identified motif was typical for the Type I RM system [21] and belonged to the unique RM system MgaS6I. Proteins of this system are encoded by an operon consisting of four genes: XRE family transcriptional regulator (GCW_02350), methyltransferase (GCW_02355), specificity subunit (GCW_02360), and restrictase (GCW_02365).

MgaS6I was not found to have close homologs in other whole-genome sequenced M. gallisepticum strains. However, its particular components hsdM and hsdS demonstrated approximately 70% sequence similarity with the respective genes from Enterococcus cecorum strains SA1 and SA2, which are pathogenic isolates from chickens [22]. Within mycoplasmas, hsdS homolog was found in Mycoplasma hyosynoviae isolates from pigs with arthritis symptoms and shared about 65% sequence similarity [23].

The identified modification motif ANCNNNNCCT is unique among those represented in the REBASE database, and its specificity for HsdS homologs is unknown [7]. DNA binding of the HsdS subunits is mediated by the target recognition domains (TRDs) that define specificity of Type I RM systems. TRDs are two similar globular domains, each of which recognizes half of the DNA sequence specificity motif [21, 24, 25]. TRDs recognizing AGG and ANC halves of the motif were selected using the REBASE database. Sequence comparison did not reveal a phylogenetic relationship between the TRD from MgaS6I and those from other S-subunits recognizing the same sub-motifs (Fig 4).

Fig 4. Phylogenetic tree of TRDs of specificity subunits of M. gallisepticum S6 and other TRDs with similar specificity.

Fig 4

The names in the leaves represent the TRD names, which consist of the REBASE name of the HsdS and its sequence specificity. The numbers on the nodes are distance measures calculated using BLOSUM62 substitution matrices.

Methylation effect of sites in the native location on gene expression

To investigate the effect of DNA methylation on gene expression, we searched for methylation sites in the regulatory regions of the genes using previous data on the transcription start sites (TSSs) in M. gallisepticum [3]. Only 26 genes have the methylation sites in their regulatory regions, those in the promoter (-40 to -1 relative to TSS), upstream to the promoter (-100 to -40 relative to TSS), or within the 5’-UTR (S5 Table in S1 File). The number of methylation sites in the regulatory regions of genes in M. gallisepticum S6 was underrepresented compared to the whole genome (as assessed using Fisher’s exact two-tailed test P-value = 0.001). Only two modification sites overlapped with the core promoter of the genes of 50S ribosomal protein L31 (GCW_00775) and the molecular chaperone dnaJ_4 (GCW_01610), both of which reside near the -35 box, which is insignificant for mycoplasma [3]. Several modification sites overlapped with putative regulatory sequences upstream of the TSS and 5’-UTR for 13 and 11 genes, respectively. The RM system operon did not have modification motifs, in its own promoter or in the coding part of the controller protein.

The effects of methylation on gene expression were investigated using RT-qPCR by comparing gene expression in WT M. gallisepticum and the ΔS.MgaS6I strain (Fig 5). All strains were in the exponential phase of growth, similar to the conditions used for the SMRT analysis. The genes with a methylation motif in the -35 box of promoter (dnaJ_4 and its homologs dnaJ_2 and dnaJ_6, as additional controls), TSS (DNA topoisomerase I topA GCW_03565), upstream of the promoter (transketolase tktA_1 GCW_00160, RNA 3-phosphate cyclase scpA GCW_92264, and peroxyredoxin osmC homolog osmC_2 GCW_03005), within the 5’-UTR (thioredoxin trxA_1 GCW_00460, transcriptional regulator yebC GCW_00725, and replication initiation/membrane attachment protein dnaB GCW_01380) were selected as genes with methylation in potentially regulatory elements. All these genes, except dnaB, were found to be fully methylated in the SMRT analysis. Additionally, we studied the expression of genes with different numbers of methylation sites (without methylation sites–DNA ligase ligA GCW_03920 and DNA gyrase subunit B gyrB GCW_04070; genes containing more than average sites–excinuclease ABC subunit B uvrB GCW_00205 and molecular chaperone groEL GCW_02710) or with hemi-methylation status (sites in 5’-UTR–dnaB and uracil-DNA glycosylase ung GCW_03435; sites in protein–histone-like DNA-binding superfamily protein hup_2 GCW_02335) (S3 and S4 Tables in S1 File). We also measured the gene expression of two methyltransferases, hsdM from MgaS6I and mraW. The genes encoding enolase eno GCW_02860, glyceraldehyde-3-phosphate dehydrogenase gapd GCW_01780, elongation factor Tu tuf GCW_01315, gyrB, and ligA were chosen as controls. Among them, eno, gapd, and tuf were considered as housekeeping genes, and the gyrB and ligA genes did not contain methylation sites in their sequence.

Fig 5. Differences in expression levels of genes with DNA methylations sites between WT and ΔS.MgaS6I strains.

Fig 5

Groups of control genes eno, gadp, tuf, ligA, gyrB, dnaJ_2, and dnaJ_6; genes with the methylation motif in promoters topA, dnaJ_4, tktA_1, scpA, osmC_2, trxA_1, yebC, and dnaB; genes with different numbers of methylation sites or with hemi-methylation status ligA, gyrB, dnaB, hup_2, ung, uvrB, and groEL; genes of methyltransferases hsdM from MgaS6I and mraW have been presented. Genes with significant difference between strains (Student’s t-test, Benjamini–Hochberg correction, P<0.05) are shown using red boxes and asterisks.

It was shown that there were no systemic effects on gene expression for methyltransferase genes, for the genes containing more than average sites or hemi-methylated sites or for the genes with the methylation sites in their regulatory regions. The last group was select genes with methylation sites within the 5’-UTR, genes with the methylation sites upstream to the promoter, gene topA with the methylation site near the TSS, and gene dnaJ_4 with the methylation site in the -35 box of the promoter. These facts did not allow us to draw concrete conclusions about the potential influence of native methylation on gene expression.

Protein changes in the absence of genomic DNA methylation

We further assessed the impact of DNA modifications at the proteome level by comparing the WT mycoplasma and ΔS.MgaS6I strain (Fig 6) using two-dimensional difference gel electrophoresis (2D-DIGE). Changes in the proteome can be caused by methylation and genetic modifications in the knockout ΔS.MgaS6I strain, which has a tetM insertion, and thus resistance to tetracycline. To discriminate these effects on protein abundance, we performed 2D-DIGE analysis for three other strains with tetM insertion in various genomic regions (2D-DIGE maps not shown). Changed only in ΔS.MgaS6I strain major proteins included single-stranded DNA-binding protein GCW_03460, phosphoglycerate kinase GCW_01785, cytadherence protein GCW_01845, and 4 proteins with unknown functions, GCW_00815, GCW_00850, GCW_00875, and GCW_02685. None of the changes observed were directly linked to the presence of DNA methylation in the respective genes, indicating that they were indirectly affected.

Fig 6. Comparison of the proteomic profiles of the wild-type M. gallisepticum (red) and the ΔS.MgaS6I strain (green) with unmethylated DNA.

Fig 6

Proteins, whose abundances reproducibly changed in all the studied strains with tetM insertion in the various genomic regions, as compared to that in the WT, are in blue, while differences that are unique to ΔS.MgaS6I are in white.

Protective function of the RM system

The classic function of RM systems is to protect against exogenous DNA. To characterize this function of the MgaS6I system, we performed an experiment by introducing a transposon insertion into the ΔS.MgaS6I strain, which disrupted the RM systems in comparison to those of WT M. gallisepticum, and four other strains with tetM insertion in various intergenic regions. The transposon for this experiment carried resistance genes to chloramphenicol, camR, and five methylation sites ANCNNNNCCT, one of which is in camR. We demonstrated that inactivation of the MgaS6I RM system resulted in 19-fold more efficient transposon insertion (Fig 7).

Fig 7. The efficiency of camR insertion into wild-type M. gallisepticum, ΔS.MgaS6I strain, and four strains with tetM insertion in various intergenic regions.

Fig 7

CFUs of strains with significant difference, as compared to the CFU of ΔS.MgaS6I strain (Student’s t-test, P<0.05), have been indicated using asterisks.

Effect of m6A modification in the promoter on gene expression

It is known that epigenetic signals that affect DNA-protein interactions are caused by methylated bases in DNA [26, 27]. In M. gallisepticum S6, methylation sites in the regulatory regions are underrepresented, and no methylated base is located in the promoter sites important for gene expression, except for methylation in the -35 box, which is not very important for mycoplasma [3]. To investigate the effect of m6A DNA methylation on the efficiency of gene expression by RNA polymerase, we designed constructs with an EGFP reporter under different artificial promoters. Constructs included methylation sites in the promoters overlapping with -10 box, -35 box, region between the -10 and -35 box, or in the TSS; as a control, each construct had a corresponding construct, where the methylation site was inactivated by means of sequence inversion (Fig 8B). The inversion did not disrupt the promoter itself or the local GC content. Each of these constructs was integrated into the WT strain of M. gallisepticum using a transposon vector. Three biological repeats with different transposon locations in the genome for each promoter type were used for gene expression measurements. The genes eno, gapd, tuf, gyrB, and ligA were chosen as another set of controls, as done in the experiment on the methylation effect of sites in the native location on gene expression. We showed that the level of gene expression did not differ by more than two-fold between the paired constructs, with or without a methylated promoter, for almost all the studied genes. Only methylation of the -10-box resulted in a 4-fold downregulation of transcription (Student’s t-test, Benjamini–Hochberg correction, P = 0.03) (Fig 8A).

Fig 8.

Fig 8

A–Difference between the expression levels of paired genes for mutants with EGFP genes under synthetic promoters, with or without methylation sites. For each pair, the difference for EGFP gene in comparison with control genes eno, gadp, tuf, ligA, and gyrB is shown. Genes with significant difference between promoters with or without the methylation site (Student’s t-test, Benjamini–Hochberg correction, P<0.05) are shown with red boxes and asterisks. B–Sequences of paired EGFP promoters with or without methylated sites in -10 box, -35 box, between -35 and -10 boxes, and in the TSS. Methylation sites are underlined; methylated adenines in forward and reverse strands are in bold; -10 boxes are in blue and highlighted with blue boxes; -35 boxes are highlighted with green boxes and the sequence with strong consensus to the -35 box is in green; TSSs are in red and marked with red arrows; sequence differences between paired methylated or non-methylated promoters are in pink.

Discussion

In the present study, we investigated the DNA modifications in Mycoplasma gallisepticum and the enzymes involved in these modifications. We separately considered the modifying enzymes and the effect of the modifications on cell function.

We demonstrated that there is only one functional RM system, MgaS6I, in the M. gallisepticum strain S6, which seems to be acquired by via horizontal gene transfer, since it did not have close homologs in other whole-genome sequenced M. gallisepticum strains and a unique location in a genomic context compared to other strains. It is likely that for all these M. gallisepticum strains, the functional DNA methylation system exists only in the strain S6. MgaS6I is a Type 1 RM system. It differs from the more studied Type II RM systems as it functions as a multi-subunit complex in which the sequence recognition subunit HsdS is bound to DNA, and MTase and restrictase subunits are bound to HsdS. MgaS6I methylates the N6-position of adenine in the ANCNNNNCCT motif (methylation occurs at both the adenines of the double-stranded motif, in the direct and complementary strands).

In prokaryotes, there is an effect of underrepresentation in the genome of restriction sites of RM systems, which is called avoidance [28]. In this case, the recognition site is destroyed by nucleotide substitutions in it. This phenomenon is mainly associated with Type II RM systems and depends on the lifespan of the RM system proteins in the organism[29]. No destroyed ANCNNNNCCT motifs were found in the S6 strain in comparison with other M. gallisepticum strains. This may be due to DNA cleavage at a distance from the recognition site by the Type I RM system. Another explanation is that since restrictase works in conjunction with MTase in the studied RM system of Type I, this imposes a limitation on the existence of unmethylated DNA sites and their further exclusion from the genome.

As mentioned above, almost all sites of the modification motif were fully methylated. Notably, only few methylation sites in genomic DNA were hypo- or hemi-methylated, and we did not identify any additional common sequence motifs in these cases. We hypothesize that hypo- or hemi-methylation may be caused by the lower local accessibility of DNA, particularly due to the chromatin structure or occupancy of nearby positions by DNA-binding proteins.

An interesting characteristic of MgaS6I is greater methylation efficiency for the ANCNNNNCCT strand, as compared to the AGGNNNNGNT strand. Thus, the level of methylation does not depend on whether the DNA chain is newly synthesized during replication but depends on MgaS6I affinity instead. The apparent affinity is stronger on the AGGNNNNGNT strand of the methylation motif, taking into account that binding and methylation occur on opposite strands [21].

The main function of RM systems in bacteria is to protect against foreign DNA. Several mycoplasma DNA viruses have been described to date. The list includes dsDNA phages of Mycoplasma bovirhinis, M. hyorhinis, and M. pylmonis [30], which are characterized by their shape. Additionally, sequences of phages of M. pylmonis [31] and M. arthridis [32] were determined, and prophages in the genomic sequences of M. fermentans [33] and M. agalactiae [34] were also discovered. In the example of M. gallisepticum transformation, we demonstrated that the lack of MgaS6I RM system increases the transformation efficiency by 19-fold. Our data is consistent with that of another study in which the authors cloned a Mycoplasma mycoides genome as a yeast centromeric plasmid. Only considering the action of RM systems during genome transplantation into Mycoplasma capricolum, a sufficient number of viable M. mycoides cells were obtained in this study [35].

RM systems may also demonstrate the properties of selfish elements [36]. We found no evidence that MgaS6I enzymatic activity affects the Mycoplasma genome, but we have previously shown that its presence in the host genome has changed the implementation of genetic information. The controller subunit HsdC of MgaS6I can bind a set of promoters in addition to one in its own operon [17]. Thus, indirect participation of MgaS6I in the evolution of M. gallisepticum can be proposed.

There is growing evidence of the epigenetic role of m6A modifications in bacteria. So, N6-methylated adenine can influence DNA-protein interactions [27]. The number of methylation sites in the promoter regions of genes in M. gallisepticum S6 is underrepresented compared to the whole genome sequence. We also estimated enrichment ratio of ANCNNNNCCT in set of promoter sequences and in 100 sets of random generated sequences with the same length and GC content of the M. gallisepticum S6. In this case enrichment ratio was 0.49±0.11. Additionally, ANCNNNNCCT sites in the genome of the closest strain without the MgaS6I system, M. gallisepticum Rlow, exhibits a similar level of counts. Therefore, underrepresentation of methylation sites in promoter regions is possible due to the overall low GC content.

We did not find any influence of native methylation on gene expression in M. gallisepticum S6. In addition, phenotypic differences in the absence of methylation, as detected using 2D-DIGE, did not show clear correlations between methylation DNA and variation in protein levels. For M. pneumoniae, there was no correlation between the MTase M.MpnI gene and genes with methylation sites in their regulatory sequences at the transcription level [16]. We can speculate that M. gallisepticum S6 acquired the RM system MgaS6I, which did not introduce dramatic changes in cell regulation, as compared to that of the other strains of M. gallisepticum.

To demonstrate the possibility of epigenetic regulation by m6A in Mollicutes, we constructed M. gallisepticum strains with EGFP expression under artificial promoters with methylation motifs in their different elements, which were lacking native methylation sites in M. gallisepticum S6. We showed that only methylation in the -10 box of the promoter affected gene expression. Previously, we demonstrated that the -10 box and the first nucleotide of the transcript are the main features of functional promoters in M. gallisepticum [3]. We assume that methylation of the -10 box of the promoter results in weaker binding of the RNA polymerase complex or alters the melting or conformation of DNA strands in the promoter, thus leading to downregulation of expression of the respective gene. Another reason might be the competition between MgaS6I and RNA polymerase for the promoter. On extrapolating these data to all representatives of Mollicutes with a reduced genome and RNA polymerase with close homology, we showed that m6A modification affects gene expression. Consequently, the presence of DNA modifications in the -10 box of promoters of genes of M. gallisepticum could play a role in the epigenetic regulation of M. gallisepticum and related bacteria.

Conclusion

In this study, we identified DNA modifications in M. gallisepticum S6 at the whole-genome level. The only functional system capable of DNA modification was MgaS6I, a unique Type I RM system, encoded by genes- MTase GCW_02355, the specificity subunit GCW_02360, and the restrictase GCW_02365. We identified a unique methylation motif ANCNNNNCCT according to the REBASE database. Our results indicate that the MgaS6I RM system has a protective function and no regulatory effect in M. gallisepticum, that is, native methylation of DNA does not directly affect gene expression or protein abundance. This is due to the lack of methylation in the regulatory regions of the genes. When m6A modifications are present in the -10 box of the promoter, they influence gene expression and downregulate the expression of the corresponding gene in M. gallisepticum and related bacteria.

Methods

Cell culture

The Mycoplasma gallisepticum S6 strain was provided by Prof. S.N. Borkhsenius, Institute of Cytology, St. Petersburg, Russian Academy of Science. M. gallisepticum S6 was cultivated in a liquid medium containing tryptose (20 g/l), Tris (3 g/l), NaCl (5 g/l), KCl (5 g/l), yeast dialysate (5%), horse serum (10%), and glucose (1%) at pH = 7.4 and 37°C under aerobic conditions. Cells were passaged at a dilution of 1:10, twice for 24 h, starting from the frozen culture prior to the experiment [37].

Construction of transformants of M. gallisepticum

The knockout mutant in hsdS GCW_02360 (ΔS.MgaS6I) was selected from a library of random transposon knockouts (Arzamasov et al., manuscript in preparation). Construction of a vector for the transformation of M. gallisepticum was performed as described in [3]. Transformation was performed using electroporation, as described previously [38]. Localization of the position of transposon integration was confirmed using Sanger sequencing on an ABI Prism 3730XL system (Applied Biosystems, USA) using the primers W2 (5’-AACCTGTATGCCAACCGAGG-3’) and W3 (5’-GTGAGCAAGTACACCGATGTTAAT-3’), complementary to the sequences of the ends of the tetracycline resistance gene tetM. Localization of the transposon position and its single insertion in the ΔS.MgaS6I mutant was additionally confirmed using SMRT sequencing.

For studying the effect of methylation on gene expression, strains with EGFP expression under different promoters were constructed, and the EGFP gene was cloned into the Tn4001-based vector described in [3], between the SphI and XbaI sites. Promoter variants were chemically synthesized and cloned into the plasmid, between the ApaI and SphI sites. Sequences of oligos for promoters with methylations in the following regions: -10 box, -35 box, between -10 and -35 box, and TSS are shown in S6 Table in S1 File. Transformation and localization of the transposon integration position were performed for the ΔS.MgaS6I mutant. Three biological samples were selected for the transformants with artificial promoters.

The mutants used as controls in the experiments involving 2D-DIGE and assessment of the protective function of the MgaS6I RM system were selected from the same library of random transposon knockouts as the ΔS.MgaS6I mutant.

DNA isolation

DNA was isolated from WT Mycoplasma gallisepticum S6 and the ΔS.MgaS6I mutant in the exponential phase of growth using the QIAamp® DNA Micro Kit (Qiagen, USA), according to the manufacturer’s instructions. The amount of DNA was determined using a Qubit® 2.0 fluorometer (Thermo Fisher Scientific, USA), and the quality of DNA was estimated using electrophoresis on a 1% agarose gel.

DNA sequencing

SMRT sequencing [18] was performed at the DNA Sequencing and Genomics Lab, Institute of Biotechnology, University of Helsinki. Libraries were prepared following the SMRTbell template preparation protocol with 8–10 kb inserts and sequenced on the PacBio® RS II (Pacific Biosciences, USA) using DNA/Polymerase Binding Kit P6 v2, DNA Sequencing Reagent Kit 4.0 v2 with C4 chemistry, and SMRT Cell v3. Each sample was sequenced in 1 SMRT cell. The coverage of the genome per strand was approximately 90× for WT M. gallisepticum S6 and approximately 110× for the ΔS.MgaS6I strain (S1 Table in S1 File).

Bioinformatics analysis

De novo genome assembly, modification detection, and motif search were performed using the SMRT® Analysis software. Sequencing data were deposited into the Sequence Read Archive (SRA) with the accession SRX10666727 and link https://www.ncbi.nlm.nih.gov/sra/SRX10666727 for WT M. gallisepticum and accession SRX10666728 and link https://www.ncbi.nlm.nih.gov/sra/SRX10666728 for the ΔS.MgaS6I mutant. The circular whole-genome sequence of M. gallisepticum S6 was assembled by us earlier https://www.ncbi.nlm.nih.gov/genome/1113?genome_assembly_id=266085 [2]. Modification detection and motif analysis were performed using the protocol RS_Modification_and_Motif_Analysis.1. Modification data were deposited in the REBASE database [7] (http://rebase.neb.com/cgi-bin/pacbioget?11602). The characteristics of the identified motifs are shown in Table 1.

All subsequent calculations for motif characteristics were performed using in-house R and Python scripts.

The genomes of WT M. gallisepticum and ΔS.MgaS6I mutants were compared using Mauve [39].

The circular genome was visualized using Circos [40].

Specificity subunits with similar specificity to S.MgalS6I are described in S8 Table in S1 File. TRDs were predicted from HsdS protein sequences as those between long conservative alpha helices [21, 25]. The secondary structure prediction for HsdS was built using JPred4 [41]. TRDs with similar specificity for recognizing AGG or ANC half motifs were found in the REBASE database [7]. Alignments of TRDs were made using Clustal Omega [42]. The phylogenetic tree of TRDs was constructed using the Neighbor-Joining tree algorithm and visualized using Jalview 2 [43], and distance measures were calculated using BLOSUM62 substitution matrices.

RNA isolation and RT-qPCR

RNA was isolated from M. gallisepticum and its transformants in the exponential phase of growth using Trizol® LS (Thermo Fisher Scientific), according to the manufacturer’s instructions. The amount of RNA was determined using a Qubit® 2.0 fluorometer (Thermo Fisher Scientific), and the quality of RNA was estimated by conducting 1.5% agarose gel electrophoresis. RNA was treated with DNase I (Thermo Fisher Scientific). cDNA was synthesized from random hexamer primers using Maxima™ H Minus Reverse Transcriptase (Thermo Scientific). Real-time PCR was performed using dNTPs, PCR buffer, Taq polymerase (Lytech, Russia), SYBR® Green I (Invitrogen, USA), and a CFX96 Real-Time PCR Detection System (Bio-Rad, USA), as described previously [37]. The primers used for EGFP were EGFP_F 5’-GACCCTGAAGTTCATCTGCACC-3’ and EGFP_R 5’-TAGTTGTACTCCAGCTTGTGCC-3’; topA_F 5’-AACTTGAAGAAGTCAGTGGGAT-3’ and topA_R 5’-TCTTACCATATTCTGGACTAAGACG-3’; primers for other genes have been described previously [37]. Primers were designed using BAC-Browser tools [44] and NCBI Primer-BLAST [45]. Quantitative data were normalized to the geometric mean of the eno, gapd, tuf, gyrB, and ligA genes. Relative changes in gene expression were calculated using the 2−ΔΔCT method. The significance of changes was calculated using Student’s t-test with multiple hypotheses testing correction using the Benjamini–Hochberg procedure.

2D-DIGE

Cell lysis, 2D-DIGE, and sample trypsin digestion with subsequent MALDI analysis were performed as previously described [2]. After cell lysis, proteins were covalently tagged with CyDyeDIGE Cy5 (red, WT M. gallisepticum) and CyDye DIGE Cy3 (green, knockout mutant) (Amersham Bioscience, USA), according to the manufacturer’s instructions (400 pmol for 50 μg protein).

To assess the effects of tetracycline or tetracycline resistance on protein abundance, we performed control 2D-DIGE with three other knockout mutants with tetM insertion in the GCW_01045, GCW_01060, and GCW_03935 genes.

Protective function of the MgaS6I RM system

Four strains with tetM insertion in various intergenic regions were selected for comparison from the same library of random transposon knockouts (Arzamasov et al., manuscript in preparation) as the strain ΔS.MgaS6I. Construction of a vector for transformation of M. gallisepticum was performed as described in [3] except for the presence of the chloramphenicol resistance gene camR instead of the tetracycline resistance gene tetM. All strains before transformation were in the middle-log phase. Transformation was performed by means of electroporation, as described previously [38]. Selection of clones with camR insertion was carried out by growing the cells in the presence of 20 mkg/ml chloramphenicol. CFUs obtained after transformation were normalized to the mass of total RNA.

Supporting information

S1 Fig. Genome alignment of Mycoplasma gallisepticum strains obtained using the program Mauve.

RM system MgaS6I is marked in orange, variable lipoproteins vlhA clusters in green, the CRISPR system in blue. M. gallisepticum S6 genome is on the top, below the genomes of the other strains. A–whole genome representation, B–fragment of genomes near MgaS6I genomic context.

(PDF)

S1 File. Supplementary tables S1-S8.

(XLSX)

Acknowledgments

We thank Mr. Lars Paulin and DNA Sequencing and Genomics Lab, Institute of Biotechnology, University of Helsinki for the PacBio sequencing.

List of abbreviations

2D-DIGE

two-dimensional difference gel electrophoresis

CV

coefficient of variation

m6A

N6-methyladenine

m4C

N4-methylcytosine

m5C

5-methylcytosine

MRM

multiple reaction monitoring

MTase

methyltransferase

SMRT sequencing

single molecule real-time sequencing

RM system

restriction-modification system

TRD

target-recognition domain

TSS

transcription start sites

WT

wild-type

ΔS.MgaS6I

knockout mutant of M. gallisepticum in HsdS S.MgaS6I (GCW_02360)

Data Availability

Sequencing data were deposited into the SRA with the accession no. SRX10666727 and link https://www.ncbi.nlm.nih.gov/sra/SRX10666727 for WT M. gallisepticum and accession no. SRX10666728 and link https://www.ncbi.nlm.nih.gov/sra/SRX10666728 for the ΔS.MgaS6I mutant. Modified DNA data was deposited in the REBASE database (http://rebase.neb.com/cgi-bin/pacbioget?11602).

Funding Statement

This work was supported by the Russian Science Foundation grant no. 19-74-10105 “The role of chromatin structure in minimal cell in the maintenance of housekeeping proteome homeostasis.” The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

Decision Letter 0

Mary Bryk

17 Aug 2022

PONE-D-22-17373Role of DNA modifications in Mycoplasma gallisepticum

PLOS ONE

Dear Dr. Semashko,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

1.  Please address comments made by the reviewer and myself.  Most comments ask for clarification of statements made in the manuscript. 2.  Reviewer 1 requests additional informatics work to address the probability of finding a methylation motif in a promoter region normally, given the GC content of the organism.  Because this information is important to support arguments made in the manuscript, this analysis should be performed and the results included in your revised manuscript. 3. Include additional evidence to support your suggestion that the RM system was acquired by horizontal gene transfer in your revised manuscript.

Please submit your revised manuscript by Oct 01 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Mary Bryk, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, all author-generated code must be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

3. Thank you for stating the following financial disclosure: 

  "This work was supported by the Russian Science Foundation grant no. 19-74-10105 “The role of chromatin structure in minimal cell in the maintenance of housekeeping proteome homeostasis.”"

Please state what role the funders took in the study.  If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript." 

If this statement is not correct you must amend it as needed. 

Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf.

4. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information. 

Additional Editor Comments:

This is a solid manuscript that is well written. I identified a few points that require clarification.

Lines 176-177. The m6A modification and DNA modification motif ANCNNNNCCT were identified only in the WT strain, and the ΔS.MgaS6I strain lacked them. I do not understand why the DNA modification motif ANCNNNNCCT was not identified in the ΔS.MgaS6I strain.

I did not understand the meaning of the statement in lines 177-178, "This indicates that the RM system common for M. gallisepticum strains is nonfunctional, unless in the S6 strain." Are you suggesting that the common RM system is functional in the presence of the unique RM system in MgaS6I?

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: In this work, the authors investigate genome methylation within Mycoplasma gallisepticum. The work itself is mostly solid (though I take issue with the “underrepresentation” part), so it’s rather unfortunate that most of the results are negative. The authors find one methylation motif and associate it with the one predicted functional RM system in the genome. They show this RM system basically does what RM systems are known to do, and the system has no other real impact on the physiology of the organism. It’s not transformative research, but I can’t blame the authors for that.

Major comments

It is my understanding that SMRT sequencing only is able to detect m6A modifications on native DNA, and special procedures have to be performed to identify m5C methylations. The authors should comment on whether they performed these procedures, otherwise we don’t know if they missed m5C modifications or not.

One of the pieces of data the authors present is the general lack of methylation sites within promoter regions. However, it’s not clear if that has occurred randomly or it has been selected for. I would have liked to have seen some informatic work to give an indication on what the probability of finding a methylation motif in a promoter region would be normally, given the GC content of the organism. Is the actual occurrence less than expected by random chance? Or is it that the genome has reduced so much and the motif itself is not simple enough where just randomly the motif doesn’t occur in those spots. Basically, I want to know if nature has selected for removal of methylation motifs from promoters because it could be disruptive. Without this analysis the authors cannot state that methylation sites are “underrepresented” in these places as they do later in the manuscript. They could only say they were not detected. We need to know what the actual expected representation would be to know if the sites are actually underrepresented.

As a side note, I appreciate that the authors did not try to over-extend their transcriptional change data. It would have been easy to present some of those transcriptional changes as significant in the figure, but they relied on the statistics and showed that most genes showed no significant change in gene expression. That’s good science.

Line 320 – the authors state the RM system was acquired by horizontal gene transfer. That’s a big statement to make. I would make sure to provide all the evidence that supports it. The only thing that comes to mind right now is that it’s present in the S6 strain and none of the others. But there are things that could be done to back up that statement.

Line 328 – honestly, I just don’t understand what the authors are talking about in this paragraph. Please clarify.

Minor comments:

Line 16-17 – poor grammar, should be “investigated DNA modifications of the model”

Line 178-179 – I’m confused by this sentence. The authors state that the RM system common to the different strains of M. gallisepticum is non-functional, which makes sense based on the mutations in this system, but then include the clause “unless in the S6 strain”. It’s this last part I don’t understand. I thought the point was that the unique system is responsible for the methylation, but that clause indicates the common system is responsible for the methylation.

Line 195 – I believe there is a typo, and it should be MgaS6I instead of MhaS6I

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 Nov 22;17(11):e0277819. doi: 10.1371/journal.pone.0277819.r002

Author response to Decision Letter 0


15 Oct 2022

Dear Editor and Reviewer!

Please accept the revised manuscript «Role of DNA modifications in Mycoplasma gallisepticum» of the following authors: Tatiana Semashko, Alexander Arzamasov, Daria Evsyutina, Irina Garanina, Daria Matyushkina, Valentina Ladygina, Olga Pobeguts, Gleb Fisunov and Vadim Govorun for consideration as a publication in PLOS ONE journal.

Thank you for your time and for your appreciation of our work. We have corrected our manuscript in accordance with all your comments and suggestions. We hope you find our manuscript suitable for publication.

Response to Editor.

1. Please address comments made by the reviewer and myself. Most comments ask for clarification of statements made in the manuscript.

2. Reviewer 1 requests additional informatics work to address the probability of finding a methylation motif in a promoter region normally, given the GC content of the organism. Because this information is important to support arguments made in the manuscript, this analysis should be performed and the results included in your revised manuscript.

We have performed the requested calculations and corrected the text of the manuscript in accordance with results.

3. Include additional evidence to support your suggestion that the RM system was acquired by horizontal gene transfer in your revised manuscript.

We included supplementary figure with comparing genomes of different strains, extended paragraph “RM systems in M. gallisepticum and their functionality” in the Results and extended the reasons of our suggestion that the RM system was acquired by horizontal gene transfer in the Discussion.

Additional Editor Comments:

Lines 176-177. The m6A modification and DNA modification motif ANCNNNNCCT were identified only in the WT strain, and the ΔS.MgaS6I strain lacked them. I do not understand why the DNA modification motif ANCNNNNCCT was not identified in the ΔS.MgaS6I strain.

I did not understand the meaning of the statement in lines 177-178, "This indicates that the RM system common for M. gallisepticum strains is nonfunctional, unless in the S6 strain." Are you suggesting that the common RM system is functional in the presence of the unique RM system in MgaS6I?

Thank you for your comments. Both these statements are the results of mistakes. We fixed them.

Response to Reviewer.

Major comments

1. It is my understanding that SMRT sequencing only is able to detect m6A modifications on native DNA, and special procedures have to be performed to identify m5C methylations. The authors should comment on whether they performed these procedures, otherwise we don’t know if they missed m5C modifications or not.

Yes, you are right. For identification of DNA modification using SMRT sequencing without additional special procedures, 25x sequencing fold coverage is required for the identification of 4-methylcytosine, glucosylated 5-hydroxymethylcytosine, 6-methyladenine and 8-oxoguanine; 250x sequencing fold coverage is required for the identification of 5-methylcytosine and 5-hydroxymethylcytosine [https://www.pacb.com/wp-content/uploads/2015/09/WP_Detecting_DNA_Base_Modifications_Using_SMRT_Sequencing.pdf]. In our study the coverage of the genome per strand was approximately 90× for WT M. gallisepticum S6 and approximately 110× for the ΔS.MgaS6I strain. We didn’t find annotated m5C-methyltransferases in M. gallisepticum genome and didn’t identify m5C modifications. But if m5C-methyltransferases were present, we would have seen a not very significant signal for m5C modifications in our experiment.

2. One of the pieces of data the authors present is the general lack of methylation sites within promoter regions. However, it’s not clear if that has occurred randomly or it has been selected for. I would have liked to have seen some informatic work to give an indication on what the probability of finding a methylation motif in a promoter region would be normally, given the GC content of the organism. Is the actual occurrence less than expected by random chance? Or is it that the genome has reduced so much and the motif itself is not simple enough where just randomly the motif doesn’t occur in those spots. Basically, I want to know if nature has selected for removal of methylation motifs from promoters because it could be disruptive. Without this analysis the authors cannot state that methylation sites are “underrepresented” in these places as they do later in the manuscript. They could only say they were not detected. We need to know what the actual expected representation would be to know if the sites are actually underrepresented.

We thank the reviewer for pointing this out. In previous version of manuscript we used two-tailed Fisher’s exact test to validate the observed number of ANCNNNNCCT motif in promoter’s regions compared to occurrence it across whole genome. The promoter sequences in M. gallisepticum S6 were determined in our previous work [Fisunov GY et. al. Reconstruction of Transcription Control Networks in Mollicutes by High-Throughput Identification of Promoters. Front Microbiol. 2016 Dec 6;7:1977]. Now according your suggestion we tried to estimate motif underrepresentation in the promoter regions through ratio of observed over expected motif counts. There are several methods to estimate the expected frequency of short oligonucleotides in the genome. The simplest ones are based on the frequencies of mono- or dinucleotides in the genome. More complex methods use the maximal order Markov model or method by Samuel Karlin [Karlin S, Cardon LR. Computational DNA sequence analysis. Annu Rev Microbiol. 1994;48:619-54.], [Elhai J. Determination of bias in the relative abundance of oligonucleotides in DNA sequences. J Comput Biol. 2001;8(2):151-75.]. We calculated ratio of observed over expected motif through frequencies of each nucleotide in M. gallisepticum S6 genome and promoter regions. For whole genome, ratio was 1.29; for promoter regions - 0.2. It means there is no ANCNNNNCCT avoidance in whole genome of M. gallisepticum S6, but it probably exists in promoter regions. We also estimated enrichment ratio of ANCNNNNCCT in set of promoter sequences and in 100 sets of random generated sequences with the same length and GC content of the M. gallisepticum S6. In this case enrichment ratio was 0.49±0.11.

3. As a side note, I appreciate that the authors did not try to over-extend their transcriptional change data. It would have been easy to present some of those transcriptional changes as significant in the figure, but they relied on the statistics and showed that most genes showed no significant change in gene expression. That’s good science.

Thank you!

4. Line 320 – the authors state the RM system was acquired by horizontal gene transfer. That’s a big statement to make. I would make sure to provide all the evidence that supports it. The only thing that comes to mind right now is that it’s present in the S6 strain and none of the others. But there are things that could be done to back up that statement.

The MgaS6I system does not have close homologs in other M. gallisepticum strains. Additionally, it has a unique location in a genomic context compared to other strains. We added figure S1 with alignments of genomes of M. gallisepticum different strains. We extended of arguments for our suggestion that the RM system was acquired by horizontal gene transfer in text of the manuscript: in the paragraph “RM systems in M. gallisepticum and their functionality” in the Results and in the second paragraph of the Discussion.

5. Line 328 – honestly, I just don’t understand what the authors are talking about in this paragraph. Please clarify.

In this paragraph we discuss underrepresentation of recognition sites in whole genome of M. gallisepticum S6. Sites are not underrepresented comparing with genomes of other strains. We have rewritten this paragraph.

Minor comments:

Line 16-17 – poor grammar, should be “investigated DNA modifications of the model”

Thank you, we fixed it.

Line 178-179 – I’m confused by this sentence. The authors state that the RM system common to the different strains of M. gallisepticum is non-functional, which makes sense based on the mutations in this system, but then include the clause “unless in the S6 strain”. It’s this last part I don’t understand. I thought the point was that the unique system is responsible for the methylation, but that clause indicates the common system is responsible for the methylation.

Thank you, we fixed this typo.

Line 195 – I believe there is a typo, and it should be MgaS6I instead of MhaS6I

Thank you, we fixed it.

Best regards,

Tatiana Semashko

Address: Nauchnyj proezd 18, Moscow, Russia

Email: t.semashko@gmail.com

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 1

Mary Bryk

4 Nov 2022

Role of DNA modifications in Mycoplasma gallisepticum

PONE-D-22-17373R1

Dear Dr. Semashko,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Mary Bryk, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Thank you for addressing the major and minor comments thoughtfully. I found three minor errors that you will want to correct.

Line 216 “was” should be “were”.

Line 361 “of its own operon” should be “in its own operon”.

Line 388 “this data” should be “these data”.

Acceptance letter

Mary Bryk

14 Nov 2022

PONE-D-22-17373R1

Role of DNA modifications in Mycoplasma gallisepticum

Dear Dr. Semashko:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Mary Bryk

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Genome alignment of Mycoplasma gallisepticum strains obtained using the program Mauve.

    RM system MgaS6I is marked in orange, variable lipoproteins vlhA clusters in green, the CRISPR system in blue. M. gallisepticum S6 genome is on the top, below the genomes of the other strains. A–whole genome representation, B–fragment of genomes near MgaS6I genomic context.

    (PDF)

    S1 File. Supplementary tables S1-S8.

    (XLSX)

    Attachment

    Submitted filename: Response to Reviewers.docx

    Data Availability Statement

    Sequencing data were deposited into the SRA with the accession no. SRX10666727 and link https://www.ncbi.nlm.nih.gov/sra/SRX10666727 for WT M. gallisepticum and accession no. SRX10666728 and link https://www.ncbi.nlm.nih.gov/sra/SRX10666728 for the ΔS.MgaS6I mutant. Modified DNA data was deposited in the REBASE database (http://rebase.neb.com/cgi-bin/pacbioget?11602).


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES