Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2024 Mar 6;14:5522. doi: 10.1038/s41598-024-55849-1

Genome-wide identification of WRKY transcription factor family members in Miscanthus sinensis (Miscanthus sinensis Anderss)

Yongkang Yan 1,, Zhanyou Yan 2, Guofang Zhao 3
PMCID: PMC10918066  PMID: 38448638

Abstract

Miscanthus is an emerging sustainable bioenergy crop whose growing environment is subject to many abiotic and biological stresses. WRKY transcription factors play an important role in stress response and growth of biotic and abiotic. To clarify the distribution and expression of the WRKY genes in Miscanthus, it is necessary to classify and phylogenetically analyze the WRKY genes in Miscanthus. The v7.1 genome assembly of Miscanthus was analyzed by constructing an evolutionary tree. In Miscanthus, there are 179 WRKY genes were identified. The 179 MsWRKYs were classified into three groups with conserved gene structure and motif composition. The tissue expression profile of the WRKY genes showed that MsWRKY genes played an essential role in all growth stages of plants. At the early stage of plant development, the MsWRKY gene is mainly expressed in the rhizome of plants. In the middle stage, it is mainly expressed in the leaf. At the end stage, mainly in the stem. According to the results, it showed significant differences in the expression of the MsWRKY in different stages of Miscanthus sinensis. The results of the study contribute to a better understanding of the role of the MsWRKY gene in the growth and development of Miscanthus.

Keywords: Miscanthus sinensis, WRKY, Phylogenetic analysis, Biotic stress, Expression profiling

Subject terms: Molecular biology, Plant sciences

Introduction

WRKY transcription factors (TFs) were widely distributed in plants, which were first discovered in sweet potato (Ipomoea batatas)1. With genome-wide analyses of different species, WRKY genes have been identified in more species, that included 66 WRKY genes in Arabidopsis2, 119 WRKY genes in maize3, 94 WRKY genes in sorghum4, 79 WRKY genes in potato5, 70 WRKY genes in chickpea6, and 61 WRKY genes in cucumber7. The WRKY protein contains a conserved WRKYGQK motif at its N-terminal and a 60-amino acid-long zinc finger motif at its C-terminus8. Zinc finger motifs can be classified as C2H2 or C2HC. WRKY proteins can be classified into three categories (I, II, III) based on the number of WRKY domains and the type of zinc finger motif9. Group I members have two WRKY domains and zinc fingers of type C2H2. Group II members have only one WRKY domain and one C2H2 zinc finger motif, and Group III members have one WRKY domain and one C2HC zinc finger. Group II can be further subdivided into five subgroups: IIa, IIb, IIc, IId, and IIe10,11.

Studies have shown that WRKY TFs play an important regulatory role in plant growth and development12. For example, AtWRKY12 and AtWRKY13 can exert important regulation on the flowering process of Arabidopsis13. The over-expression of GsWRKY20 from Glycine soja in Arabidopsis also altered the plant's flowering process14. However, OsWRKY11 in rice can control flowering time and plant height15. These depend on WRKY genes being expressed at different times in different parts of the plant. Similarly, the expression of these genes also affects various aspects of plants, such as hormone regulation and stress resistance16.

For example, overexpression of OsWRKY45 in rice enhances disease resistance and drought resistance17. The AtWRKY25 and AtWRKY33 in Arabidopsis thaliana enhance salt tolerance18. Some AcWRKY TFs of kiwifruit were up-regulated under salt stress19. Considering the important role of WRKY TFs in plant growth and development, the study of these genes is very important for agricultural production. The overexpression of TaWRKY2 in wheat enhanced the tolerance to drought stress and increased the yield20. Vitis amurensis VaWRKY12 gene enhanced the cold tolerance of transgenic grape callus21.

Most studies have focused on the genes of annual grasses, but few studies have focused on the genes of perennial grasses. Miscanthus is a perennial grass that has historically been cultivated as a papermaking material and as an ornamental plant. In recent years, Miscanthus has played a role in the direction of ecological restoration and sustainable bioenergy crops22. Therefore, the study has great significance for the genetic improvement of Miscanthus. The improvements can increase productivity and ensure that the crops remain robust despite persistent biological and abiotic stresses. As an essential stress resistance gene, WRKY TFs have high research value. The availability of the whole genome assembly of Miscanthus sinensis enabled identifying MsWRKY. The identification and study of MsWRKY are helpful in understanding the mechanism of plant stress resistance. The genomes of at least four Miscanthus species (M. sacchariflorus, M. sinensis, M. lutarioriparius, and M. floridulus) have been sequenced. M. sinensis was used as the object of this study. M. sacchariflorus has strong stress resistance and large biomass. However, the growth cycle of the plant is too long for commercial biomass production. M. lutarioriparius and M. floridulus have a narrow distribution range and are only found in parts of East Asia. Their biomass yield is low, and the fiber content is too high. Therefore, they are not suitable for the bioenergy development. The function of the Miscanthus WRKY family has been identified and characterized by various methods.

Material and methods

Identification of WRKY family genes in Miscanthus

The genomic data for Miscanthus were obtained from Phytozome 13 (https://phytozome-next.jgi.doe.gov/). From these data, the putative MsWRKY genes can be identified. The database contains the amino acid sequences of Miscanthus WRKY proteins. The Joint Genome Institute (JGI) (https://phytozome.jgi.doe.gov/pz/portal.html#) and the WRKY domain ID were used to identify the potential WRKY (PF03106) proteins of Miscanthus. The data used was Miscanthus sinensis v7.1. To ensure the quality of the data, the CD-HIT suite (https://github.com/weizhongli/cdhit) and Simple Modular Architecture the Research Tool (SMART) (http://smart.embl-heidelberg.de/#) were used to process the resulting sequence23. The duplicate sequences and incomplete sequences were removed.

At the same time, the ExPASy proteomic server (http://web.expasy.org/protparam) was used to predict the physical and chemical properties of the proteins. Their isoelectric point (pI) and molecular weight (MW) were obtained.

Chromosome mapping and classification and phylogenetic analysis of MsWRKY genes

The chromosomal locations of all identified MsWRKY genes were obtained from the Phytozome BioMart tool (https://phytozome.jgi.doe.gov/biomart/martview/). And the MG2C v2.1 (mg2c.iask.in/mg2c_v2.1/) was used for Chromosome mapping of MsWRKY genes.

In constructing a phylogenetic tree to classify MsWRKY genes, we need to use Sorghum bicolor WRKY amino acid sequences. The data were obtained from The Arabidopsis Information Resource (TAIR) (https://www.arabidopsis.org/). They will be used with our MsWRKY sequences. The MEGA v7.0 (https://www.megasoftware.net/) for constructing a phylogenetic tree used multiple sequence alignments with ClustalW to process SbWRKY and MsWRKY protein sequences. The Neighbor-Joining method and the p-distance model were used in the process, and the pairwise deletion and 1000 bootstrap replicates were selected24. Eventually, the phylogenetic tree of SbWRKY and MsWRKY sequences was obtained. Thus, the unknown MsWRKY genes can be divided into different groups and subgroups. By using the sequence alignment data and the phylogenetic tree, the putative Miscanthus WRKY orthologs in Arabidopsis can be identified5.

Gene structure analysis and conserved motif distribution analysis of MsWRKY genes

The genomic sequence and coding sequence (CDS) of each MsWRKY gene can be used to predict the gene structure of the MsWRKY gene. The exon–intron structure of MsWRKYs was analyzed using TBtools25.

The Multiple Em for Motif Elicitation (MEME) (v.5.5.3; https://meme-suite.org/meme/tools/meme) used the parameters which are: maximum motif number: 20; site distributions: any number of repetitions; minimum and maximum width: 6 and 50, respectively to distinguish MsWRKY proteins with conserved motif.26.

Gene ontology annotation and analysis of cis-acting elements of MsWRKY genes

For the gene ontology (GO) annotation analysis of the obtained MsWRKY proteins, the eggNOG-mapper 2.1.12 (http://eggnog-mapper.embl.de/)27 was used. Then, the TBtools was used to map and annotate the obtained data. Ultimately, the biological processes, molecular functions and cellular components of these proteins were obtained.

The online website PlantCARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) analyzed 2000 bp of the upstream region for all MsWRKY genes to analyze the cis-acting elements of MsWRKY genes. It provided the cis-acting elements of MsWRKY genes.

Synteny analysis of MsWRKY genes

The Multiple Collinearity Scan toolkit (MCScanX) was used to examine the gene duplication events with the default parameters. The TBtools was the platform used to analyze the data. The Evalue of the blastP was 15. To explore the syntenic relationships of the WRKY genes obtained from Miscanthus and other selected species, syntenic analysis maps were constructed using MCScanX.

Digital expression pattern analysis of MsWRKY genes

The TBtools analyzed the Miscanthus transcriptomic array data to make heatmaps of the MsWRKY expression profiles to survey MsWRKY expression profiles. In the meantime, the Miscanthus transcriptomic array data was obtained from the JGI database (https://phytozome-next.jgi.doe.gov/).

Results

Identification of WRKY family members in Miscanthus

To identify members of the MsWRKY family, the WRKY domain consensus sequence (PF03106) and the keyword WRKY were used to search in the database. The most complete genome assembly for Miscanthus (Miscanthus sinensis v7.1) in the JGI database was selected. In the MsWRKY family, a conserved domain exists as the basic criterion for inclusion of genes. Using the conserved domain called the WRKYGQK or WRKYGQK-like conserved domain, 203 genes were identified in the JGI database. In these genes, the duplicates and incomplete were removed by the multiple sequence alignments of MEGA 7.0. At the same time, we removed the severely incomplete gene, identified the location of WRKY domain, and kept some of the incomplete genes because they clearly belonged to the MsWRKY family. It was done using the SMART database. In the end, there are a total of 179 non-redundant MsWRKY sequences. The sequences and proteins of these genes are summarized in Table 1.

Table 1.

Characteristics of the identified MsWRKY genes.

Gene name Gene locus ID Chromosome location Gene start Gene end pI MW Conserved heptapeptide Zinc finger type Domain number Group Protein length (aa)
MsWRKY01 Misin01G001000.1 Chr01 256,399 260,899 5.96 42,019.13 WRKYGQK C2H2 1 IIe 388
MsWRKY02 Misin01G047900.1 Chr01 7,447,384 7,449,297 9.28 45,084.5 WRKYGQK C2H2 1 IId 423
MsWRKY03 Misin01G063100.1 Chr01 9,967,017 9,972,013 6.41 34,538.04 WRKYGQK C2H2 1 IIc 331
MsWRKY04 Misin01G064300.1 Chr01 10,185,437 10,187,725 6.27 44,986.89 WRKYGQK C2H2 2 I 420
MsWRKY05 Misin01G080100.1 Chr01 13,181,193 13,184,725 10.03 43,273.5 WRKYGQK C2H2 1 IId 402
MsWRKY06 Misin01G144700.1 Chr01 27,617,080 27,617,952 9.51 11,896.32 WRKYGQK C2H2 1 IId 108
MsWRKY07 Misin01G144800.1 Chr01 27,658,926 27,661,691 8.79 14,941.48 WKKYGQK C2H2 1 IId 138
MsWRKY08 Misin01G263700.1 Chr01 84,350,262 84,352,266 6.35 47,584.95 WRKYGQK C2H2 1 IIe 441
MsWRKY09 Misin01G315300.1 Chr01 105,230,385 105,233,381 6.67 29,695.24 WRKYGQK C2H2 2 I 270
MsWRKY10 Misin01G341600.1 Chr01 112,898,915 112,899,930 10.23 18,701.01 WRKYGQK C2 1 IId 179
MsWRKY11 Misin01G362700.1 Chr01 118,028,855 118,030,800 5.97 33,081.46 WRKYGQK C2HC 1 III 319
MsWRKY12 Misin01G370500.1 Chr01 119,779,719 119,781,349 8.71 23,777.31 WRKYGEK C2HC 1 III 219
MsWRKY13 Misin02G032200.1 Chr02 4,903,340 4,905,683 9.42 46,713.58 WRKYGQK C2H2 1 IId 439
MsWRKY14 Misin02G061600.1 Chr02 10,345,970 10,348,174 6.33 45,153.2 WRKYGQK C2H2 2 I 419
MsWRKY15 Misin02G062400.1 Chr02 10,531,789 10,536,585 8.28 36,115.94 WRKYGQK C2H2 1 IIc 342
MsWRKY16 Misin02G074100.1 Chr02 12,502,555 12,505,918 10.03 43,237.35 WRKYGQK C2H2 1 IId 403
MsWRKY17 Misin02G115600.1 Chr02 21,231,516 21,233,686 8.81 15,261.74 WKKYGQK C2H2 1 IId 142
MsWRKY18 Misin02G115700.1 Chr02 21,277,852 21,278,769 9.3 17,306.11 WRKYGQK C2H2 1 IId 161
MsWRKY19 Misin02G129800.1 Chr02 24,888,414 24,891,276 5.13 29,550.9 WRKYGQK C2HC 1 III 275
MsWRKY20 Misin02G139500.1 Chr02 27,977,090 27,980,493 9.78 37,940.88 WRKYGQK C2H2 1 IId 351
MsWRKY21 Misin02G258000.1 Chr02 82,777,287 82,779,248 6.21 51,394.67 WRKYGQK C2H2 1 IIe 477
MsWRKY22 Misin02G303500.1 Chr02 98,992,567 99,008,454 7.04 51,566.58 WRKYGQK C2H2 2 I 487
MsWRKY23 Misin02G345000.1 Chr02 109,902,400 109,904,305 5.39 32,934.03 WRKYGQK C2HC 1 III 316
MsWRKY24 Misin02G360900.1 Chr02 113,665,512 113,666,881 8.5 23,918.39 WRKYGEK C2HC 1 III 221
MsWRKY25 Misin03G030100.1 Chr03 7,315,277 7,319,424 5.91 40,003.58 WRKYGQK C2H2 1 IIc 385
MsWRKY26 Misin03G090400.1 Chr03 26,714,328 26,724,781 6.53 65,823.66 WKIYHEK C2H2 1 III 577
MsWRKY27 Misin03G145800.1 Chr03 58,356,338 58,358,329 5.9 34,210.05 WRKYGQK C2HC 1 III 323
MsWRKY28 Misin03G145900.1 Chr03 58,463,020 58,464,851 5.15 32,889.18 WRKYGQK C2HC 1 III 308
MsWRKY29 Misin03G167600.1 Chr03 67,670,898 67,672,930 7.5 34,390.44 WRKYGQK C2H2 1 IIa 322
MsWRKY30 Misin03G167700.1 Chr03 67,713,031 67,714,003 5.67 27,360.52 WRKYGQK C2H2 1 IIa 255
MsWRKY31 Misin03G309200.1 Chr03 99,305,845 99,312,404 6.22 65,765.89 WRKYGQK C2H2 2 I 612
MsWRKY32 Misin03G348200.1 Chr03 105,876,921 105,879,425 6.52 34,512.02 WRKYGQK C2HC 1 III 334
MsWRKY33 Misin04G012600.1 Chr04 2,811,844 2,816,422 5.71 40,011.36 WRKYGQK C2H2 1 IIc 388
MsWRKY34 Misin04G103000.1 Chr04 28,871,792 28,881,760 6.64 180,043.9 WEKFGEK C2H2 1 III 1620
MsWRKY35 Misin04G121800.1 Chr04 37,165,681 37,167,742 8.1 32,376.41 WRKYGQK C2HC 1 III 310
MsWRKY36 Misin04G159500.1 Chr04 59,513,396 59,515,552 5.8 33,477.21 WRKYGQK C2HC 1 III 314
MsWRKY37 Misin04G159600.1 Chr04 59,590,103 59,591,840 6.27 35,403.26 WRKYGQK C2HC 1 III 333
MsWRKY38 Misin04G159700.1 Chr04 59,628,053 59,630,374 6.41 32,262.04 WRKYGQK C2HC 1 III 300
MsWRKY39 Misin04G189100.1 Chr04 70,520,338 70,521,730 8.61 24,998.4 WRKYGQK C2H2 1 IIa 232
MsWRKY40 Misin04G189300.1 Chr04 70,613,566 70,615,037 6.45 29,149.56 WSKYGQK C2H2 1 IIa 272
MsWRKY41 Misin04G224400.1 Chr04 80,781,032 80,788,531 6.07 62,448.14 WRKYGQK C2H2 2 I 584
MsWRKY42 Misin04G335900.1 Chr04 103,115,716 103,123,318 6.2 65,841.98 WRKYGQK C2H2 2 I 613
MsWRKY43 Misin04G395900.1 Chr04 113,184,557 113,186,904 9.49 28,237.28 WRKYGQK C2HC 1 III 266
MsWRKY44 Misin05G004000.1 Chr05 1,062,340 1,065,068 6.95 40,831.11 –––- C2H2 0 IIb 398
MsWRKY45 Misin05G004600.1 Chr05 1,127,680 1,130,708 9.02 61,031.03 WRKYGQK C2H2 1 IIb 586
MsWRKY46 Misin05G039300.1 Chr05 8,714,579 8,716,081 6.51 25,862.72 WRKYGKK C2H2 1 IIc 247
MsWRKY47 Misin05G039400.1 Chr05 8,724,881 8,727,718 6.74 54,286.43 WRKYGQK C2H2 1 IIb 529
MsWRKY48 Misin05G043000.1 Chr05 9,436,264 9,443,373 9.66 29,836.26 WRKYGQK C2H2 1 IIc 282
MsWRKY49 Misin05G044100.1 Chr05 9,661,358 9,662,378 10.01 25,167.29 WRKYGQK –– 1 IIc 243
MsWRKY50 Misin05G133400.1 Chr05 33,763,626 33,768,855 6.92 58,669.54 WRKYGQK C2H2 1 IIb 565
MsWRKY51 Misin05G179500.1 Chr05 61,537,333 61,538,346 5.02 23,472.75 WRK–– C2H2 1 IIc 218
MsWRKY52 Misin05G181000.1 Chr05 62,453,032 62,454,436 9.45 34,432.88 WRKYGQK C2H2 1 IIb 334
MsWRKY53 Misin05G204000.1 Chr05 71,047,834 71,051,020 6.39 38,748.61 WRKYGQK C2H2 1 IIc 358
MsWRKY54 Misin05G204800.1 Chr05 71,261,560 71,263,661 8.16 42,492.78 WRKYGQK C2H2 1 IIe 399
MsWRKY55 Misin05G223600.1 Chr05 76,270,790 76,274,606 7.75 40,925.8 WRKYGQK C2H2 1 IIc 392
MsWRKY56 Misin05G245300.1 Chr05 82,376,407 82,381,492 6.65 27,467.87 WRKYGQK C2H2 1 IIc 257
MsWRKY57 Misin05G247500.1 Chr05 82,585,497 82,587,483 5.09 31,767.99 WRKYGQK C2H2 1 IId 309
MsWRKY58 Misin05G257300.1 Chr05 84,706,704 84,708,239 8.87 23,279.05 WRKYGKK C2H2 1 IIc 217
MsWRKY59 Misin05G266000.1 Chr05 87,076,200 87,078,166 4.84 33,807.07 WRKYGQK C2H2 1 IIe 311
MsWRKY60 Misin05G314300.1 Chr05 97,084,382 97,087,219 6.07 58,493.75 WRKYGQK C2H2 2 I 547
MsWRKY61 Misin05G318500.1 Chr05 97,853,542 97,855,451 6.13 28,871.07 WRKYGQK C2HC 1 III 273
MsWRKY62 Misin05G318700.1 Chr05 97,894,342 97,897,352 5.55 35,408.38 WRKYGQK C2HC 1 III 321
MsWRKY63 Misin05G318800.1 Chr05 97,904,621 97,907,196 5.88 39,828.44 WRKYGQK C2HC 1 III 365
MsWRKY64 Misin05G318900.1 Chr05 97,921,736 97,926,790 5.56 29,563.32 WRKYGQK C2HC 1 III 263
MsWRKY65 Misin05G319000.1 Chr05 97,947,986 97,954,617 5.15 24,871.51 –––- C2HC 0 III 222
MsWRKY66 Misin05G341800.1 Chr05 102,614,804 102,620,037 6.21 37,871.09 WRKYGQK C2H2 1 IId 360
MsWRKY67 Misin06G033600.1 Chr06 8,287,503 8,288,890 6.88 25,986.1 WRKYGKK C2H2 1 IIc 248
MsWRKY68 Misin06G033700.1 Chr06 8,316,137 8,318,460 6.6 54,945.21 WRKYGQK C2H2 1 IIb 535
MsWRKY69 Misin06G035300.1 Chr06 8,827,872 8,837,074 9.84 30,409.98 WRKYGQK C2H2 1 IIc 288
MsWRKY70 Misin06G118100.1 Chr06 32,022,401 32,028,104 6.68 58,207.58 WRKYGQK C2H2 1 IIb 559
MsWRKY71 Misin06G173600.1 Chr06 64,362,468 64,364,136 5.41 25,009.44 WRKYGKK C2H2 1 IIc 230
MsWRKY72 Misin06G176500.1 Chr06 65,837,433 65,838,824 9.46 34,595.22 WRKYGQK C2H2 1 IIb 330
MsWRKY73 Misin06G194000.1 Chr06 71,485,939 71,489,452 6.64 37,778.45 WRKYGQK C2H2 1 IIe 352
MsWRKY74 Misin06G201700.1 Chr06 74,689,039 74,691,538 6.65 35,420.7 WRKYGQK C2HC 1 III 341
MsWRKY75 Misin06G223300.1 Chr06 81,494,458 81,497,861 6.95 40,981.74 WRKYGQK C2H2 1 IIc 394
MsWRKY76 Misin06G257000.1 Chr06 89,646,615 89,650,205 6.29 27,463.97 WRKYGQK C2H2 1 IIc 256
MsWRKY77 Misin06G263000.1 Chr06 91,159,450 91,161,480 4.82 33,231.57 WRKYGQK C2H2 1 IIe 308
MsWRKY78 Misin06G304500.1 Chr06 98,736,612 98,740,191 5.98 26,953.02 WRKYGQK C2HC 1 III 238
MsWRKY79 Misin06G304600.1 Chr06 98,786,343 98,796,970 5.61 29,373.08 WRKYGQK C2HC 1 III 262
MsWRKY80 Misin06G304700.1 Chr06 98,801,362 98,804,326 7.08 39,644.48 WRKYGQK C2HC 1 III 365
MsWRKY81 Misin06G304800.1 Chr06 98,822,340 98,825,116 6.14 33,304.99 WRKYGQK C2HC 1 III 308
MsWRKY82 Misin06G304900.1 Chr06 98,833,287 98,834,794 5.98 29,436.72 WRKYGQK C2HC 1 III 276
MsWRKY83 Misin06G308400.1 Chr06 99,650,347 99,653,285 6.65 56,227.46 WRKYGQK C2H2 2 I 524
MsWRKY84 Misin06G319700.1 Chr06 101,544,841 101,551,366 7.72 35,540.77 WRKYGQK C2H2 1 IId 337
MsWRKY85 Misin07G063600.1 Chr07 11,867,503 11,869,447 8.81 38,782.96 WRKYGQK C2H2 1 IIa 361
MsWRKY86 Misin07G113800.1 Chr07 22,968,339 22,970,872 5.77 40,000.76 WRKYGQK C2H2 1 IIe 375
MsWRKY87 Misin07G139300.1 Chr07 29,364,149 29,365,802 9.23 11,245.87 WRKYGQK C2H2 2 I 98
MsWRKY88 Misin07G156800.1 Chr07 33,883,167 33,890,997 6.45 62,499.9 WRKYGQK C2H2 2 I 577
MsWRKY89 Misin07G221700.1 Chr07 48,363,328 48,367,783 6.18 73,536.66 WRKYGQK C2H2 2 I 680
MsWRKY90 Misin07G320900.1 Chr07 80,414,542 80,415,531 6.49 35,113.11 WRKYGQK C2HC 1 III 329
MsWRKY91 Misin07G336100.1 Chr07 97,182,907 97,186,054 9.53 32,283.89 WRKYGQK C2H2 1 IId 304
MsWRKY92 Misin07G427800.1 Chr07 133,666,880 133,670,859 8.74 26,212.78 WRKYGQK C2H2 1 IIc 237
MsWRKY93 Misin07G453800.1 Chr07 139,859,854 139,863,939 5.97 50,932.84 WRKYGQK C2H2 1 IIe 487
MsWRKY94 Misin07G454800.1 Chr07 140,141,540 140,145,937 5.74 50,614.47 WRKYGQK C2H2 1 IIe 487
MsWRKY95 Misin07G514500.1 Chr07 153,014,613 153,017,365 5.33 60,875.02 WRKYGQK C2H2 1 IIb 575
MsWRKY96 Misin08G064100.1 Chr08 13,397,841 13,399,660 8.81 38,153.22 WRKYGQK C2H2 1 IIa 357
MsWRKY97 Misin08G113400.1 Chr08 31,140,937 31,143,013 5.6 39,155.69 WRKYGQK C2H2 1 IIe 368
MsWRKY98 Misin08G130600.1 Chr08 40,498,773 40,500,342 9.46 32,546.22 WRKYGQK C2H2 1 IId 305
MsWRKY99 Misin08G223600.1 Chr08 74,637,099 74,641,364 8.47 26,138.66 WRKYGQK C2H2 1 IIc 236
MsWRKY100 Misin08G295700.1 Chr08 89,997,647 90,000,423 5.41 64,800.38 WRKYGQK C2H2 1 IIb 604
MsWRKY101 Misin09G094400.1 Chr09 31,825,786 31,831,560 6.09 142,969.7 WRKYGQK C2H2 1 III 1269
MsWRKY102 Misin09G124800.1 Chr09 48,969,562 48,973,277 9.32 23,326.47 WRKYGQK C2H2 1 IIc 212
MsWRKY103 Misin09G168300.1 Chr09 69,478,619 69,480,158 5.16 35,435.27 WRKYGEK C2HC 1 III 315
MsWRKY104 Misin09G168400.1 Chr09 69,516,272 69,517,802 5.23 35,384.37 CRKYGEK C2HC 1 III 315
MsWRKY105 Misin10G075100.1 Chr10 22,664,223 22,667,110 7.58 68,606.04 WRKYGR- C2HC 1 III 602
MsWRKY106 Misin10G075200.1 Chr10 22,667,111 22,669,574 11.79 21,954.27 –––- C2 0 III 197
MsWRKY107 Misin10G090500.1 Chr10 28,806,465 28,809,996 9.46 23,366.54 WRKYGQK C2H2 1 IIc 211
MsWRKY108 Misin10G147700.1 Chr10 57,752,929 57,755,316 9.09 45,494.27 WNKYSQK C2HC 1 III 402
MsWRKY109 Misin10G168900.1 Chr10 64,290,186 64,291,933 8.31 28,207.55 WRKYGEK C2HC 1 III 251
MsWRKY110 Misin11G033500.1 Chr11 21,369,493 21,371,062 9.69 31,121.11 WRKYGQK C2H2 1 IId 290
MsWRKY111 Misin11G109800.1 Chr11 50,960,040 50,966,140 6.99 76,641.46 WRKYGQK C2H2 2 I 722
MsWRKY112 Misin11G161900.1 Chr11 61,907,380 61,911,291 8.87 26,425.98 WRKYGQK C2H2 1 IIc 241
MsWRKY113 Misin11G172000.1 Chr11 63,970,651 63,972,274 10.11 32,836.5 WRKYGQK C2H2 1 IId 311
MsWRKY114 Misin11G177600.1 Chr11 64,766,776 64,770,389 5.42 51,673.75 WRKYGQK C2H2 1 IIe 490
MsWRKY115 Misin12G114700.1 Chr12 53,371,560 53,377,084 6.71 76,479.3 WRKYGQK C2H2 2 I 721
MsWRKY116 Misin12G168300.1 Chr12 64,849,948 64,854,472 8.58 26,693.31 WRKYGQK C2H2 1 IIc 246
MsWRKY117 Misin12G221100.1 Chr12 75,273,661 75,277,569 5.4 51,312.26 WRKYGQK C2H2 1 IIe 490
MsWRKY118 Misin12G224200.1 Chr12 75,983,746 75,985,597 10.12 33,577.23 WRKYGQK C2H2 1 IId 320
MsWRKY119 Misin13G053300.1 Chr13 15,284,932 15,287,835 9.19 23,715.16 WRKYGQK C2H2 2 I 214
MsWRKY120 Misin13G053800.1 Chr13 15,379,156 15,382,102 8.1 24,999.43 WRKYGQK C2H2 2 I 225
MsWRKY121 Misin13G065100.1 Chr13 19,010,992 19,011,900 9.97 31,219.41 WRKYGQK C2H2 1 IId 302
MsWRKY122 Misin13G077200.1 Chr13 26,514,539 26,521,340 6.14 61,765.12 WRKYGQK C2H2 2 I 568
MsWRKY123 Misin14G000600.1 Chr14 88,281 90,014 8.39 20,741.62 WRKYGQK C2H2 2 I 187
MsWRKY124 Misin14G044100.1 Chr14 10,368,810 10,372,904 9.05 25,295.99 WRKYGEK C2HC 1 III 225
MsWRKY125 Misin14G044300.1 Chr14 10,410,602 10,412,653 6.45 30,496.32 WRKYGQK C2HC 1 III 269
MsWRKY126 Misin14G044400.1 Chr14 10,415,330 10,420,545 9.23 22,949.89 WRKYGQK C2 1 III 197
MsWRKY127 Misin14G044700.1 Chr14 10,486,916 10,488,031 5.53 30,704.25 WRKYGQK C2HC 1 III 271
MsWRKY128 Misin14G044900.1 Chr14 10,550,794 10,554,423 5.75 32,763.68 WRKYGQK C2HC 1 III 316
MsWRKY129 Misin14G045000.1 Chr14 10,578,622 10,579,241 8.84 15,262.13 WRKYGQK –– 1 III 146
MsWRKY130 Misin14G045100.1 Chr14 10,615,079 10,617,387 5.35 39,100.04 WRKYGQK C2HC 1 III 369
MsWRKY131 Misin14G055800.1 Chr14 13,292,467 13,294,425 5.81 33,108.79 WRKYGQK C2HC 1 III 311
MsWRKY132 Misin14G094800.1 Chr14 36,988,970 36,997,185 7.26 51,967.19 WRKYGQK C2H2 2 I 489
MsWRKY133 Misin14G145800.1 Chr14 53,975,422 53,978,412 10.11 39,116.59 WRKYGQK C2H2 1 IId 367
MsWRKY134 Misin15G021700.1 Chr15 5,345,780 5,348,381 6.01 37,525.59 WRKYGQK C2HC 1 III 354
MsWRKY135 Misin15G022300.1 Chr15 5,508,198 5,510,603 5.95 34,408.46 WRKYGQK C2HC 1 III 301
MsWRKY136 Misin15G022400.1 Chr15 5,519,610 5,521,171 6.41 30,314.18 WRKYGQK C2HC 1 III 269
MsWRKY137 Misin15G022600.1 Chr15 5,554,241 5,557,091 9.24 25,266.01 RRKYGEK C2HC 1 III 224
MsWRKY138 Misin15G022700.1 Chr15 5,568,564 5,569,944 6.24 36,949.21 WRKYGEK C2HC 1 III 349
MsWRKY139 Misin15G059000.1 Chr15 14,509,029 14,511,457 5.86 34,045.56 WRKYGQK C2HC 1 III 321
MsWRKY140 Misin15G117000.1 Chr15 44,938,827 44,944,840 7.26 52,373.63 WRKYGQK C2H2 2 I 494
MsWRKY141 Misin15G165100.1 Chr15 61,759,600 61,762,535 10.06 39,406.88 WRKYGQK C2H2 1 IId 369
MsWRKY142 Misin15G207300.1 Chr15 72,991,988 72,993,980 5.68 33,773.54 WRKYGQK C2HC 1 III 318
MsWRKY143 Misin16G028000.1 Chr16 6,831,166 6,841,544 6.1 42,064.15 WRKYGQK C2H2 1 IIe 390
MsWRKY144 Misin16G048900.1 Chr16 10,992,577 10,999,877 6.07 55,955.5 WRKYGQK C2H2 1 IIb 539
MsWRKY145 Misin16G081700.1 Chr16 22,116,722 22,118,179 8.38 19,270.16 WRKYGKK C2H2 1 IIc 178
MsWRKY146 Misin16G101800.1 Chr16 37,889,291 37,892,295 6.76 44,475.77 WRKYGQK C2HC 1 III 415
MsWRKY147 Misin16G105500.1 Chr16 40,038,943 40,041,203 8.11 52,702.08 WRKYGQK C2H2 2 I 498
MsWRKY148 Misin16G172200.1 Chr16 63,685,681 63,688,656 6.24 59,899.74 WRKYGQK C2H2 2 I 567
MsWRKY149 Misin16G174600.1 Chr16 64,449,520 64,453,297 9.67 16,747.88 WRKYGEK –– 1 III 149
MsWRKY150 Misin16G174700.1 Chr16 64,489,809 64,491,209 4.8 33,581.52 WRKYGQK C2HC 1 III 305
MsWRKY151 Misin16G203800.1 Chr16 69,942,927 69,944,332 9.25 25,824.25 WRKYGQK C2H2 1 IIc 245
MsWRKY152 Misin16G214500.1 Chr16 71,981,919 71,983,740 6.51 22,679.88 WRKYGKK C2H2 1 IIc 220
MsWRKY153 Misin16G230900.1 Chr16 74,460,403 74,463,496 6.14 45,216.03 WRKYGQK C2H2 1 IIc 427
MsWRKY154 Misin16G238000.1 Chr16 75,949,728 75,952,344 6.47 55,813.4 WRKYGQK C2H2 1 IIb 533
MsWRKY155 Misin16G240400.1 Chr16 76,369,804 76,371,283 5.64 31,057.52 WRKYGQK C2HC 1 III 292
MsWRKY156 Misin16G248100.1 Chr16 78,361,013 78,362,627 7.59 40,882.18 WRKYGQK C2H2 1 IIe 383
MsWRKY157 Misin17G038700.1 Chr17 10,631,690 10,636,844 6.08 55,291.53 WRKYGQK C2H2 1 IIb 532
MsWRKY158 Misin17G083500.1 Chr17 25,376,889 25,385,898 6.96 21,980.17 WRKYGKK C2H2 1 IIc 204
MsWRKY159 Misin17G109300.1 Chr17 38,677,661 38,679,975 6.87 53,141.5 WRKYGQK C2H2 2 I 507
MsWRKY160 Misin17G113800.1 Chr17 41,268,683 41,271,638 6.99 44,427.78 WRKYGQK C2HC 1 III 414
MsWRKY161 Misin17G171600.1 Chr17 64,424,660 64,428,034 6.08 21,848.34 WTKYGEK C2HC 1 III 191
MsWRKY162 Misin17G174100.1 Chr17 65,394,933 65,397,741 6.65 60,054.88 WRKYGQK C2H2 2 I 563
MsWRKY163 Misin17G209900.1 Chr17 73,211,242 73,212,749 9.15 24,778.25 WRKYGQK C2H2 1 IIc 234
MsWRKY164 Misin17G216300.1 Chr17 74,457,742 74,459,617 6.2 22,606.83 WRKYGKK C2H2 1 IIc 218
MsWRKY165 Misin17G243200.1 Chr17 79,583,903 79,586,980 6.37 43,647.54 WRKYGQK C2H2 1 IIc 414
MsWRKY166 Misin17G243900.1 Chr17 79,818,715 79,821,649 6.82 61,955.37 WRKYGQK C2H2 1 IIb 589
MsWRKY167 Misin17G248100.1 Chr17 80,589,893 80,591,407 5.8 31,233.78 WRKYGQK C2HC 1 III 294
MsWRKY168 Misin17G257600.1 Chr17 82,409,895 82,413,771 5.06 35,515.91 WRKYGQK C2H2 1 IIc 336
MsWRKY169 Misin17G258100.1 Chr17 82,719,255 82,721,075 7.6 32,985.94 WRKYGQK C2H2 1 IIe 314
MsWRKY170 Misin18G036700.1 Chr18 8,281,538 8,287,821 8.86 67,058.03 WRKYGQK C2H2 1 IIb 638
MsWRKY171 Misin18G048700.1 Chr18 10,206,417 10,210,611 5.82 41,183.59 WRKYGQK C2HC 1 III 385
MsWRKY172 Misin18G150100.1 Chr18 48,766,642 48,768,157 8.32 35,201.51 WRKYGQK C2H2 1 IIe 321
MsWRKY173 Misin18G152000.1 Chr18 50,380,536 50,382,557 5.18 60,818.16 WRKYGEK C2H2 1 IIe 569
MsWRKY174 Misin18G207000.1 Chr18 69,577,536 69,579,342 9.29 44,127.96 WRKYGQK C2H2 1 IIa 411
MsWRKY175 Misin19G033600.1 Chr19 7,916,396 7,933,365 8.04 65,369.63 WRKYGQK C2H2 1 IIb 624
MsWRKY176 Misin19G046600.1 Chr19 10,576,130 10,580,248 6.01 39,605.9 WRKYGQK C2HC 1 III 371
MsWRKY177 Misin19G148800.1 Chr19 49,190,343 49,191,582 7.58 38,894.29 WRKYGQK C2H2 1 IIe 359
MsWRKY178 Misin19G148900.1 Chr19 49,254,920 49,256,188 7.99 40,336.02 WRKYGQK C2H2 1 IIe 377
MsWRKY179 Misin19G200400.1 Chr19 69,378,305 69,380,812 9.55 36,897.55 WRKYGQK C2H2 1 IIa 343

These MsWRKYs have been named from MsWRKY01 to MsWRKY179 according to their distribution on the chromosome. The starting point was the upper arm of chromosome 1, moving down to the lower arm (Table 1). The properties of this set of MsWRKY proteins were investigated. The MsWRKY proteins ranged from 98 to 1620 amino acids. Their average length was 368 amino acids. Their predicted MW and pI values ranged from 11,246 to 180,044 and 4.8 to 11.79, respectively.

Chromosome mapping and classification and phylogenetic analysis of MsWRKY genes

The locations of the 179 MsWRKY genes were determined by MG2C v2.1 (Fig. 1). MsWRKY genes were distributed on all 19 Miscanthus chromosomes (Chr). Chr01-Chr19 was the chromosome (Chr) number indicating the names and positions of the MsWRKY. Chr 5 had the highest number of MsWRKYs, with 23, representing 12.9% of the total gene family. It was followed by 18 genes on Chr 6, 14 on Chr 16, and 13 on Chr 17. Chromosomes 9, 12, and 13 had only four MsWRKYs each, the fewest.

Figure 1.

Figure 1

Distribution of 179 MsWRKY genes on Miscanthus chromosomes.

An unrooted phylogenetic tree to study the evolution of MsWRKY family members was constructed by the multiple sequence alignment and neighbor-joining method in MEGA7.0. The data was full-length protein sequences of 40 SbWRKYs and 179 MsWRKYs. SbWRKYs are used as the basis for grouping. They were from Sorghum bicolor (L.) moench, a Poaceae plant that is similar to Miscanthus sinensis.

179 MsWRKYs could be divided into three major groups (I, II, and III) according to the constructed phylogenetic tree (Fig. 2). Of the 24 MsWRKYs in group I, all of them had two WRKYGQK motifs and 23 of them have two C2H2-type zinc finger motifs (C-X3-4-C-X22-23-H-X1-H), corresponding to two complete WRKY domains. Although the protein encoded by MsWRKY09 had only one zinc finger motif, it belonged to group I on the phylogenetic tree.

Figure 2.

Figure 2

Phylogenetic tree of WRKY members in Miscanthus and Sorghum. All MsWRKYs genes were further divided into subgroups I, II, and III, and group II was further divided into subgroups IIa, IIb, IIc, IId, and IIe.

Group II had a total of 97 protein sequences and was the largest group, accounting for 54.2% of all putative MsWRKYs. Similar reports can also be found in sorghum, cucumber and chickpeas. Most of these proteins had one WRKY domain and the C2H2-type zinc finger motif (C-X4-5-C-X23-H-X1-H). This group was further divided into five subgroups, IIa, IIb, IIc, IId, and IIe, with 8, 16, 32, 21, and 20 members, respectively. Fifty-eight proteins belong to Group III. The proteins in this group had one WRKY domain and the C2HC-type zinc finger motif (C-X7-C-X23-H-X1-C)28. In summary, the classification of MsWRKYs indicated the diversity of these proteins. An extremely wide range of functions could be performed by these proteins.

Gene structure analysis and conserved motif distribution analysis of MsWRKY genes

The exon–intron structures of MsWRKY family members can illustrate the evolution of MsWRKY family members. The introns number of MsWRKY genes ranged from zero to five, while their size varied. The data showed that genes within the same group had certain similarities in the exon–intron distribution patterns. These results suggest that the MsWRKY genes had important structural diversity. It represented the functional diversity among closely related members of MsWRKYs (Fig. 3).

Figure 3.

Figure 3

Exon–intron structures of MsWRKY genes. Exon–intron structures of MsWRKY genes were obtained after analysed with TBtools for gene structure. Green bars indicate upstream and downstream UTRs, yellow bars indicate coding sequences (CDS), and black lines indicate introns in the gene diagrams.

MEME (version 5.5.3), used to analyze the conserved motifs of all MsWRKY protein sequences, identified 20 distinct conserved motifs. The distribution of 20 conserved motifs identified by MEME in the different groups of MsWRKYs is shown in Fig. 4. Motifs 1 is the WRKY domain. Similar Motif structures can be found between the genes in the same group or subgroup through the results. Motifs 1, 2, 3, and 4 are found in almost all genes. Motifs 15 and 19 were unique to group I. Motifs 9 and 13 were unique to group IIb. Motifs 12 were unique to group IIe. Motifs 10, 16, and 18 were unique to group III. Some of the motifs shared by different groups. Motif 5 was shared by groups I and IIc, and motif 6 and 7 were shared by groups IIa and IIb.

Figure 4.

Figure 4

Motif analysis of MsWRKYs.

Gene ontology annotation and analysis of cis-acting elements of MsWRKY genes

The Blast2GO analyzed the Gene ontology (GO) annotations of 179 MsWRKY proteins. The MsWRKY target genes can be categorized into three main categories according to different functional groups. The biological processes, molecular functions, and cellular components together make up the Gene ontology (GO) annotations. Through the enrichment analysis, the involvement of MsWRKY in biological processes, molecular functions, and cellular components is in Fig. 5. Most MsWRKYs are involved in regulating cellular processes, biosynthetic processes, and different metabolic processes. Further analysis showed that most MsWRKYs were involved in the plant's stress to external adversity. Many MsWRKYs have been linked to bacterial infections and environmental stress29. The molecular functions of MsWRKYs are mainly a variety of DNA-binding and gene expression regulation. The cellular component of this protein family is mainly organelle and intracellular organelle. Most of the MsWRKY proteins are located in the cell nucleus.

Figure 5.

Figure 5

Gene ontology analysis of identified MsWRKYs.

Cis-acting elements are essential sequences in regulating gene expression by transcription factors. An online tool PlantCARE was used to analyze all MsWRKY cis-acting elements and extracted 2000 bp promoter regions upstream of all MsWRKY genes. The result shows that every MsWRKY have many cis-acting elements.

Firstly, many transcription-related cis-acting elements can be found including TATA-box, CAAT-box, A-box, HD-Zip, and W-box. The stress-responsive elements formed an important part in the cis-acting elements. This suggests that MsWRKY plays an important role in plants' resistance to external stress. These cis-acting elements include MBS (Anti-drought stress), LTR (anti-low temperature stress), and WUN-motif (wound-responsive elements). In addition, many of the cis-acting elements were regulated by phytohormones. ABA regulates the ABA-responsive element (ABRE). Methyl jasmonate (MeJA) responsive element (TGACG-motif and CGTCA-motif) is regulated by jasmonate phytohormones30. Finally, there are auxin-responsive elements (AuxRR-core and TGA-element), salicylic acid-responsive elements (TCA-element), etc. Finally, there are many light-responsive elements and other regulatory elements. At least one stress-responsive element is on all of the MsWRKY genes and reflects the potential functional variation of the MsWRKY gene.

Synteny analysis of MsWRKY genes

The segmental duplication events occurring in the Miscanthus WRKY family were investigated by conducting a synteny analysis of the MsWRKY genes using BLASTP and MCScanX in TBtools. As shown in Fig. 6, 19 segmental duplication events involving 53 WRKY genes were observed. Tandem duplication events, which were defined by a chromosomal region within 200 kb containing two or more genes, were widely identified for Miscanthus WRKY genes. A very large tandem duplication event was observed in the chromosome 14. These results suggested that some MsWRKYs were possibly generated by segmental duplication events and that the evolution of MsWRKY genes may have been driven, at least in part, by segmental duplication events.

Figure 6.

Figure 6

Schematic representations for the interchromosomal relationships of MsWRKYs. Blue lines show duplicated WRKY gene pairs in the Miscanthus genome.

The phylogenetic mechanisms of the Miscanthus WRKY family were further explored by constructing comparative syntenic maps of cucumber associated with four representative species, including two dicots (Arabidopsis and cucumber) and two monocots (sorghum and maize) (Fig. 7). 249, 231, 28, and 27 pairs of genes showed syntenic relationships between the other four species: cucumber, Arabidopsis, sorghum and maize, respectively. A total of 249 WRKY collinear gene pairs between Miscanthus and maize were identified, followed by Miscanthus and sorghum (231), Miscanthus and cucumber (28), and Miscanthus and Arabidopsis (27). Both Miscanthus and maize belong to the Poaceae family, and more than 75.4% of the MsWRKY genes showed a syntenic relationship with WRKYs in maize. But some of MsWRKY genes were associated with more than one syntenic gene pair, indicating that WRKY genes in Poaceae family have gone through multiple rounds of duplication events. This may be the reason why monocotyledonous plants have far more WRKY genes than dicotyledonous plants. Importantly, collinear MsWRKY09/11/60/64/83/85/112/179 genes pairs were observed between Miscanthus and all of the other four species, suggesting that these orthologous pairs may have formed before the divergence of dicotyledonous and monocotyledonous plants.

Figure 7.

Figure 7

Synteny analysis of WRKYs between Miscanthus and other plant species. The collinear blocks are marked by gray lines, while the collinear gene pairs with WRKY genes are highlighted in the red lines. ‘M. sinensis’, ‘A. thaliana’, ‘C. sativus’, ‘S. bicolor’ and ‘Z. mays’ indicate Miscanthus sinensis, Arabidopsis thaliana, Cucumis sativus, Sorghum bicolor, and Zea mays, respectively.

Digital expression analysis of MsWRKY genes at different seasons and in different tissues

The study of the temporal and spatial expression profiles of MsWRKY genes used the microarray data provided by the JGI database and presented the results as heatmaps by TBtools. the microarray datasets used gene expression data for M. sinensis and included a total of 22 samples. The samples were taken from leaf (7), rhizome (9) and stem (6). The samples were collected from plants at different times of the year and reflected the expression of the MsWRKY gene at different stages of plant growth. 175 of the 179 genes showed differential expression in plants. Most MsWRKY genes are highly expressed in rhizome. The expression patterns of MsWRKY in different growth and development stages were also analyzed. Firstly, the MsWRKY gene is first expressed in rhizome in large quantities during plant growth. Then, some MsWRKY genes were heavily expressed in the leaf. Eventually, some MsWRKY genes are over-expression in the stem when plants wither. The results showed that these genes may be involved in stress response at sensitive developmental stages to improve plant tolerance (Fig. 8).

Figure 8.

Figure 8

Heatmaps of MsWRKY gene expression. MsWRKY expression levels in different tissues and at different seasons.

By analyzing the gene expression heat map and cis-acting elements together, it can be found that genes activated in different periods have different characteristics. In the early stage of plant development, the expression of the MsWRKY gene is mostly controlled by plant hormones and light regulatory elements. In the middle stage of plant development, the MsWRKY gene expressed in leaves is more regulated by infection and injury. At the end of plant development, MsWRKY genes expressed were mostly regulated by ABA and jasmonic acid, and some were stressed by environmental conditions such as drought. This suggests that the MsWRKY gene plays an important role in the growth of the perennial plant Miscanthus sinensis.

Discussion

WRKY transcription factors (TFs) are widely distributed in the plant kingdom and play a crucial role in stress tolerance. The WRKY gene family comprises 66 genes in Arabidopsis, 119 in maize, 94 in sorghum, 79 in potatoes, 70 in chickpeas, and 61 in cucumbers. By analyzing the genomic assembly of Miscanthus sinensis, 179 WRKY genes were identified. As the Miscanthus sinensis is a paleotetraploid31, the amount of the WRKY genes is much higher than that of normal plants, but this has not received much attention in previous studies. The MsWRKY gene is distributed across all 19 chromosomes of Miscanthus, and we have identified MsWRKY92, located on chromosome 7, as a gene that promotes flowering32. Conserved WRKY domains bind to the W-box motif in the promoter of WRKY target genes, which is the most important feature of the WRKY family33,34. A phylogenetic analysis of all the obtained MsWRKY genes has been performed. We performed a phylogenetic analysis of all the obtained MsWRKY genes and classified them into groups I, II, and III based on the number of WRKY domains and the type of zinc finger motif. group II is further subdivided into five subgroups: IIa, IIb, IIc, IId, and IIe. Group I had 24 MsWRKY genes, group II had 97, and group III had 58. In group II, group IIc had the most MsWRKY genes, with 32. The proportions of these genes are similar to those found in other plants3537.

Most MsWRKY genes have a very conserved WRKYGQK motif. However, other similar sequences have been found in many genes. (MsWRKY07 MsWRKY12 MsWRKY17 MsWRKY24 MsWRKY26 MsWRKY34 MsWRKY40 MsWRKY46 MsWRKY51 MsWRKY58 MsWRKY67 MsWRKY71 MsWRKY103 MsWRKY104 MsWRKY105 MsWRKY108 MsWRKY109 MsWRKY124 MsWRKY137 MsWRKY138 MsWRKY145 MsWRKY149 MsWRKY152 MsWRKY158 MsWRKY161 MsWRKY164 MsWRKY173) Furthermore, there are some genes that are clearly WRKY genes but are missing this sequence (MsWRKY44 MsWRKY65 MsWRKY106). These differences can seriously affect the ability of MsWRKY proteins to bind to W-box elements, which in turn affects the biological function of these proteins. Similar heptapeptide motif variations have been found in other plants, such as sorghum4. In soybeans, for example, two WRKY genes with WRKYGKK motif do not bind to normal W-box elements and do not work33. Therefore, further studies are necessary to confirm the biological function of these WRKY motif aberrant genes.

Studying the exon–intron structure of MsWRKY genes can reveal population-specific patterns. Similar MsWRKY genes on the evolutionary tree tend to have similar exon–intron patterns. The number of introns in the MsWRKY gene ranges from 0 to 5, and some MsWRKY genes do not contain introns, indicating that some WRKY genes have experienced intron loss events38. Intron-free genes have been found in other organisms. There are three main mechanisms by which intron-free genes are produced: reverse transcription (the integration of sequences from RNA into the genome), duplication of existing intron-free genes, and horizontal gene transfer39. Differences in the intron size of MsWRKY genes may result from gene duplication, inversion, and/or fusion events40. In conclusion, the diverse exon–intron structure of MsWRKY genes reflects the evolutionary diversity of the MsWRKY gene family.

Motif structural studies of the MsWRKY gene family reveal both structural conservatism and diversity. Motifs 1, 2, 3, and 4 correspond to WRKY domains and zinc finger domains, and they are found in most MsWRKY genes. Although most motifs' functions are unclear, their distribution also has certain rules. Motifs 15 and 19 were unique to group I. Motifs 9 and 13 were unique to group IIb. Motifs 12 were unique to group IIe. Motifs 10, 16, and 18 were unique to group III. Some of the motifs shared by different groups included motif 5, shared by groups I and IIc, and motifs 6 and 7, shared by groups IIa and IIb. Motif 8 is a nuclear localization signal (NLS), mainly in groups IId, IIe, and III41. In conclusion, motif analysis clearly demonstrates the structural differences of genes in different groups in MsWRKY. These motifs may reflect that these genes participate in specific biological processes and play similar biological functions.

Studying the cis-acting elements of MsWRKY can obtain more information about the gene expression of the MsWRKY gene family. Firstly, many transcription-related cis-acting elements42 including TATA-box, CAAT-box, A-box and HD-Zip are essential for gene expression, and most are involved in constructing transcription complexes. In addition, there is also a batch of cis-acting elements regulated by phytohormones. ABA regulates the ABA-responsive element (ABRE). Methyl jasmonate (MeJA) responsive element (TGACG-motif and CGTCA-motif) is regulated by jasmonate phytohormones. In addition, there are auxin-responsive elements (AuxRR-core and TGA-element), salicylic acid-responsive elements (TCA-element), and so on. A variety of biological and abiotic stresses also regulate these genes. These cis-acting elements include MBS (Anti-drought stress), LTR (anti-low temperature stress), and WUN-motif (wound-responsive elements). At the same time, W-box was also found in the promoter region of many MsWRKY genes. This suggests that there is also mutual regulation between WRKY genes43,44. Studies on cis-acting elements of MsWRKY reflect the diversity of the MsWRKY gene in gene expression regulation.

Existing transcriptome data of Miscanthus sinensis can be used to analyze the expression of MsWRKY gene in different stages of Miscanthus development and expression patterns in different tissues. Future studies should investigate the function of the MsWRKY gene by studying homologous genes in other model plants (such as Arabidopsis) and related plants (other grasses). Future research should focus on the effect of MsWRKY on plant growth and development and against adverse winter environments. Additionally, exploring the role of the MsWRKY gene in plant stress response would be valuable. Our research results provide a foundation for future studies in this field.

Conclusion

In this study, 179 WRKY genes were identified from Miscanthus sinensis. The identification, chromosome mapping, classification, phylogenetic analysis, gene structure analysis, conserved motif distribution analysis, gene ontology annotation, analysis of cis-acting elements, and digital expression pattern analysis have been performed. Through digital expression pattern analysis, the specific expression of the MsWRKY gene in different developmental stages and different parts of plants was found. At the same time, some MsWRKY genes may play an important role in plant stress resistance. In conclusion, this study is conducive to further research on the important functions of the WRKY gene in response to abiotic and biological stresses.

Supplementary Information

Author contributions

All authors reviewed the manuscript.

Funding

Overseas Scholar Program in the Hebei Province (C20190514), Science and Technology Project of Hebei Province (15457605D, 144576106D), National Natural Science Foundation of China (12072205).

Data availability

All custom scripts used for parsing and analyzing transposable elements, gene families, and gene expression, as described in Supplementary Notes, are available at JGI and NCBI database [https://data.jgi.doe.gov/refine-download/phytozome?organism=Msinensis&expanded=497&_gl=1*13m4cmx*_ga*MTQwODM0NDIwMy4xNjk4MDQwMzEw*_ga_YBLMHYR3C2*MTY5ODA0MDMwOS4xLjEuMTY5ODA0MDgwMS4wLjAuMA..] [https://www.ncbi.nlm.nih.gov/bioproject/PRJNA575573] [https://www.ncbi.nlm.nih.gov/bioproject/PRJNA346689].

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-024-55849-1.

References

  • 1.Ishiguro S, Nakamura K. Characterization of a cDNA encoding a novel DNA-binding protein, SPF1, that recognizes SP8 sequences in the 5′ upstream regions of genes coding for sporamin and β-amylase from sweet potato. Mol. Gen. Genet. MGG. 1994;244:563–571. doi: 10.1007/BF00282746. [DOI] [PubMed] [Google Scholar]
  • 2.Wang Q, Wang M, Zhang X, Hao B, Kaushik SK, Pan Y. WRKY gene family evolution in Arabidopsis thaliana. Genetica. 2011;139:973–983. doi: 10.1007/s10709-011-9599-4. [DOI] [PubMed] [Google Scholar]
  • 3.Wei KF, Chen J, Chen YF, Wu LJ, Xie DX. Molecular phylogenetic and expression analysis of the complete WRKY transcription factor family in maize. DNA Res. 2012;19(2):153–164. doi: 10.1093/dnares/dsr048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Baillo EH, Hanif MS, Guo Y, Zhang Z, Xu P, Algam SA. Genome-wide Identification of WRKY transcription factor family members in sorghum (Sorghum bicolor (L.) moench) PLoS ONE. 2020;15(8):e0236651. doi: 10.1371/journal.pone.0236651. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Zhang C, Wang D, Yang C, Kong N, Shi Z, Zhao P, Chen Q. Genome-wide identification of the potato WRKY transcription factor family. PLoS ONE. 2017;12(7):e0181573. doi: 10.1371/journal.pone.0181573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Waqas M, Azhar MT, Rana IA, Azeem F, Ali MA, Nawaz MA, Atif RM. Genome-wide identification and expression analyses of WRKY transcription factor family members from chickpea (Cicer arietinum L.) reveal their role in abiotic stress-responses. Genes Genom. 2019;41:467–481. doi: 10.1007/s13258-018-00780-9. [DOI] [PubMed] [Google Scholar]
  • 7.Chen C, Chen X, Han J, Lu W, Ren Z. Genome-wide analysis of the WRKY gene family in the cucumber genome and transcriptome-wide identification of WRKY transcription factors that respond to biotic and abiotic stresses. BMC Plant Biol. 2020;20:1–19. doi: 10.1186/s12870-020-02625-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Eulgem T, Rushton PJ, Robatzek S, Somssich IE. The WRKY superfamily of plant transcription factors. Trends Plant Sci. 2000;5(5):199–206. doi: 10.1016/S1360-1385(00)01600-9. [DOI] [PubMed] [Google Scholar]
  • 9.Rushton PJ, Somssich IE, Ringler P, Shen QJ. WRKY transcription factors. Trends Plant Sci. 2010;15(5):247–258. doi: 10.1016/j.tplants.2010.02.006. [DOI] [PubMed] [Google Scholar]
  • 10.Jin W, Wu F. Characterization of miRNAs associated with Botrytis cinerea infection of tomato leaves. BMC Plant Biol. 2015;15:1–14. doi: 10.1186/s12870-014-0410-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Bakshi M, Oelmüller R. WRKY transcription factors: Jack of many trades in plants. Plant Signal. Behav. 2014;9(2):e27700. doi: 10.4161/psb.27700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Finatto T, Viana VE, Woyann LG, Busanello C, Maia LCD, Oliveira ACD. Can WRKY transcription factors help plants to overcome environmental challenges? Genet. Mol. Biol. 2018;41:533–544. doi: 10.1590/1678-4685-gmb-2017-0232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Li W, Wang H, Yu D. Arabidopsis WRKY transcription factors WRKY12 and WRKY13 oppositely regulate flowering under short-day conditions. Mol. Plant. 2016;9(11):1492–1503. doi: 10.1016/j.molp.2016.08.003. [DOI] [PubMed] [Google Scholar]
  • 14.Luo X, Sun X, Liu B, Zhu D, Bai X, Cai H, Zhu Y. Ectopic expression of a WRKY homolog from Glycine soja alters flowering time in Arabidopsis. PLoS ONE. 2013;8(8):e73295. doi: 10.1371/journal.pone.0073295. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Cai Y, Chen X, Xie K, Xing Q, Wu Y, Li J, Guo Z. Dlf1, a WRKY transcription factor, is involved in the control of flowering time and plant height in rice. PLoS ONE. 2014;9(7):e102529. doi: 10.1371/journal.pone.0102529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Pandey SP, Somssich IE. The role of WRKY transcription factors in plant immunity. Plant Physiol. 2009;150(4):1648–1655. doi: 10.1104/pp.109.138990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Wu X, Shiroto Y, Kishitani S, Ito Y, Toriyama K. Enhanced heat and drought tolerance in transgenic rice seedlings overexpressing OsWRKY11 under the control of HSP101 promoter. Plant Cell Rep. 2009;28:21–30. doi: 10.1007/s00299-008-0614-x. [DOI] [PubMed] [Google Scholar]
  • 18.Fu QT, Yu DQ. Expression profiles of AtWRKY25, AtWRKY26 and AtWRKY33 under abiotic stresses. Yi chuan Hereditas. 2010;32(8):848–856. doi: 10.3724/SP.J.1005.2010.00848. [DOI] [PubMed] [Google Scholar]
  • 19.Jing Z, Liu Z. Genome-wide identification of WRKY transcription factors in kiwifruit (Actinidia spp.) and analysis of WRKY expression in responses to biotic and abiotic stresses. Genes Genom. 2018;40:429–446. doi: 10.1007/s13258-017-0645-1. [DOI] [PubMed] [Google Scholar]
  • 20.Gao H, Wang Y, Xu P, Zhang Z. Overexpression of a WRKY transcription factor TaWRKY2 enhances drought stress tolerance in transgenic wheat. Front. Plant Sci. 2018;9:997. doi: 10.3389/fpls.2018.00997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Zhang L, Zhao T, Sun X, Wang Y, Du C, Zhu Z, Xin H. Overexpression of VaWRKY12, a transcription factor from Vitis amurensis with increased nuclear localization under low temperature, enhances cold tolerance of plants. Plant Mol. Biol. 2019;100:95–110. doi: 10.1007/s11103-019-00846-6. [DOI] [PubMed] [Google Scholar]
  • 22.Mitros T, Session AM, James BT, Wu GA, Belaffif MB, Clark LV, Rokhsar DS. Genome biology of the paleotetraploid perennial biomass crop Miscanthus. Nat. Commun. 2020;11(1):5442. doi: 10.1038/s41467-020-18923-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Letunic I, Bork P. 20 years of the SMART protein domain annotation resource. Nucleic acids research. 2018;46(D1):D493–D496. doi: 10.1093/nar/gkx922. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Kumar S, Stecher G, Tamura K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 2016;33(7):1870–1874. doi: 10.1093/molbev/msw054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, He Y, Xia R. TBtools: An integrative toolkit developed for interactive analyses of big biological data. Mol. Plant. 2020;13(8):1194–1202. doi: 10.1016/j.molp.2020.06.009. [DOI] [PubMed] [Google Scholar]
  • 26.Zou Z, Yang L, Wang D, Huang Q, Mo Y, Xie G. Gene structures, evolution and transcriptional profiling of the WRKY gene family in castor bean (Ricinus communis L.) PLoS ONE. 2016;11(2):e0148243. doi: 10.1371/journal.pone.0148243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Götz S, García-Gómez JM, Terol J, Williams TD, Nagaraj SH, Nueda MJ, Conesa A. High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res. 2008;36(10):3420–3435. doi: 10.1093/nar/gkn176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Lovering R, Hanson IM, Borden KL, Martin S, O'Reilly NJ, Evan GI, Freemont PS. Identification and preliminary characterization of a protein motif related to the zinc finger. Proc. Natl. Acad. Sci. USA. 1993;90(6):2112–2116. doi: 10.1073/pnas.90.6.2112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Tao Z, Liu H, Qiu D, Zhou Y, Li X, Xu C, Wang S. A pair of allelic WRKY genes play opposite roles in rice-bacteria interactions. Plant Physiol. 2009;151(2):936–948. doi: 10.1104/pp.109.145623. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Jin Y, Ding X, Li J, Guo Z. Isolation and characterization of wheat ice recrystallisation inhibition gene promoter involved in low temperature and methyl jasmonate responses. Physiol. Mol. Biol. Plants. 2022;28(11–12):1969–1979. doi: 10.1007/s12298-022-01257-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Baillo EH, Kimotho RN, Zhang Z, Xu P. Transcription factors associated with abiotic and biotic stress tolerance and their potential for crops improvement. Genes. 2019;10(10):771. doi: 10.3390/genes10100771. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Yu Y, Hu R, Wang H, Cao Y, He G, Fu C, Zhou G. MlWRKY12, a novel Miscanthus transcription factor, participates in pith secondary cell wall formation and promotes flowering. Plant Sci. 2013;212:1–9. doi: 10.1016/j.plantsci.2013.07.010. [DOI] [PubMed] [Google Scholar]
  • 33.Bi C, Xu Y, Ye Q, Yin T, Ye N. Genome-wide identification and characterization of WRKY gene family in Salix suchowensis. PeerJ. 2016;4:e2437. doi: 10.7717/peerj.2437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Ding M, Chen J, Jiang Y, Lin L, Cao Y, Wang M, Ye W. Genome-wide investigation and transcriptome analysis of the WRKY gene family in Gossypium. Mol. Genet. Genom. 2015;290:151–171. doi: 10.1007/s00438-014-0904-7. [DOI] [PubMed] [Google Scholar]
  • 35.Li D, Liu P, Yu J, Wang L, Dossa K, Zhang Y, Zhang X. Genome-wide analysis of WRKY gene family in the sesame genome and identification of the WRKY genes involved in responses to abiotic stresses. BMC Plant Biol. 2017;17(1):1–19. doi: 10.1186/s12870-017-1099-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Ross CA, Liu Y, Shen QJ. The WRKY gene family in rice (Oryza sativa) J. Integr. Plant Biol. 2007;49(6):827–842. doi: 10.1111/j.1744-7909.2007.00504.x. [DOI] [Google Scholar]
  • 37.Luo X, Bai X, Sun X, Zhu D, Liu B, Ji W, Zhu Y. Expression of wild soybean WRKY20 in Arabidopsis enhances drought tolerance and regulates ABA signalling. J. Exp. Bot. 2013;64(8):2155–2169. doi: 10.1093/jxb/ert073. [DOI] [PubMed] [Google Scholar]
  • 38.Yenerall P, Zhou L. Identifying the mechanisms of intron gain: progress and trends. Biol. Direct. 2012;7(1):1–10. doi: 10.1186/1745-6150-7-29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Zou M, Guo B, He S. The roles and evolutionary patterns of intronless genes in deuterostomes. Int. J. Genom. 2011;2011:1–8. doi: 10.1155/2011/680673. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Li MY, Xu ZS, Tian C, Huang Y, Wang F, Xiong AS. Genomic identification of WRKY transcription factors in carrot (Daucus carota) and analysis of evolution and homologous groups for plants. Sci. Rep. 2016;6(1):23101. doi: 10.1038/srep23101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Lin JR, Hu J. SeqNLS: nuclear localization signal prediction based on frequent pattern mining and linear motif scoring. PLoS ONE. 2013;8(10):e76864. doi: 10.1371/journal.pone.0076864. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Hernandez-Garcia CM, Finer JJ. Identification and validation of promoters and cis-acting regulatory elements. Plant Sci. 2014;217:109–119. doi: 10.1016/j.plantsci.2013.12.007. [DOI] [PubMed] [Google Scholar]
  • 43.Phukan UJ, Jeena GS, Shukla RK. WRKY transcription factors: molecular regulation and stress responses in plants. Front. Plant Sci. 2016;7:760. doi: 10.3389/fpls.2016.00760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Tao Z, Kou Y, Liu H, Li X, Xiao J, Wang S. OsWRKY45 alleles play different roles in abscisic acid signalling and salt stress tolerance but similar roles in drought and cold tolerance in rice. J. Exp. Bot. 2011;62(14):4863–4874. doi: 10.1093/jxb/err144. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

All custom scripts used for parsing and analyzing transposable elements, gene families, and gene expression, as described in Supplementary Notes, are available at JGI and NCBI database [https://data.jgi.doe.gov/refine-download/phytozome?organism=Msinensis&expanded=497&_gl=1*13m4cmx*_ga*MTQwODM0NDIwMy4xNjk4MDQwMzEw*_ga_YBLMHYR3C2*MTY5ODA0MDMwOS4xLjEuMTY5ODA0MDgwMS4wLjAuMA..] [https://www.ncbi.nlm.nih.gov/bioproject/PRJNA575573] [https://www.ncbi.nlm.nih.gov/bioproject/PRJNA346689].


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES