Skip to main content
PLOS One logoLink to PLOS One
. 2019 Mar 21;14(3):e0214001. doi: 10.1371/journal.pone.0214001

The transcriptome analysis of Protaetia brevitarsis Lewis larvae

Zhongjie Li 1,*, Miaomiao Meng 1, Shasha Li 1, Bo Deng 1
Editor: Isabelle Chemin2
PMCID: PMC6428405  PMID: 30897120

Abstract

Larvae of the pest Protaetia brevitarsis are used to treat infections in traditional Chinese medicine. However, genomic information about this non-model species is currently lacking. To better understand the fundamental biology of this non-model species, its transcriptome was obtained using next generation sequencing and then analyzed. A total of 7.62 Gb of clean reads were obtained, which were assembled into 169,087 transcripts corresponding to 142,000 annotated unigenes. These unigenes were functionally classified according to Gene Ontology (GO), euKaryotic Ortholog Groups of proteins (KOG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations. A total of 41,921 unigenes were assigned to 56 GO terms, 21,454 unigenes were divided among 26 KOG categories, and 16,368 unigenes were assigned to 32 KEGG pathways. In addition, 19,144 simple sequence repeats (SSRs) were identified. Furthermore, several kinds of natural antimicrobial peptides and proteins, 4 histones with potential antimicrobial activity, and 41 potential antimicrobial peptide sequences were identified. These data are the first reported whole transcriptome sequence of P. brevitarsis larvae, which represents a valuable genomic resource for studying this species, thus promoting the utilization of its medical potential.

Introduction

The beetle species Protaetia brevitarsis Lewis is widely distributed in China, and it is a pest of plants, including vegetable crops. In traditional Chinese medicine, its larvae are used to treat microbial infections. However, very little research has been conducted on this species. Moreover, there is also no genomic data from this non-model organism.

Transcriptome data represent an essential type of genome data and offer an opportunity to explore the genomes of non-model organisms. Owing to the rapid development of next generation sequencing (NGS) technologies, such as the Illumina sequencing platform, the transcriptomes of non-model organisms can be characterized quickly, inexpensively, and accurately [1, 2]. Accordingly, the transcriptomes of many non-model organisms have been characterized by NGS, such as the green odorous frog Odorrana margaretae [3], the inland robust scorpion Urodacus yaschenkoi [4], the land snail Koreanohadra kurodana [2], the Chinese red-headed centipede Scolopendra subspinipes mutilans [5], and several luminescent beetle species [6].

The P. brevitarsis larvae live on the ground and feed on rotten food. This type of habitat and diet lead them to encounter many different kinds of pathogenic microbes, which has imposed strong selective pressures on their immune systems. The insect immune system is composed of two elements: cellular defense responses and humoral defense responses [7]. Insect antimicrobial components secreted by the insect humoral immune system, especially antimicrobial peptides and proteins (AMPs), play an important role in defense against microorganisms, which may be the molecular basis of the anti-infection effects of P. brevitarsis larvae used in medicine. Many insect AMPs have been identified, characterized, and categorized according to familiar sequences and structures, including linear-form, disulfide bridges, and proline- or glycine-rich regions [8]. Additionally, AMPs from insects also show great potential as anti-infection agents [9].

In this study, the transcriptome profile of P. brevitarsis larvae was constructed using the Illumina sequencing platform. After de novo transcriptome assembly, bioinformatic analyses were conducted to annotate the functional direction of the assembled sequences. The analyses focused on P. brevitarsis larvae AMPs. This research provides the first large-scale transcriptome data from P. brevitarsis larvae, which improves the current understanding and medical utilization of this species while enabling the discovery of important functional genes.

Materials and methods

Ethics statement

All animal collection and utility protocols were approved by the Henan University of Science and Technology of Biology Animal Use Ethics Committee.

Insect collection

The Protaetia brevitarsis Lewis larvae used in this study were obtained from a breeder in Henan, China. After being washed in 0.9% saline, the samples were immediately frozen in liquid nitrogen and stored at -80°C until use.

cDNA library construction, Illumina sequencing, de novo assemble, and gene annotation

Total RNA was extracted using TRIzol reagent (Life Technologies, Carlsbad, CA, USA), and cDNA library construction and Illumina sequencing of the samples using an Illumina HiSeqTM 2000 sequencer (Illumina, San Diego, CA, USA) were performed by Novogene Bioinformatics Technology Co., Ltd. (Beijing, China).

De novo transcriptome assembly was conducted using the program Trinity [1]. Gene annotation was based on searches of various public databases, including NCBI Nr (non-redundant protein sequences), NCBI Nt (non-redundant nucleotide sequences), Pfam (Protein family), KOG (euKaryotic Ortholog Groups of proteins), Swiss-Prot (a manually annotated and reviewed protein sequence database), KO (Kyoto Encyclopedia of Genes and Genomes Ortholog database), and GO (Gene Ontology). SSR markers were isolated from the transcriptome using MISA (http://pgrc.ipk-gatersleben.de/misa/misa.html).

A detailed description of cDNA library construction, Illumina sequencing, the de novo assemble and gene annotation is given in S1 File.

Potential AMP sequences identification

To identify more potential AMPs and peptide sequences with potential antimicrobial activity in the transcriptome of P. brevitarsis larvae, the assembled unigenes were also BLASTed with known AMPs from The Antimicrobial Peptide Database (APD) [10], Collection of Anti-Microbial Peptides (CAMP) [11], and the Linking Antimicrobial Peptides (LAMP) database [12] using BLAST 2.2.31+ with a sequence similarity cutoff of 80%.

Results and discussion

Transcriptome sequencing and de novo assembly

The P. brevitarsis larvae cDNA library was sequenced using the Illumina HiSeqTM 2000 sequencer, ultimately obtaining 50,796,336 clean reads with 98.09% Q20 and 42.88% GC content after the removal of adaptor sequences, ambiguous nucleotides, and low-quality sequences. The clean reads were assembled into 169,087 transcripts, which corresponded to 142,000 unigenes. A summary of sequencing and assembly results was presented in Tables A and B in S1 File, and the unigene sequences was presented in S2 File. The clean data had been deposited into the Sequence Read Archive (SRA) database of the National Center for Biotechnology Information (NCBI) under the accession number PRJNA516097.

Functional annotation of unigenes

After public databases were used for sequence similarity searches, 56,937 unigenes (40.09% of all unigenes) were successfully matched to annotations in at least one database (Table C in S1 File), which is similar to rates reported in other beetle species [1315], indicating that a large number of species-specific genes, noncoding gene regions, and non-conserved domains could not be matched to annotated genes or that chimeric sequences had occurred as a result of assembly errors [16]. Among the annotated unigenes, 25.87% (36,743) had significant matches in the Nr database, 9.07% (12,881) in the Nt database, 11.52% (16,368) in the KO database, 20.23% (28,727) in the Swiss-Prot database, 28.52% (40,511) in the PFAM database, 29.52% (41,921) in the GO database, and 15.1% (21,454) in the KOG database. Furthermore, there were 4,618 unigenes (3.25%) annotated in all seven databases.

The E-value distribution and similarity distribution of the annotated unigenes in the Nr database identified 49.2% (Fig 1A) of annotated sequences as having strong homology (E-value less than 1E-30), with 77.8% (Fig 1B) of the annotated sequences having a similarity index higher than 60%. The top-hit species distribution showed that 27.1% (Fig 1C) of P. brevitarsis larvae unigene sequences matched with sequences from red flour beetle Tribolium castaneum. This overlap of unigenes was lower than that for other beetle species, such as the pine shoot beetle Tomicus yunnanensis (62.48%), the salt marsh beetle Pogonus chalceus, and the nipa palm hispid Octodonta nipae (72.6%) [1719]. However, this value was similar with respect to the seven-spot ladybird beetle Coccinella septempunctata (28.5%) [15]. In the NCBI protein database, the number of protein sequences from T. castaneum and the mountain pine beetle Dendroctonus ponderosae were 42,319 and 49,666, respectively. But, only 3.4% of P. brevitarsis larvae unigene sequences matched with those of D. ponderosae. It was indicated that Protaetia brevitarsis Lewis had near evolution distance with T. castaneum.

Fig 1. E-value distribution, similarity distribution, and top-hit species of Protaetia brevitarsis Lewis Larvae unigenes blast against Nr database.

Fig 1

A: E-value distribution; B: Similarity distribution; C: Top-hit species.

GO categories were used to classify the functions of the unigenes, and the classification of GO categories was conducted using the GOseq R package based on the Wallenius non-central hyper-geometric distribution [20]. According to GO annotation, 41,921 unigenes were classified into three main GO categories: biological process, cellular component, and molecular function (Fig 2). Among the biological process terms, 25 level-2 categories were identified, and cellular process and metabolic process were the most abundant terms. For cellular components, 20 level-2 categories were identified, and cell, cell part, and organelle were the most abundant groups. For molecular function, 11 level-2 categories were identified, and the genes that were associated with binding and catalytic activities were the most represented. Similar gene GO classification distributions had also been reported in transcriptomic studies of other beetles [17, 21, 22]. To further predict putative protein functions, a KOG analysis was performed. Based on our results, 21,454 unigenes were divided into 26 categories (Fig 3), and the largest category was ‘general functional prediction only’ (16.02%), followed by ‘posttranslational modification, protein turnover, chaperones’ (14.38%), ‘signal transduction mechanism’ (12.40%), and ‘translation, ribosomal structure and biogenesis’ (12.04%). The KEGG [23] database was used to identify potential biological pathways represented in the P. brevitarsis larvae transcriptome, and the classification of KEGG pathways was performed using KOBAS software [24]. A total of 16,368 unigenes were classified into five categories, including cellular processes, environmental information processing, genetic information processing, metabolism, and organismal systems, and then assigned to 32 sub-terms (Fig 4). Among the pathways, translation (2391 unigenes) and signal transduction (2207 unigenes) were most highly represented. Similar gene KEGG classifications distributions had also been found in coconut leaf beetle Brontispa longissima [25]. These annotations provide a valuable resource for future research on specific processes, functions, and pathways in P. brevitarsis and related species.

Fig 2. Histogram of the gene ontology (GO) classification of the Protaetia brevitarsis Lewis Larvae unigenes.

Fig 2

Each annotated sequence could be assigned to more than one GO term.

Fig 3. Histogram of the KOG (euKaryotic Ortholog Groups of proteins) classification of the Protaetia brevitarsis Lewis Larvae unigenes.

Fig 3

Fig 4. Histogram of the KEGG Pathway classification of the the Protaetia brevitarsis Lewis Larvae unigenes.

Fig 4

A: Cellular process, B: Environmental information processing, C: Genetic information processing, D: Metabolism, E: Organismal system.

SSR detection from the assembled unigenes

As SSR markers are highly polymorphic, co-dominant in their inheritance, highly specific in their amplification, and repeatable in their genotyping, they are widely used in genetic studies to analyze genetic diversity and biological evolution [26]. And transcriptome sequencing using NGS technologies has greatly accelerated the process of discovering molecular markers in non-model organisms [27].

In this study, a minimum of ten repeat numbers for mononucleotide repeats was selected, six for dinucleotide repeats, and five for tri-, tetra-, penta-, and hexa-nucleotide repeats. Thus, 19,144 SSRs were isolated from 13,933 sequences, with 3,136 sequences containing more than one SSR. Additionally, 1,522 SSRs were found in the compound form. Among the identified SSRs, mononucleotide repeats accounted for the highest percentage (13,550 SSRs, 70.77%) followed by trinucleotide repeats (3888, 20.30%), dinucleotide repeats (1575, 8.24%), and tetranucleotide repeats (110, 0.62%). A similar distribution of repeat units were also found in the nipa palm hispid beetle Octodonta nipae [28]. Frequency of SSRs based on the number of repeat units in Protaetia brevitarsis Lewis Larvae was provided in Table D in S1 File. The five most frequent motif types in the transcriptome were A/T (13,195 SSRs, 68.92%), AAT/ATT (1207, 6.30%), AGG/CCT (1001, 5.23%), AT/AT (861, 4.50%), and AAG/CTT (669, 3.49%). These SSR markers will be useful for assessing genetic diversity and population structure in P. brevitarsis and related species.

AMPs

Drug-resistant bacterial pathogens have caused significant health problem worldwide [29, 30]. AMPs have become a particularly promising class of candidates for overcoming this problem, as they have unique modes of action that differ from those of classical antibiotics and are difficult for bacteria to develop resistance to [31, 32].

Natural AMPs play an important role in immune systems, and they protect multicellular organisms by controlling and/or combating pathogens by killing microbes directly or through functions related to diverse immunomodulatory activities [33]. A greatest diversity of AMPs have been identified in different kinds of insects, including beetles [34, 35]. In this study, several kinds of natural AMPs were identified in the P. brevitarsis larval transcriptome.

Lysozymes

Lysozymes are a group of enzymes with antibacterial activity that ubiquitously exist in diverse organisms, including invertebrates and vertebrates. They can catalyze the hydrolysis of β-1,4-glycosidic bonds between N-acetylglucosamine and N-acetylmuramic acid in the peptidoglycan of bacterial cell walls, which causes bacterial cell lysis and thereby prevents bacterial infections [36]. Based on structural and functional features, lysozymes are categorized into three major groups in the animal kingdom: invertebrate-type (i-type) lysozymes, chicken-type (c-type) lysozymes, and goose-type (g-type) lysozymes [37]. The c- and g-type lysozymes are found in various vertebrate and invertebrate species [38]. And the g-type lysozymes have a transglycosylase SLT domain (Bacterial lytic transglycosylases Domain) which contains three conserved catalytic binding sites, three conserved substrate binding sites, and six conserved cysteine residues. In this study, five of these unigenes (c16827_g1, c82138_g1, c85114_g1, c89942_g2, c91216_g2) were annotated as i-type lysozymes, which is similar to the number found in the beetles Harmonia axyridis and Meligethes aeneus [39]. As shown in Fig 5, the unigene c16827_g1 shares 41.77% identity with the i-type lysozyme (XP_022920100.1) from the beetle Onthophagus Taurus (Fig 5A), c82138_g1 shares 59.87% identity with the i-type lysozyme (XP_023027517.1) from the beetle Leptinotarsa decemlineata (Fig 5B), c85114_g1 shares 50.00% identity with the i-type lysozyme (ALM25916.1) from the beetle Harmonia axyridis (Fig 5C), and c89942_g2 and c912216_g2 share 46.71% and 48.50% identity with the i-type lysozyme (ALM25918.1) from the beetle Harmonia axyridis (Fig 5D). Two of these unigenes (c87216_g2, c89288_g1) were annotated as c-type lysozymes, and they shared 51.15% and 31.49% identity with a c-type lysozyme (BBE27867.1) from Locusta migratoria (Fig 5E). One of these unigenes (c124211_g1) was annotated as a g-type lysozyme, and it shares 71.21% identity with a g-type lysozyme (ADV36303.1) from Physella acuta (Fig 5F), which has shown activity against both E. coli and MRSA [40].

Fig 5. Alignment of the amino acid sequences of putative lysozymes from Protaetia brevitarsis Lewis Larvae with known sequences.

Fig 5

XP_022920100.1: i-type lysozyme from Onthophagus Taurus. XP_023027517.1: i-type lysozyme from Leptinotarsa decemlineata. ALM25916.1 and ALM25918.1: i-type lysozyme from Harmonia axyridis. BBE27867.1: c-type lysozyme from Locusta migratoria. ADV36303.1: g-type lysozyme from Physella acuta; Catalytic residues and the substrate binding sites of g-type lysozymes are marked with black pentagon (★) and black triangle (▼), respectively; and white triangles (▽) show the six conserved cysteine residues.

Defensin

Defensins are small, cationic, and cysteine-rich peptides that include 3–4 intramolecular disulfide bonds. Hundreds of insect defensins have been identified since the first defensins were reported in the flesh fly Sarcophaga peregrina [41] and the black blowfly Phormia terraenovae [42]. They are mainly active against Gram-positive bacteria, and some are also active against Gram-negative bacteria as well as fungi [43]. In this study, one of these unigenes (c23254_g1) was annotated as a defensin, and two (c158345_g1, c79927_g1) were annotated as defensin-like proteins. As shown in Fig 6, the unigene c23254_g1 shares 89.16% identity with defensin-3 (AFD01290.1) from Pinus sylvestris (Fig 6A), c158345_g1 shares 56.52% identity with defensin-like 1 protein (AJQ21502.1) from Mytilus galloprovincialis (Fig 6B), and the c79927_g1 sequence was identical to a defensin-like protein (XP_021821153.1) from Prunus avium (Fig 6C).

Fig 6. Alignment of the amino acid sequences of putative defensin and defensin-like proteins from Protaetia brevitarsis Lewis Larvae with known sequences.

Fig 6

AFD01290.1: defensin-3 from Pinus sylvestris. AJQ21502.1: defensin-like 1 protein from Mytilus galloprovincialis; XP_021821153.1: defensin-like protein from Prunus avium.

PBSIP

The antibacterial peptide PBSIP was previously reported in Protaetia brevitarsis seulensis [44]. In this study, two unigenes (c10173_g1, c75132_g1) were annotated as matching the antibacterial peptide PBSIP, sharing 96.83% and 73.81% sequence identity with PBSIP (ABP97093.1) from Protaetia brevitarsis seulensis (Fig 7A), respectively.

Fig 7. Alignment of the amino acid sequences of putative PBSIP protein and Attacin from Protaetia brevitarsis Lewis Larvae with known sequences.

Fig 7

A: putative PBSIP protein; ABP97093.1 is the PBSIP from Protaetia brevitarsis seulensis. B: putative attacin; AHH34163.1: attacin 2 from Microdera dzhungarica.

Attacin

Attacins are glycine-rich immune proteins originally isolated from the moth Hyalophora cecropia, and they have a broad antibacterial range, particularly for Gram-negative bacteria [43]. In this study, one unigene (c85759_g1) was annotated as attacin, and it has 52.35% identity with attacin 2 (AHH34163.1) from Microdera dzhungarica (Fig 7B).

Coprisin

Coprisin is a cysteine-rich peptide isolated from the dung beetle Copris tripartitus, and it has exhibited broad-spectrum activity against both Gram-positive and Gram-negative bacteria [45]. In this study, three unigenes (c31180_g1, c541_g1, c90139_g6) were annotated as coprisin, with 50.00%, 45.00%, and 62.50% identity shared with coprisin (ABP97087.1) from Copris tripartitus (Fig 8A), respectively.

Fig 8. Alignment of the amino acid sequences of putative coprisin and coleoptericin-like protein from Protaetia brevitarsis Lewis Larvae with known sequences.

Fig 8

A: putative coprisin; ABP97087.1: coprisin from Copris tripartitus. B: putative coleoptericin-like protein; Q25054.1: Holotricin-2 from Holotrichia diomphalia.

Coleoptericin-like protein

Coleoptericin-like proteins, including coleoptericin, acaloleptin, holotricin, and rhinocerosin, are glycine- and proline-rich AMPs with bactericidal activity against Gram-positive and Gram-negative bacteria. In this study, two unigenes (c83917_g1, c83917_g2) were annotated as holotricin, and they share 93.70% and 99.21% identity with Holotricin-2 (Q25054.1) from Holotrichia diomphalia (Fig 8B), respectively. Additionally, one fragment of a coleoptericin gene (c65461_g1) was similar to coleoptericin A (BAB40436.1) from Trypoxylus dichotomus.

Thaumatin-like proteins

Thaumatin-like proteins (TLPs) are polypeptides that share sequence similarity with thaumatin, a sweet-tasting protein originally identified from the fruit of the West African rain forest shrub Thaumatococcus daniellii [46]. TLPs are widely distributed in organisms, including fungi, animals, and plants [47]. They exhibit antifungal activity via membrane permeabilization [48], β-glucan binding and degradation [49], inhibition of enzymes [50], and apoptosis [51]. In this study, three unigenes (c114799_g1, c81196_g1, c82522_g1) were annotated as thaumatin-like protein. The unigene c114799_g1 shares 56.84% identity with a thaumatin-like protein (ANC28051.1) from Polyporus umbellatus (Fig 9A), c81196_g1 shares 92.65% identity with a thaumatin-like protein (P50694.1) from Prunus avium (Fig 9B), and c82522_g1 shares 93.48% identity with a thaumatin-like protein (AJW67289.1) from Pinus massoniana (Fig 9C). Additionally, two fragments of peritrophin-like genes (c119696_g1 and c94535_g1) were also identified, with c119696_g1 and c94535_g1 being identical to a thaumatin-like protein (PQQ05260.1) from Prunus yedoensis var. nudiflora and a thaumatin-like protein (ABE01396.1) from Camellia sinensis, respectively.

Fig 9. Alignment of the amino acid sequences of putative Thaumatin-like protein from Protaetia brevitarsis Lewis Larvae with known sequences.

Fig 9

ANC28051.1: thaumatin-like protein from Polyporus umbellatus; P50694.1: thaumatin-like protein from Prunus avium; AJW67289.1: thaumatin-like protein from Pinus massoniana.

Peritrophin

Peritrophin was first isolated from insect intestines, where it was inferred to protect insects from microorganismal infection [52]. These proteins have a signal sequence and one of three kinds of peritrophin domains: peritrophin-A domains, peritrophin-B domains, or peritrophin-C domains. Peritrophin-A domains are also referred to as chitin-binding type 2 domains [53]. Today, more and more peritrophin-like proteins have also been found in many organisms [54]. In this study, the unigene c72353_g1 was annotated as a mucin-like peritrophin and shared an identity of 28.21% with a mucin-like peritrophin (ACN62986.1) from Popillia japonica (Fig 10A). The unigene c75073_g1 was annotated as peritrophin and shared an identity of 24.02% with peritrophin (ACN62985.1) from Popillia japonica (Fig 10B). The unigene c77120_g1 was annotated as peritrophin-1-like protein and shared an identity of 35.51% with the peritrophin-1 (XP_003251779.1) protein from Apis mellifera (Fig 10C).

Fig 10. Alignment of the amino acid sequences of putative peritrophin from Protaetia brevitarsis Lewis Larvae with known sequences.

Fig 10

ACN62986.1: mucin-like peritrophin from Popillia japonica; ACN62985.1: peritrophin from Popillia japonica; XP_003251779.1: peritrophin-1 protein from Apis mellifera. The chitin-binding domains were lined.

Many other proteins and peptide sequence derived from other protein or peptide have also shown antimicrobial activity [55, 56]. To identify more proteins and peptide sequences with potential antimicrobial activity, the transcripts were BLASTed against AMP databases (APD, CAMP, and LAMP) with an identity cut off 80%. Finally, we identified four unigenes that were annotated as histones and shared high identity with histones with antimicrobial activity in the database. As shown in Fig 11, the unigene c13753_g1 shares an identity of 85.40% with the H2A histone (ID in APD: AP02804) from the Pacific white shrimp Litopenaeus vannamei (Fig 11A), which has anti-Gram positive bacteria activity [57]. The unigenes c117375_g1 and c84048_g2 share identities of 99.03% and 95.15% respectively, with the H4 histone (ID in APD: AP02807) from American cupped oysters Crassostrea virginica (Fig 11B), which has anti-Gram negative bacteria activity [58]. The unigene c92891_g1 shares an identity of 81.75% with the H2B histone (ID in APD: AP02808) from cattle Bos taurus (Fig 11C), which has anti-Gram negative bacteria activity [59], while c86640_g1 and c87143_g1 share identities of 86.76% and 88.24%, respectively, with the H3 histone (ID in APD: AP02809) from cattle Bos taurus (Fig 11D), which has also shown anti-Gram negative bacteria activity [59]. Furthermore, we also identified 41 peptide sequences (Table E in S1 File) predicted to have potential antimicrobial activity on the basis of sequence similarity.

Fig 11. Alignment of the amino acid sequences of putative Histones from Protaetia brevitarsis Lewis Larvae with known sequences with antimicrobial activity.

Fig 11

AP02804: histone H2A from Pacific white shrimp Litopenaeus vannamei in the database APD; AP02807: histone H4 from American cupped oysters Crassostrea virginica in the database APD; AP02808: histone H2B from calf thymus Bos taurus in the database APD; AP02809: histone H3 from calf thymus Bos taurus in the database APD.

Conclusion

This study is the first whole transcriptome analysis of larvae from the non-model species P. brevitarsis, which accordingly will contribute to the exploration of the species through the resulting genetic resources and will thus facilitate further comprehensive studies of this species. Moreover, the natural AMPs, proteins with potential antimicrobial activity, and potential antimicrobial peptide sequences identified from transcripts in P. brevitarsis larvae provide templates for further development of new antimicrobial agents and will promote the utilization of the species for its full medical potential.

Supporting information

S1 File. Detailed description of the methods and supplementary tables.

(DOCX)

S2 File. The final assembly unigenes of Protaetia brevitarsis Lewis Larvae.

(FASTA)

Data Availability

All relevant data are within the paper and its Supporting Information files. Clean reads data that support the findings of this study are available in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) database under the accession number PRJNA516097.

Funding Statement

This work was supported by Doctor Scientific Research Fund of Henan University of Science and Technology (grant number: 4020-13480034), and The National Natural Science Foundation of China (grant number: 81703475). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011;29(7):644–52. 10.1038/nbt.1883 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Kang SW, Patnaik BB, Hwang HJ, Park SY, Chung JM, Song DK, et al. Transcriptome sequencing and de novo characterization of Korean endemic land snail, Koreanohadra kurodana for functional transcripts and SSR markers. Mol Genet Genomics. 2016;291(5):1999–2014. 10.1007/s00438-016-1233-9 . [DOI] [PubMed] [Google Scholar]
  • 3.Qiao L, Yang W, Fu J,Song Z. Transcriptome profile of the green odorous frog (Odorrana margaretae). PLoS One. 2013;8(9):e75211 10.1371/journal.pone.0075211 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Luna-Ramirez K, Quintero-Hernandez V, Juarez-Gonzalez VR,Possani LD. Whole Transcriptome of the Venom Gland from Urodacus yaschenkoi Scorpion. PLoS One. 2015;10(5):e0127883 10.1371/journal.pone.0127883 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Yoo WG, Lee JH, Shin Y, Shim JY, Jung M, Kang BC, et al. Antimicrobial peptides in the centipede Scolopendra subspinipes mutilans. Funct Integr Genomics. 2014;14(2):275–83. 10.1007/s10142-014-0366-3 . [DOI] [PubMed] [Google Scholar]
  • 6.Wang K, Hong W, Jiao H,Zhao H. Transcriptome sequencing and phylogenetic analysis of four species of luminescent beetles. Sci Rep. 2017;7(1):1814 10.1038/s41598-017-01835-9 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Tsuzuki S, Matsumoto H, Furihata S, Ryuda M, Tanaka H, Sung EJ, et al. Switching between humoral and cellular immune responses in Drosophila is guided by the cytokine GBP. Nature Communications. 2014;5 10.1038/ncomms5628 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Imler JL,Bulet P. Antimicrobial peptides in Drosophila: structures, activities and gene regulation. Chem Immunol Allergy. 2005;86:1–21. 10.1159/000086648 . [DOI] [PubMed] [Google Scholar]
  • 9.Yi HY, Chowdhury M, Huang YD,Yu XQ. Insect antimicrobial peptides and their applications. Appl Microbiol Biotechnol. 2014;98(13):5807–22. 10.1007/s00253-014-5792-6 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Wang GS, Li X,Wang Z. APD3: the antimicrobial peptide database as a tool for research and education. Nucleic Acids Research. 2016;44(D1):D1087–D1093. 10.1093/nar/gkv1278 WOS:000371261700153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Thomas S, Karnik S, Barai RS, Jayaraman VK,Idicula-Thomas S. CAMP: a useful resource for research on antimicrobial peptides. Nucleic Acids Research. 2010;38:D774–D780. 10.1093/nar/gkp1021 WOS:000276399100121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Zhao XW, Wu HY, Lu HR, Li GD,Huang QS. LAMP: A Database Linking Antimicrobial Peptides. Plos One. 2013;8(6). 10.1371/journal.pone.0066557 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Yang H, Su T, Yang W, Yang C, Lu L,Chen Z. The developmental transcriptome of the bamboo snout beetle Cyrtotrachelus buqueti and insights into candidate pheromone-binding proteins. PLoS One. 2017;12(6):e0179807 10.1371/journal.pone.0179807 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Zou D, Coudron TA, Liu C, Zhang L, Wang M,Chen H. Nutrigenomics in Arma chinensis: transcriptome analysis of Arma chinensis fed on artificial diet and Chinese oak silk moth Antheraea pernyi pupae. PLoS One. 2013;8(4):e60881 10.1371/journal.pone.0060881 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Qi X, Zhang L, Han Y, Ren X, Huang J,Chen H. De novo transcriptome sequencing and analysis of Coccinella septempunctata L. in non-diapause, diapause and diapause-terminated states to identify diapause-associated genes. BMC Genomics. 2015;16:1086 10.1186/s12864-015-2309-3 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Fu N, Wang Q,Shen HL. De Novo Assembly, Gene Annotation and Marker Development Using Illumina Paired-End Transcriptome Sequences in Celery (Apium graveolens L.). Plos One. 2013;8(2). 10.1371/journal.pone.0057686 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Zhu JY, Zhao N,Yang B. Global transcriptome profiling of the pine shoot beetle, Tomicus yunnanensis (Coleoptera: Scolytinae). PLoS One. 2012;7(2):e32291 10.1371/journal.pone.0032291 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Van Belleghem SM, Roelofs D, Van Houdt J,Hendrickx F. De novo transcriptome assembly and SNP discovery in the wing polymorphic salt marsh beetle Pogonus chalceus (Coleoptera, Carabidae). PLoS One. 2012;7(8):e42605 10.1371/journal.pone.0042605 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Tang B, Chen J, Hou Y,Meng E. Transcriptome immune analysis of the invasive beetle Octodonta nipae (Maulik) (Coleoptera: Chrysomelidae) parasitized by Tetrastichus brontispae Ferriere (Hymenoptera: Eulophidae). PLoS One. 2014;9(3):e91482 10.1371/journal.pone.0091482 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Young MD, Wakefield MJ, Smyth GK,Oshlack A. Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biol. 2010;11(2):R14 10.1186/gb-2010-11-2-r14 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Sayadi A, Immonen E, Bayram H,Arnqvist G. The De Novo Transcriptome and Its Functional Annotation in the Seed Beetle Callosobruchus maculatus. PLoS One. 2016;11(7):e0158565 10.1371/journal.pone.0158565 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Zhang W, Song W, Zhang Z, Wang H, Yang M, Guo R, et al. Transcriptome analysis of Dastarcus helophoroides (Coleoptera: Bothrideridae) using Illumina HiSeq sequencing. PLoS One. 2014;9(6):e100673 10.1371/journal.pone.0100673 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Altermann E,Klaenhammer TR. PathwayVoyager: pathway mapping using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. BMC Genomics. 2005;6:60 10.1186/1471-2164-6-60 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Mao X, Cai T, Olyarchuk JG,Wei L. Automated genome annotation and pathway identification using the KEGG Orthology (KO) as a controlled vocabulary. Bioinformatics. 2005;21(19):3787–93. 10.1093/bioinformatics/bti430 . [DOI] [PubMed] [Google Scholar]
  • 25.Yan W, Liu L, Li CX, Huang SC, Ma ZL, Qin WQ, et al. Transcriptome sequencing and analysis of the coconut leaf beetle, Brontispa longissima. Genet Mol Res. 2015;14(3):8359–65. 10.4238/2015.July.28.2 . [DOI] [PubMed] [Google Scholar]
  • 26.Chen J, Li R, Xia Y, Bai G, Guo P, Wang Z, et al. Development of EST-SSR markers in flowering Chinese cabbage (Brassica campestris L. ssp. chinensis var. utilis Tsen et Lee) based on de novo transcriptomic assemblies. PLoS One. 2017;12(9):e0184736 10.1371/journal.pone.0184736 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Miao N, Zhang L, Li M, Fan L,Mao K. Development of EST-SSR markers for Taxillus nigrans (Loranthaceae) in southwestern China using next-generation sequencing. Appl Plant Sci. 2017;5(8). 10.3732/apps.1700010 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Chen Z, Chen J, Zhang X, Hou Y,Wang G. Development of Microsatellite Markers for the Nipa Palm Hispid Beetle, Octodonta nipae (Maulik). Can J Infect Dis Med Microbiol. 2018;2018:9139306 10.1155/2018/9139306 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Levy SB,Marshall B. Antibacterial resistance worldwide: causes, challenges and responses. Nat Med. 2004;10(12 Suppl):S122–9. 10.1038/nm1145 . [DOI] [PubMed] [Google Scholar]
  • 30.Alanis AJ. Resistance to antibiotics: are we in the post-antibiotic era? Arch Med Res. 2005;36(6):697–705. 10.1016/j.arcmed.2005.06.009 . [DOI] [PubMed] [Google Scholar]
  • 31.Fjell CD, Hiss JA, Hancock RE,Schneider G. Designing antimicrobial peptides: form follows function. Nat Rev Drug Discov. 2012;11(1):37–51. 10.1038/nrd3591 . [DOI] [PubMed] [Google Scholar]
  • 32.Nguyen LT, Haney EF,Vogel HJ. The expanding scope of antimicrobial peptide structures and their modes of action. Trends Biotechnol. 2011;29(9):464–72. 10.1016/j.tibtech.2011.05.001 . [DOI] [PubMed] [Google Scholar]
  • 33.Nijnik A,Hancock R. Host defence peptides: antimicrobial and immunomodulatory activity and potential applications for tackling antibiotic-resistant infections. Emerg Health Threats J. 2009;2:e1 10.3134/ehtj.09.001 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Tonk M,Vilcinskas A. The Medical Potential of Antimicrobial Peptides from Insects. Curr Top Med Chem. 2017;17(5):554–575. . [DOI] [PubMed] [Google Scholar]
  • 35.Park JW, Kim CH, Rui J, Park KH, Ryu KH, Chai JH, et al. Beetle immunity. Adv Exp Med Biol. 2010;708:163–80. . [DOI] [PubMed] [Google Scholar]
  • 36.Chipman DM,Sharon N. Mechanism of lysozyme action. Science. 1969;165(3892):454–65. . [DOI] [PubMed] [Google Scholar]
  • 37.Callewaert L,Michiels CW. Lysozymes in the animal kingdom. J Biosci. 2010;35(1):127–60. . [DOI] [PubMed] [Google Scholar]
  • 38.Fu GH, Bai ZY, Xia JH, Liu F, Liu P,Yue GH. Analysis of two lysozyme genes and antimicrobial functions of their recombinant proteins in Asian seabass. PLoS One. 2013;8(11):e79743 10.1371/journal.pone.0079743 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Vogel H, Badapanda C, Knorr E,Vilcinskas A. RNA-sequencing analysis reveals abundant developmental stage-specific and immunity-related genes in the pollen beetle Meligethes aeneus. Insect Mol Biol. 2014;23(1):98–112. 10.1111/imb.12067 . [DOI] [PubMed] [Google Scholar]
  • 40.Guo Y,He H. Identification and characterization of a goose-type lysozyme from sewage snail Physa acuta. Fish Shellfish Immunol. 2014;39(2):321–5. 10.1016/j.fsi.2014.05.029 . [DOI] [PubMed] [Google Scholar]
  • 41.Matsuyama K,Natori S. Purification of three antibacterial proteins from the culture medium of NIH-Sape-4, an embryonic cell line of Sarcophaga peregrina. J Biol Chem. 1988;263(32):17112–6. . [PubMed] [Google Scholar]
  • 42.Lambert J, Keppi E, Dimarcq JL, Wicker C, Reichhart JM, Dunbar B, et al. Insect immunity: isolation from immune blood of the dipteran Phormia terranovae of two insect antibacterial peptides with sequence homology to rabbit lung macrophage bactericidal peptides. Proc Natl Acad Sci U S A. 1989;86(1):262–6. . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Yi HY, Chowdhury M, Huang YD,Yu XQ. Insect antimicrobial peptides and their applications. Applied Microbiology and Biotechnology. 2014;98(13):5807–5822. 10.1007/s00253-014-5792-6 WOS:000337747900002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Yoon HS, Lee CS, Lee SY, Choi CS, Lee IH, Yeo SM, et al. Purification and cDNA cloning of inducible antibacterial peptides from Protaetia brevitarsis (Coleoptera). Arch Insect Biochem Physiol. 2003;52(2):92–103. 10.1002/arch.10072 . [DOI] [PubMed] [Google Scholar]
  • 45.Lee J, Han SY, Ji AR, Park JK, Hong IH, Ki MR, et al. Antimicrobial effects of coprisin on wounds infected with Staphylococcus aureus in rats. Wound Repair and Regeneration. 2013;21(6):876–882. 10.1111/wrr.12112 WOS:000326327100159. [DOI] [PubMed] [Google Scholar]
  • 46.van der Wel H,Loeve K. Isolation and characterization of thaumatin I and II, the sweet-tasting proteins from Thaumatococcus daniellii Benth. Eur J Biochem. 1972;31(2):221–5. . [DOI] [PubMed] [Google Scholar]
  • 47.Petre B, Major I, Rouhier N,Duplessis S. Genome-wide analysis of eukaryote thaumatin-like proteins (TLPs) with an emphasis on poplar. BMC Plant Biol. 2011;11:33 10.1186/1471-2229-11-33 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Vigers AJ, Roberts WK,Selitrennikoff CP. A new family of plant antifungal proteins. Mol Plant Microbe Interact. 1991;4(4):315–23. . [DOI] [PubMed] [Google Scholar]
  • 49.Sakamoto Y, Watanabe H, Nagai M, Nakade K, Takahashi M,Sato T. Lentinula edodes tlg1 encodes a thaumatin-like protein that is involved in lentinan degradation and fruiting body senescence. Plant Physiol. 2006;141(2):793–801. 10.1104/pp.106.076679 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Liu JJ, Sturrock R,Ekramoddoullah AK. The superfamily of thaumatin-like proteins: its origin, evolution, and expression towards biological function. Plant Cell Rep. 2010;29(5):419–36. 10.1007/s00299-010-0826-8 . [DOI] [PubMed] [Google Scholar]
  • 51.Narasimhan ML, Coca MA, Jin J, Yamauchi T, Ito Y, Kadowaki T, et al. Osmotin is a homolog of mammalian adiponectin and controls apoptosis in yeast through a homolog of mammalian adiponectin receptor. Mol Cell. 2005;17(2):171–80. 10.1016/j.molcel.2004.11.050 . [DOI] [PubMed] [Google Scholar]
  • 52.Tellam RL, Wijffels G,Willadsen P. Peritrophic matrix proteins. Insect Biochem Mol Biol. 1999;29(2):87–101. . [DOI] [PubMed] [Google Scholar]
  • 53.Du XJ, Wang JX, Liu N, Zhao XF, Li FH,Xiang JH. Identification and molecular characterization of a peritrophin-like protein from fleshy prawn (Fenneropenaeus chinensis). Mol Immunol. 2006;43(10):1633–44. 10.1016/j.molimm.2005.09.018 . [DOI] [PubMed] [Google Scholar]
  • 54.Huang Y, Ma F, Wang W,Ren Q. Identification and molecular characterization of a peritrophin-like gene, involved in the antibacterial response in Chinese mitten crab, Eriocheir sinensis. Dev Comp Immunol. 2015;50(2):129–38. 10.1016/j.dci.2015.01.002 . [DOI] [PubMed] [Google Scholar]
  • 55.Bjorstad A, Fu H, Karlsson A, Dahlgren C,Bylund J. Interleukin-8-derived peptide has antibacterial activity. Antimicrob Agents Chemother. 2005;49(9):3889–95. 10.1128/AAC.49.9.3889-3895.2005 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Gifford JL, Hunter HN,Vogel HJ. Lactoferricin: a lactoferrin-derived peptide with antimicrobial, antiviral, antitumor and immunological properties. Cell Mol Life Sci. 2005;62(22):2588–98. 10.1007/s00018-005-5373-z . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Patat SA, Carnegie RB, Kingsbury C, Gross PS, Chapman R,Schey KL. Antimicrobial activity of histones from hemocytes of the Pacific white shrimp. Eur J Biochem. 2004;271(23–24):4825–33. 10.1111/j.1432-1033.2004.04448.x . [DOI] [PubMed] [Google Scholar]
  • 58.Dorrington T, Villamil L,Gomez-chiarri M. Upregulation in response to infection and antibacterial activity of oyster histone H4. Fish Shellfish Immunol. 2011;30(1):94–101. 10.1016/j.fsi.2010.09.006 . [DOI] [PubMed] [Google Scholar]
  • 59.Tagai C, Morita S, Shiraishi T, Miyaji K,Iwamuro S. Antimicrobial properties of arginine- and lysine-rich histones and involvement of bacterial outer membrane protease T in their differential mode of actions. Peptides. 2011;32(10):2003–9. 10.1016/j.peptides.2011.09.005 . [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 File. Detailed description of the methods and supplementary tables.

(DOCX)

S2 File. The final assembly unigenes of Protaetia brevitarsis Lewis Larvae.

(FASTA)

Data Availability Statement

All relevant data are within the paper and its Supporting Information files. Clean reads data that support the findings of this study are available in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) database under the accession number PRJNA516097.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES