Abstract
Silk is an important natural fiber of high economic value, and thus genetic study of the silkworm is a major area of research. Transcriptome analysis can provide guidance for genetic studies of silk yield traits. In this study, we performed a transcriptome comparison using multiple silkworms with different silk yields. A total of 22 common differentially expressed genes (DEGs) were identified in multiple strains and were mainly involved in metabolic pathways. Among these, seven significant common DEGs were verified by quantitative reverse transcription polymerase chain reaction, and the results coincided with the findings generated by RNA sequencing. Association analysis showed that BGIBMGA003330 and BGIBMGA005780 are significantly associated with cocoon shell weight and encode uridine nucleosidase and small heat shock protein, respectively. Functional annotation of these genes suggest that these play a role in silkworm silk gland development or silk protein synthesis. In addition, we performed principal component analysis (PCA) in combination with wild silkworm analysis, which indicates that modern breeding has a stronger selection effect on silk yield traits than domestication, and imply that silkworm breeding induces aggregation of genes related to silk yield.
Keywords: silk yield, RNA-seq, multiple strains, PCA
1. Introduction
Domestic silkworm (Bombyx mori) is an economically important insect that was domesticated more than 5000 years ago. In China, the total direct income of silkworm farmers was 111.54 billion yuan in the past five years, the output value of enterprises above a designated size was 641.3 billion yuan, and the export value of real silk products was 16.6 billion US dollars [1]. In addition, silk fibroin has recently been used as a biomaterial and biomedicine for the development and utilization of bone tissue engineering, silk fibroin hydrogels, implant coatings, and nanomaterials [2,3,4,5,6]. The huge industrial value and broad application prospects of silk has thus prompted researchers to develop techniques to improve its yield. Today, traditional breeding methods have been unable to further improve silk quantity. It is thus imperative to use advanced molecular biology methods to identify genes that control silk yield.
Silk traits are quantitative traits that are regulated by multiple genes, and their genetic mechanisms are complex. Currently, quantitative trait loci (QTLs) for multiple cocoon quality traits have been mapped, laying the foundation for gene detection [7,8,9,10,11,12]. Our group previously developed a suitable localization method for the special genetic characteristics of silkworm and detected five loci or chromosomes that were related to cocoon shell weight (CSW) [13]. In addition to constructing QTL maps, several genes related to cocoon quality traits have also been identified using biological methods, such as genome sequencing. Li et al. detected a gene that was related to silk yield by combining a sequencing-based methodology and association analysis [14]. Although these investigations have largely promoted the study of cocoon quantity traits, no silk yield-related genes have been identified and functionally characterized to date.
The silk gland of silkworm, an organ that produces silk, is composed of the anterior silk gland, the middle silk gland (MG), and the posterior silk gland (PG). The middle silk gland and the posterior silk gland produce sericin and silk fibroin, respectively, and silk sericin encapsulates silk fibroin to form silk fibers. The development of the silk gland is closely related to silk yield. Using the silk gland as a material, the researchers screened genes related to silk gland development through transcriptomics and proteomics analyses. Li et al. performed transcriptome analysis using two strains with significant differences in silk yield, namely, Jingsong and Lan10, and detected 1375 DEGs related to silk yield. GO and KEGG analyses indicated that most of these genes are related to protein synthesis [15]. Domestic and wild silkworms also exhibit huge differences in silk yield. Fang et al. compared the transcriptome data of domestic and wild silkworms and found that genes related to protein secretion, metabolism, and tissue development were upregulated during domestication [16]. In addition, silk yield at the protein and micro-RNA levels have been investigated [17,18]. However, due to the fact that these studies use a single pair of strains and that genetic background largely differs among strains, it is possible that some of the screened genes are related to background differences. Therefore, it is essential to use multiple strains with significant differences in cocoon quantity traits for transcriptome analysis to avoid background differences.
We performed RNA sequencing (RNA-seq) using three high-yield and three low-yield strains. Comparisons were made between high- and low-yield strains to obtain nine groups of DEGs, and the number of DEGs varied from 8 to 261. GO and KEGG analyses were performed. To identify more relevant DEGs, we further considered the differential expression of genes between two high-yield strains and two low-yield strains as common DEGs (common DEGs). In addition, we performed Principal Component Analysis (PCA) using wild silkworm data from other investigations and data from this study. The results imply that silkworm breeding induces the aggregation of genes related to silk yield.
2. Results
2.1. Phenotypic Assessment of Cocoons of Various Silkworm Strains
This study used three high-yield strains (872B, Qiufeng, and Xiafang) and three low-yield strains (19-200, 10-710, and 19-460) with significantly different cocoon quantity traits (Figure 1). The three high-yield strains adopted in this study are excellent and practical strains, which are widely used in breeding [19,20,21], and the average CSW is about three times of the selected low-yield strains (Table 1). We investigated the traits of whole cocoon weight, CSW, and pupa weight of 12 silkworm strains, including sequenced strains of males and females (Table 1). Significant differences in cocoon traits were observed among strains. CSW was about four-fold higher in the high-yield strains than in the low-yield strains. In addition, the high-yield strains had two- to three-fold higher cocoon shell rates than the low-yield strains. Silk glands showed different development rates among strains and developmental stages. The developmental rate of the silk glands of strain 19-200 peaked on the day before mounting (Figure 2). We collected samples at this time point to ensure that the majority of genes associated with silk traits were upregulated. Furthermore, different silkworm silk glands have different lengths of development. The accurate selection of the dissection time-point is thus critical to ensure that the silk glands of each strain are at the same stage of development. Therefore, to ensure accuracy of the sequencing data, the fifth instar (5.5 days) of strain 19-200 was used as the standard, and its length was then used in calculating the dissection time-point (Table 2) of each strain.
Table 1.
Strain | Whole Cocoon Weight (g) | Cocoon Shell Weight (g) | Cocoon Shell Rate | Pupa Weight (g) | ||||
---|---|---|---|---|---|---|---|---|
Males | Females | Males | Females | Males | Females | Males | Females | |
19-460 | 0.579 ± 0.016 | 0.772 ± 0.032 | 0.074 ± 0.003 | 0.082 ± 0.005 | 0.127 ± 0.004 | 0.106 ± 0.004 | 0.505 ± 0.014 | 0.691 ± 0.029 |
19-450 | 0.611 ± 0.03 | 0.75 ± 0.057 | 0.078 ± 0.005 | 0.082 ± 0.007 | 0.128 ± 0.006 | 0.109 ± 0.005 | 0.532 ± 0.027 | 0.668 ± 0.051 |
10-710 | 0.656 ± 0.029 | 0.848 ± 0.035 | 0.08 ± 0.005 | 0.084 ± 0.004 | 0.123 ± 0.004 | 0.099 ± 0.005 | 0.575 ± 0.025 | 0.763 ± 0.034 |
05-036 | 0.626 ± 0.024 | 0.799 ± 0.051 | 0.112 ± 0.006 | 0.122 ± 0.011 | 0.179 ± 0.005 | 0.152 ± 0.006 | 0.514 ± 0.018 | 0.677 ± 0.041 |
19-200 | 0.714 ± 0.038 | 0.928 ± 0.07 | 0.103 ± 0.009 | 0.118 ± 0.016 | 0.143 ± 0.008 | 0.126 ± 0.009 | 0.612 ± 0.031 | 0.81 ± 0.055 |
HB | 1.115 ± 0.038 | 1.338 ± 0.083 | 0.276 ± 0.012 | 0.276 ± 0.022 | 0.248 ± 0.006 | 0.206 ± 0.007 | 0.839 ± 0.03 | 1.062 ± 0.064 |
Qiubai | 1.296 ± 0.038 | 1.565 ± 0.05 | 0.304 ± 0.013 | 0.315 ± 0.012 | 0.235 ± 0.009 | 0.201 ± 0.006 | 0.992 ± 0.034 | 1.25 ± 0.043 |
Qiufeng | 1.233 ± 0.05 | 1.482 ± 0.065 | 0.308 ± 0.013 | 0.326 ± 0.013 | 0.25 ± 0.008 | 0.22 ± 0.009 | 0.925 ± 0.042 | 1.157 ± 0.059 |
7532 | 1.349 ± 0.111 | 1.643 ± 0.036 | 0.312 ± 0.011 | 0.329 ± 0.012 | 0.232 ± 0.015 | 0.2 ± 0.004 | 1.037 ± 0.106 | 1.315 ± 0.026 |
872B | 1.307 ± 0.034 | 1.558 ± 0.045 | 0.329 ± 0.009 | 0.331 ± 0.016 | 0.252 ± 0.009 | 0.213 ± 0.006 | 0.978 ± 0.036 | 1.227 ± 0.033 |
871B | 1.379 ± 0.038 | 1.806 ± 0.066 | 0.364 ± 0.007 | 0.428 ± 0.023 | 0.264 ± 0.006 | 0.237 ± 0.006 | 1.016 ± 0.034 | 1.377 ± 0.045 |
Xiafang | 1.488 ± 0.066 | 1.796 ± 0.056 | 0.375 ± 0.014 | 0.391 ± 0.01 | 0.252 ± 0.008 | 0.218 ± 0.004 | 1.113 ± 0.057 | 1.404 ± 0.048 |
Table 2.
Strain | Time of the Fifth Instar (days) |
The Time Point of Silk Gland Dissected (day) |
---|---|---|
10-710 | 7.5 | 6.5 |
19-200 | 6.5 | 5.5 |
19-460 | 5.5 | 4.5 |
872B | 8.5 | 7.5 |
Xiafang | 7.5 | 6.5 |
Qiufeng | 8.0 | 7.0 |
2.2. Overview of Transcriptome Sequencing Data
For each strain, we randomly selected three larvae, from which the entire silk gland was isolated and RNA was extracted. RNA sequencing was performed using the Illumina HiSeqTM 4000 system [22]. After filtering the data for adaptor sequences, unknown sequences (N), and low-quality reads, 45.9 gigabases (Gb), 46.8 Gb, and 45.1 Gb, 46.5 Gb, 44.7G, and 46.9G of clean reads were obtained for strains, 10-710, 19-200, 19-460, 872B, Xia Fang, and Qiu Bai, respectively. We compared the clean reads with the silkworm reference genome, and the percentage of total reads in the six transcriptome databases ranged from 52.1% to 72.3% (Table 3). The average number of clean reads was about 46 million. Among all strains, 29.6%–40.84% of the clean reads were aligned to about 9,000 genes, of which strain 872B had the highest number of genes, in which 9,397 reads could be aligned to the silkworm reference genome (Table S1). The number of new transcripts for each strain ranged from 842 to 981 (Table S2).
Table 3.
Strain | Number of Raw Reads | Number of Clean Reads | Number of Clean Bases | Q20 (%) | GC Content (%) |
---|---|---|---|---|---|
10-710 | 54,026,974 | 45,861,944 | 6,879,291,600 | 96.14 | 50.69 |
19-200 | 54,023,574 | 46,787,486 | 7,018,122,900 | 97.07 | 49.31 |
19-460 | 54,024,582 | 45,123,438 | 6,768,515,700 | 96.51 | 51.19 |
872B | 55,663,714 | 46,508,788 | 6,976,318,200 | 96.19 | 51.96 |
Xiafang | 56,894,572 | 44,654,180 | 6,698,127,000 | 95.78 | 52.37 |
Qiufeng | 57,300,696 | 46,940,002 | 7,041,000,300 | 95.92 | 53.00 |
2.3. Differential Expressed Gene Function Annotation and Functional Analysis
The expression of all genes was estimated using the RPKM (reads per kb per million reads) method [23]. The following comparison strategy for screening DEGs was adopted in this study (Figure 3). By comparing the three high-yield strains with the three low-yield strains, nine groups of DEGs were obtained. The edgeR function in Bioconductor was employed for differential expression analysis [24]. All the DEGs are presented in Table 4. The number of DEGs in each group varied from 8 to 261 (Table S3). The gene sequences were aligned to the NR (Non-Redundant Protein Sequence Database) library using BLAST to extract DEG annotation information. The topGO function in the software, Bioconductor, was employed for DEG enrichment analysis [25].
Table 4.
Gene ID | Chromosome | Regulation (High-/Low-Yield Strains) |
Gene Annotation |
---|---|---|---|
BGIBMGA001700 | 11 | Down | Ubiquitin-related modifier 1 homolog [Bombyx mori] |
BGIBMGA001816 | 11 | Up | Uncharacterized protein LOC101745939 [Bombyx mori] |
BGIBMGA002441 | 9 | Up | gi|512926587|ref|XP_004931123.1|/2.3936e-58/PREDICTED: uncharacterized protein LOC101741881 [Bombyx mori] |
BGIBMGA002629 | 5 | Down | gi|237648976|ref|NP_001153665.1|/4.06608e-85/odorant binding protein LOC100301497 precursor [Bombyx mori] |
BGIBMGA005780 | 5 | Up | gi|512896232|ref|XP_004923862.1|/2.0943e-140/PREDICTED: protein lethal (2) essential for life-like [Bombyx mori] |
BGIBMGA014291 | 5 | Down | gi|827563795|ref|XP_004933964.2|/6.52866e-100/PREDICTED: alcohol dehydrogenase-related 31 kDa protein-like, partial [Bombyx mori] |
BGIBMGA003230 | 2 | Up | gi|930671460|gb|KPJ12187.1|/0/RNA-directed RNA polymerase L [Papilio machaon] |
BGIBMGA003330 | 15 | Up | gi|512892835|ref|XP_004923030.1|/0/PREDICTED: probable uridine nucleosidase 2 [Bombyx mori] |
BGIBMGA004397 | 20 | Down | gi|525342977|ref|NP_001266309.1|/0/low molecular mass 30 kDa lipoprotein 19G1-like precursor [Bombyx mori] |
BGIBMGA004399 | 20 | Down | gi|379046488|gb|AFC87805.1|/0/30K protein 7 [Bombyx mori] |
BGIBMGA004614 | 27 | Up | gi|827548595|ref|XP_012546487.1|/0/PREDICTED: heat shock protein 70 A2-like [Bombyx mori] |
BGIBMGA007353 | 3 | Up | gi|827541356|ref|XP_012543946.1|/0/PREDICTED: uncharacterized protein LOC105841300 [Bombyx mori] |
BGIBMGA008677 | 7 | Up | gi|913296126|ref|XP_013183364.1|/9.54277e-117/PREDICTED: uncharacterized protein LOC106129371 [Amyelois transitella] |
BGIBMGA008712 | 7 | Up | gi|827546128|ref|XP_012545526.1|/0/PREDICTED: Bardet-Biedl syndrome 5 protein homolog [Bombyx mori] |
BGIBMGA010133 | 7 | Down | gi|512927483|ref|XP_004931346.1|/0/PREDICTED: regucalcin-like [Bombyx mori] |
BGIBMGA009092 | 3 | Down | gi|827031951|gb|AKJ54535.1|/4.53413e-66/fungal protease inhibitor [Bombyx mori] |
BGIBMGA009093 | 3 | Down | gi|512898429|ref|XP_004924398.1|/1.63675e-30/PREDICTED: zonadhesin-like isoform X1 [Bombyx mori] |
BGIBMGA010892 | 22 | Down | gi|827542143|ref|XP_012544229.1|/4.51314e-92/PREDICTED: zonadhesin-like isoform X1 [Bombyx mori] |
BGIBMGA013130 | 16 | Up | gi|512909270|ref|XP_004926880.1|/0/PREDICTED: glucosylceramidase-like [Bombyx mori] |
BGIBMGA013477 | 6 | Up | gi|512908146|ref|XP_004926609.1|/0/PREDICTED: synaptic vesicle glycoprotein 2C-like [Bombyx mori] |
For functional annotation of the DEGs, GO enrichment analysis was performed. GO terms with a corrected p < 0.05 were considered significantly enriched with DEGs. GO functional enrichment analysis of DEGs of the six strains revealed significant enrichment in the categories of biological process and molecular function. Further assessment revealed that the annotation information of nine DEGs were similar. In the functional category of biological process, significant DEG enrichment between the high- and low-yield strains was observed for the subcategories of cellular process, metabolic process, and single-organism process. For the functional category of cellular components, DEG enrichment was also detected in the subcategories of extracellular and extracellular regions. In terms of molecular function, DEG enrichment was observed in the subcategories of binding and catalytic activity (Figure S1). BLAST analysis of the DEGs to the KEGG database was performed to obtain annotation information relating to pathways [26]. Based on the resulting list of DEGs, a hypergeometric test was used to perform pathway enrichment analysis of DEGs, which showed that the endoplasmic reticulum pathway in protein processing of metabolic pathways was enriched with DEGs between the high-and low-yield silkworms (Figure S2).
2.4. Quantitative Verification of DEGs and Multistrain Association Analysis
Figure 4 shows the DEGs identified from the comparison of multiple groups. Twenty genes that were differentially expressed in more than four of the nine groups of DEGs were defined as common DEGs (Table 4), which included nine genes that were upregulated in low-yield strains, and 11 genes that were upregulated in high-yield strains. Five of the 20 common DEGs were detected in at least five groups of BLAST data.
We validated five significant common DEGs in the MGs and PGs of the strains that were used for sequencing by qPCR (Figure 5), and the results coincided with the RNA-seq data. BGIBMGA004399, BGIBMGA009092, and BGIBMGA009093 genes in the MGs, and BGIBMGA005780 and BGIBMGA003330 genes in the PGs were differentially expressed between the high- and low-yield strains. We further analyzed the association between the five significant common DEGs and silk yield traits in the 12 strains with different silk yields by qPCR (Figure S3 and Figure 6), which indicated that the expression of BGIBMGA003330 and BGIBMGA005780 in the silk gland is closely related to the silk fibroin generated (Figure 6).
2.5. PCA of Wild Silkworm, Local Strains, and Practical Strains
Cocoon quality traits undergo two stages of selection, namely, the domestication process from wild silkworm strains to local strains, and the breeding process from local species to practical species. PCA was employed to detect differences in gene expression patterns among various strains by analyzing similarities in major strain components. Thus, differences in selection for silkworm cocoon quality traits in these two processes can be inferred. In this study, the data of three local strains and three practical strains were combined with two wild silkworm data from other studies [16] to conduct PCA. The results showed that the main components of the two wild silkworm strains and the three local strains were distributed, whereas those of the practical strains were clustered together (Figure 7). Compared to the domestication process, modern breeding imparts stronger selection pressure on genes that are related to silk yield, so that silkworms of different high-yield strains exhibit similar expression patterns. The common genes that are highly expressed in the three high-yield silkworm strains are likely to be important genes that control silk yield. Based on these results, we identified genes that are highly expressed in all three high-yield strains, followed by functional annotation (Table 4). Most of these genes are ribosomal proteins and structural proteins, which presumably play an important role in the biosynthesis of silk protein.
3. Discussion
In this study, multiple silkworm strains with different cocoon colors were used as research materials. By screening the common DEGs, the background interference caused by the particularity of a single strain, such as the genes controlling cocoon color, could be better avoided, making the results more accurate and reliable. RNA-seq analysis of various silkworm strains identified up to 261 DEGs in each pair of samples, and 22 common DEGs were screened. These common DEGs mostly function in energy metabolism and protein synthesis. The results suggest that the energy metabolic rate is related to silk gland development. In addition, protein synthesis affects silk protein synthesis efficiency, thus affecting silkworms’ silk production.
Common DEGs were identified by association analysis, and gene BGIBMGA003330 and BGIBMGA005780 were significantly associated with CSW. From the results of quantitative verification, we can see that the expression levels of genes BGIBMGA003330 and BGIBMGA005780 presented significant differences between the high-yield and low-yield strains in the PGs. The PGs are used to produce silk fibroin. Silk sericin, wrapped in the outer layer of silk fibroin to protect and bind it, is water-soluble and soluble in water, acid, and alkali solutions. However, silk fibroin only swells in water, and is insoluble in water. In the process of production and utilization, silk fibroin is separated by sericin melting during the process of filature, which is the main material for silk products. It can be seen that the increase of silk fibroin production is the core of the increase of silk yield. We speculate that BGIBMGA003330 and BGIBMGA005780 may be related to silk fibroin synthesis. Genes BGIBMGA003330 and BGIBMGA005780 encode uridine nucleosidase and small heat shock protein (sHSP), respectively. Uridine nucleosidase plays a fundamental role in the interconversion of pyrimidine nucleotides and in the reutilization of pyrimidine nucleosides. Uridine nucleosidase is used to decompose uridine into uracil and ribose [27], then uracil is incorporated into the mononucleotide, UMP (Uridine monophosphate) [28]. UMP is the composition of nucleic acid involved in the basic life activities of organisms, such as heredity, development, and growth [29]. Ribose, of fundamental importance in the pyrimidine nucleotides’ biosynthesis from exogenously supplied or endogenously formed bases and nucleosides, is involved in the biosynthesis of storage compounds and the synthesis of structural building compounds [30]. The role of this gene in development and protein synthesis may be related to the development of silk glands or silk protein synthesis, thereby affecting the silk yield.
Small heat shock protein is a molecular chaperone with strong anti-aggregation properties. Previous studies have shown that sHSP enhances heat tolerance [31], protects cells from stress [32], prevents protein misfolding [33], restores the natural concept of unfolded or partially folded peptides [34], alters mitochondrial metabolism [35], controls mitochondria protein quality to extend the lifespan [36], influences the rate of embryonic development, and prevents spontaneous diapause [37]. sHSP are not only expressed independently in the heat shock response, but also play an important role in some animal development processes, which may be related to silk gland development, such as regulating cell movement [38]; regulating intermediate filament assembly [39]; inhibiting apoptotic signaling [40]; binding to certain kinases to activate and protect them from heat inactivation [41]; and allowing cancerous cells to escape the immunosurveillance mediated by death ligands [42]. There is no evidence related to silk gland development in the current report, but in our study, it was found that this gene is significantly associated with the silkworm CSW. Moreover, there is also evidence that sHSP maintains protein structure and enhance the growth of tumors in vivo [42]. So, it is also possible that these may also have other functions that are related silk gland development that have yet to be discovered.
Besides, the common DEGs, BGIBMGA004399 and BGIBMGA008165, encode a 30K protein that is involved in embryonic development [43], energy storage [44], and the immune response [45] in the silkworm, plays an important role in energy metabolism, and contributes to inhibiting apoptosis [46]. We postulate that these genes affect the rate of silk protein synthesis by regulating energy metabolism, thereby affecting silk yield traits.
In this study, PCA results attracted our attention. PCA indicated that low-yield strains significantly vary in terms of gene expression patterns, whereas high-yield strains have highly similar expression patterns (Figure 6). Previous studies have shown that the linkage groups that control the QTLs for different parents overlap [47,48,49,50], suggesting that unlike the long domestication process, modern genetic breeding has a strong selective effect on silk yield traits. The rapid accumulation of superior genes for cocoon traits or the generation of new genes that increase yield can lead to higher breeding values in the short term. Genes that are highly expressed in the three high-yield strains are likely to be important genes that control silk yield. We have identified that genes that are highly expressed in the three high-yield strains are relatively expressed at low levels in the three low-yield strains, and were functionally annotated. Most of these genes are ribosomal proteins and structural proteins, which may play an important role in the biosynthesis of silk protein.
In summary, we have described an approach to circumvent the effect of background differences using multi-strand RNA-seq alignments to more accurately screen for genes related to silk yield. The differentially expressed genes identified in this study may serve as the object of the functional study, and it is necessary to deeply study its function and the mechanism of affecting silk yield.
4. Materials and Methods
4.1. Silkworm Breeding and Sample Preparation
The selected materials were six silkworm strains for RNA-seq, including three high-yield strains (Qiufeng, Xiafang, and 872B), and three low-yield strains (10-710, 19-200, and 19-460), and 12 strains (Table 1) with different cocoon traits and silk yields for correction analysis. The silkworms were obtained from the silkworm resource bank of Southwest University, Chongqing, China. The larvae were fed fresh mulberry leaves and maintained at conditions of sTable 14 h light and 10 h dark photoperiod at 25 ± 1 °C and 75% ± 3% RH. Intact silk glands were dissected and frozen immediately in liquid nitrogen and then stored at −80 °C for sequencing. In this study, three samples were prepared as biological replicates.
The silkworm cocoons of different strains were collected for phenotypic assessment, and these silkworms were reared at the same time and in the same conditions. The cocoons of the low-yield silkworms (19-460, 10-710, and 19-200) were much smaller than the high-yield strains (872B, Qiufeng, and Xiafang). The statistical results of CSW show significant differences between the low- and high-yield strains.
4.2. RNA Library Construction and Sequencing
Total RNA was extracted using TRIzol reagent according to the manufacturer’s recommendations (Invitrogen, Burlington, ON, Canada), and DNA was digested with DNase I. Eukaryotic mRNAs are enriched with oligo(dT) magnetic beads. Using a thermomixer, the mRNAs were sheared into short fragments under the appropriate temperature, and a strand of cDNAs were synthesized using the interrupted mRNAs as template. Then, a two-stranded synthesis reaction system was used to synthesize the two-stranded cDNA, and kits were used for purification and recovery, cohesive end-repair, adding the base “A” to the 3′ end of the cDNAs, and attaching the adapter, followed by fragment size selection and PCR amplification. The constructed library was qualified with the Agilent 2100 Bioanalyzer and the ABI StepOne Plus Real-Time PCR system and then sequenced using the Illumina HiSeqTM 4000. The data from the Illumina HiSeqTM 4000 sequence were designated as raw reads or raw data [22], which were then subjected to quality control (QC) to determine that the sequencing data is suitable for subsequent analysis. After QC, the raw reads were filtered to obtain clean reads, and a SOAP aligner/SOAP2 [51] was used to compare the clean reads to the reference sequence. A BLAST search was then performed to determine the distribution and coverage of reads on the reference sequence to assess whether the comparison results passed the second QC of alignment, thereby ensuring that subsequent analysis is based on high-quality clean reads.
Reference sequence and gene model annotations were downloaded from the Silkworm Genome Database (SilkDB; http://www.silkdb.org/silkdb/) [52]. We compared the clean data of each sample with the reference gene set of the silkworm using the short reads software, SOAPaligner/SOAP2 [51], and compared the clean data of each sample with the reference genome using TopHat (http://ccb.jhu.edu/software/tophat/index.shtml) [53]. The results were then used in the analyses for alternative splicing and prediction of new transcripts.
4.3. Quantitative and Differential Analysis of Transcripts
We used the reads per kilobase per million reads (RPKM) [23] method to estimate gene expression levels. The differential expression of two samples was screened using the edgeR [24] function in Bioconductor. The edgeR function assumes that the count of sequencing reads is a negative binomial distribution for each gene. Hypothesis testing was performed based on this theoretical distribution. The differentially expressed gene screening conditions were set to FDR ≤ 0.05 |log2ratio| ≥ 1 [54]. Genes with similar expression patterns usually have similar functions. To analyze gene expression patterns, we used the R software and used the Euclidean distance as the distance calculation formula to perform hierarchical clustering of DEGs and experimental conditions.
4.4. Differential Gene Function Annotation and Functional Analysis
We obtained the reference sequence for all silkworm genes from the NCBI database and used BLASTX for protein annotation. Then, we employed the BLAST software to align the gene sequences to the NR library and extracted the GO annotation information for all genes from the Gene Ontology database (http:/www.geneontology.org/). Based on the list of DEGs, we performed GO enrichment analysis of DEGs using the topGO function in the software, Bioconductor [25]. The KEGG analysis [26] involves a BLAST search of genes in the KEGG database to annotate pathway information on the DEGs. Then, based on the list of DEGs obtained, the pathway enrichment analysis of DEGs was performed using hypergeometric testing to find the pathway that was significantly enriched with DEGs compared to the entire genomic background.
4.5. Expression Quantity Identification of DEGs
To verify RNA-seq data, we used real-time quantitative PCR (qPCR) to identify DEGs for expression quantity identification. Intact silk glands were dissected according to the timetable for sample preparation. In addition, other tissues of the fifth instar 3-day larvae of 19-200 and 872B, including the head, epidermis, midgut, Malpighian tubule, fat body, hemolymph, middle silk gland, posterior silk gland, trachea plexus, testis, nerves, and ovaries, were dissected for tissue expression identification of DEGs. Total RNA was extracted using the RNApure ultrapure total RNA Rapid Extraction Kit (BioTek) strictly following the manufacturer’s instructions. Ultraviolet spectrophotometry was used to determine RNA integrity and purity. The cDNA was extracted using the PrimeScript RT reagent kit with a gDNA Eraser kit, and cDNA purity was assessed by PCR using primers specific to Actin3 (A3), a silkworm housekeeping gene. The CDS sequences of the confirmed genes were extracted from the Silkworm Genome Database (http://www.silkdb.org/silkdb/) [42] and used in designing primers to span at least one intron. qPCR was performed using the CFX96TM Real-Time PCR Detection System (Bio-Rad, Hercules, CA, USA) and SYBR Green qPCR Mix (Bio-Rad) reagents. qPCR was conducted in a reaction volume of 10 μL containing 1 μL of the template, 5 μL of 2 × SYBR Green II, 5 μL of ddH2O, and 0.3 μL of the specific primers. The PCR conditions were as follows: Pre-denaturation at 95 °C for 30 s; followed by 40 cycles of denaturation at 95 °C for 5 s and annealing at 60 °C for 30 s; and a melting curve at 65 °C for 5 s, with a stepwise temperature increase of 0.5 °C until 95 °C. The volume of the reaction system was 10 μL, and three technical replicates were used per sample. The expression levels of each gene of the two strains were compared based on a t-test. Differences in gene expression between high- and low-yield strains were considered significantly different at p < 0.05.
Abbreviations
DEGs | Differentially expressed genes |
PCA | Principal component analysis |
QTL | Quantitative trait loci |
CSW | Cocoon shell weight |
Supplementary Materials
Supplementary materials can be found at http://www.mdpi.com/1422-0067/19/12/3718/s1.
Author Contributions
Methodology, Y.L. and F.D.; software, Y.L. and R.G.; validation, Y.L., C.L. and F.D.; formal analysis, Y.L. and W.Z.; investigation, Y.L. and H.Z.; resources, Y.L., H.H. and F.D.; data curation, Y.L. and W.Z.; writing—original draft preparation, Y.L.; writing—review and editing, F.D., C.L. and X.T.; visualization, Y.L. and M.H.; supervision, F.D. and C.L.; project administration, F.D., please turn to the CRediT taxonomy for the term explanation. Authorship must be limited to those who have contributed substantially to the work reported.
Funding
This research was funded by the National Natural Science Foundation of China, grant number No. 31830094, No. 31472153; the Hi-Tech Research and Development 863 Program of China Grant, grant number No. 2013AA102507; Funds of China Agriculture Research System, grant number No. CARS-18-ZJ0102.
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1.Commerce Department The development program of cocoon silk industry for the 13th five-year plan. Guangxi Seric. 2017;54:1–4. doi: 10.19553/j.cnki.1006-1657.2017.02.001. [DOI] [Google Scholar]
- 2.Cheung H.Y., Lau K.T., Tao X.M., Hui D. A potential material for tissue engineering: Silkworm silk/PLA biocomposite. Compos. Part B. 2008;39:1026–1033. doi: 10.1016/j.compositesb.2007.11.009. [DOI] [Google Scholar]
- 3.Mottaghitalab F., Hosseinkhani H., Shokrgozar M.A., Mao C., Yang M., Farokhi M. Silk as a potential candidate for bone tissue engineering. J. Control. Release. 2015;215:112–128. doi: 10.1016/j.jconrel.2015.07.031. [DOI] [PubMed] [Google Scholar]
- 4.Guziewicz N., Best A., Perez-Ramirez B., Kaplan D.L. Lyophilized Silk Fibroin Hydrogels for the Sustained Local Delivery of Therapeutic Monoclonal Antibodies. Biomaterials. 2011;32:2642–2650. doi: 10.1016/j.biomaterials.2010.12.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Wang C., Wang S., Yang Y., Jiang Z., Deng Y., Song S., Yang W., Chen Z.G. Bioinspired, biocompatible and peptide-decorated silk fibroin coatings for enhanced osteogenesis of bioinert implant. J. Biomater. Sci. Polym. Ed. 2018;29:1595–1611. doi: 10.1080/09205063.2018.1477316. [DOI] [PubMed] [Google Scholar]
- 6.Kharlampieva E., Kozlovskaya V., Gunawidjaja R., Shevchenko V.V., Vaia R., Naik R.R., Kaplan D.L., Tsukruk V.V. Flexible Silk—Inorganic Nanocomposites: From Transparent to Highly Reflective. Adv. Funct. Mater. 2010;20:840–846. doi: 10.1002/adfm.200901774. [DOI] [Google Scholar]
- 7.Shi J., Heckel D.G., Goldsmith M.R. A genetic linkage map for the domesticated silkworm, Bombyx mori, based on restriction fragment length polymorphism. Genet. Res. 1995;66:109–126. doi: 10.1017/S0016672300034467. [DOI] [Google Scholar]
- 8.Yasukochi Y. A dense genetic map of the silkworm, Bombyx mori, covering all chromosomes based on 1018 molecular markers. Genetics. 1998;150:1513–1525. doi: 10.1093/genetics/150.4.1513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Mach V., Takiya S., Ohno K., Handa H., Imai T., Suzuki Y. Silk gland factor-1 involved in the regulation of Bombyx sericin-1 gene contains fork head motif. J. Biol. Chem. 1995;270:9340–9346. doi: 10.1074/jbc.270.16.9340. [DOI] [PubMed] [Google Scholar]
- 10.Miao X.-X., Xu S.-J., Li M.-H., Li M.-W., Huang J.-H., Dai F.-Y., Marino S.W., Mills D.R., Zeng P., Mita K., et al. Simple sequence repeat-based consensus linkage map of Bombyx mori. Proc. Natl. Acad. Sci. USA. 2005;102:16303–16308. doi: 10.1073/pnas.0507794102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Lie Z., Min Q., Fang-Yin D., Ai-Chun Z., Cheng L. A dense linkage map of the silkworm (Bombyx mori) based on AFLP markers. Acta Entomol. Sin. 2008;51:246–257. [Google Scholar]
- 12.Zhan S., Huang J., Guo Q., Zhao Y., Li W., Miao X., Goldsmith M.R., Li M., Huang Y. An integrated genetic linkage map for silkworms with three parental combinations and its application to the mapping of single genes and QTL. BMC Genom. 2009;10:389. doi: 10.1186/1471-2164-10-389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Li C., Zuo W., Tong X., Hu H., Qiao L., Song J., Xiong G., Gao R., Dai F., Lu C. A composite method for mapping quantitative trait loci without interference of female achiasmatic and gender effects in silkworm, Bombyx mori. Anim. Genet. 2015;46:426–432. doi: 10.1111/age.12311. [DOI] [PubMed] [Google Scholar]
- 14.Li C., Tong X., Zuo W., Luan Y., Gao R., Han M., Xiong G., Gai T., Hu H., Dai F., et al. QTL analysis of cocoon shell weight identifes BmRPL18 associated with silk protein synthesis in silkworm by pooling sequencing. Sci. Rep. 2017;7:17985. doi: 10.1038/s41598-017-18277-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Li J., Qin S., Yu H., Zhang J., Liu N., Yu Y., Hou C., Li M. Comparative Transcriptome Analysis Reveals Different Silk Yields of Two Silkworm Strains. PLoS ONE. 2016;11:e0155329. doi: 10.1371/journal.pone.0155329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Fang S.-M., Hu B.-L., Zhou Q.-Z., Yu Q.-Y., Zhang Z. Comparative analysis of the silk gland transcriptomes between the domestic and wild silkworms. BMC Genom. 2015;6:60. doi: 10.1186/s12864-015-1287-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wang S.-H., You Z.-Y., Ye L.-P., Che J., Qian Q., Nanjo Y., Komatsu S., Zhong B.-X. Quantitative Proteomic and Transcriptomic Analyses of Molecular Mechanisms Associated with Low Silk Production in Silkworm Bombyx mori. J. Proteome Res. 2014;13:735–751. doi: 10.1021/pr4008333. [DOI] [PubMed] [Google Scholar]
- 18.Qin S., Danso B., Zhang J., Li J., Liu N., Sun X., Hou C., Luo H., Chen K., Zhang G., et al. MicroRNA profile of silk gland reveals different silk yields of three silkworm strains. Gene. 2018;653:1–9. doi: 10.1016/j.gene.2018.02.019. [DOI] [PubMed] [Google Scholar]
- 19.Xiang Z., Zhu Y., Lin Y., Chen P., Zhang M., Yin Z., Chen Z. The breeding and promotion of summer-autumn silkworm strains Xiafang × Qiubai. Newsl. Seric. Sci. 1995;2:2–6. [Google Scholar]
- 20.He X., Zhou J., Huang Y., Wang Y. Analysis on the Results of Laboratory Identification of “Qiufeng × Baiyu” over the Years. Bull. Seric. 2010;41:17–22. [Google Scholar]
- 21.Yang B. The breeding of 871 x 872 silkworm in autumn. Newsl. Seric. 2006;26:29–31, 45. [Google Scholar]
- 22.Cock P.J., Fields C.J., Goto N., Heuer M.L., Rice P.M. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Res. 2010;38:1767–1771. doi: 10.1093/nar/gkp1137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Mortazavi A., Williams B.A., McCue K., Schaeffer L., Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods. 2008;5:621–628. doi: 10.1038/nmeth.1226. [DOI] [PubMed] [Google Scholar]
- 24.Robinson M.D., McCarthy D.J., Smyth G.K. edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Abdi H. The Bonferonni and Šidák Corrections for Multiple Comparisons. Encycl. Meas. Stat. 2007;3:103–107. [Google Scholar]
- 26.Kanehisa M., Araki M., Goto S., Hattori M., Hirakawa M., Itoh M., Katayama T., Kawashima S., Okuda S., Tokimatsu T., et al. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 2008;36:D480–D484. doi: 10.1093/nar/gkm882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Morrow G., Kim H.J., Pellerito O., Bourrelle-Langlois M., Le Pécheur M., Groebe K., Tanguay R.M. Changes in Drosophila mitochondrial proteins following chaperone-mediated lifespan extension confirm a role of Hsp22 in mitochondrial UPR and reveal a mitochondrial localization for cathepsin D. Mech. Ageing Dev. 2016;155:36–47. doi: 10.1016/j.mad.2016.02.011. [DOI] [PubMed] [Google Scholar]
- 28.Magni G., Fioretti E., Ipata P.L., Natalini P. Bakers’ yeast uridine nucleosidase. Purification, composition, and physical and enzymatic properties. J. Biol. Chem. 1975;250:9–13. [PubMed] [Google Scholar]
- 29.Magni G., Pallotta G., Natalini P., Ruggieri S., Santarelli I., Vita A. Inactivation of uridine nucleosidase in yeast. Purification and properties of an inactivating protein. J. Biol. Chem. 1978;253:2501–2503. [PubMed] [Google Scholar]
- 30.Quan G., Duan J., Fick W., Kyei-Poku G., Candau J.N. Expression profiles of 14 small heat shock protein (sHSP) transcripts during larval diapause and under thermal stress in the spruce budworm, Choristoneura fumiferana (L.) Cell Stress Chaperones. 2018 doi: 10.1007/s12192-018-0931-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Raggi-Ranieri M., Ipata P.L. Partial purification and properties of uridine nucleosidase from Baker’s yeast. Ital. J. Biochem. 1971;20:27–44. [PubMed] [Google Scholar]
- 32.Gong C.S., Claypool T.A., McCracken L.D., Maun C.M., Ueng P.P., Tsao G.T. Conversion of pentoses by yeasts. Biotechnol. Bioeng. 1983;25:85–102. doi: 10.1002/bit.260250108. [DOI] [PubMed] [Google Scholar]
- 33.He L.H., Chen J.Y., Kuang J.F., Lu W.J. Expression of three sHSP genes involved in heat pretreatment-induced chilling tolerance in banana fruit. J. Sci. Food Agric. 2012;92:1924–1930. doi: 10.1002/jsfa.5562. [DOI] [PubMed] [Google Scholar]
- 34.Ehrnsperger M., Gräber S., Gaestel M., Buchner J. Binding of non-native protein to Hsp25 during heat shock creates a reservoir of folding intermediates for reactivation. EMBO J. 1997;16:221–229. doi: 10.1093/emboj/16.2.221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Carra S., Rusmini P., Crippa V., Giorgetti E., Boncoraglio A., Cristofani R., Naujock M., Meister M., Minoia M., Kampinga H.H., et al. Different anti-aggregation and pro-degradative functions of the members of the mammalian sHSP family in neurological disorders. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2013;368 doi: 10.1098/rstb.2011.0409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Van Aken O., De Clercq I., Ivanova A., Law S.R., Van Breusegem F., Millar A.H., Whelan J. Mitochondrial and Chloroplast Stress Responses Are Modulated in Distinct Touch and Chemical Inhibition Phases. Plant Physiol. 2016;171:2150–2165. doi: 10.1104/pp.16.00273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Jofré A., Molinas M., Pla M. A 10-kDa class-CI sHsp protects E. coli from oxidative and high-temperature stress. Planta. 2003;217:813–819. doi: 10.1007/s00425-003-1048-x. [DOI] [PubMed] [Google Scholar]
- 38.Vígh L., Török Z., Balogh G., Glatz A., Piotto S., Horváth I. Membrane-regulated stress response: A theoretical and practical approach. Adv. Exp. Med. Biol. 2007;594:114–131. doi: 10.1007/978-0-387-39975-1_11. [DOI] [PubMed] [Google Scholar]
- 39.Perng M.D., Huang Y.S., Quinlan R.A. Purification of Protein Chaperones and Their Functional Assays with Intermediate Filaments. Methods Enzymol. 2016;569:155–175. doi: 10.1016/bs.mie.2015.07.025. [DOI] [PubMed] [Google Scholar]
- 40.Arrigo A.P., Firdaus W.J., Mellier G., Moulin M., Paul C., Diaz-latoud C., Kretz-remy C. Cytotoxic effects induced by oxidative stress in cultured mammalian cells and protection provided by Hsp27 expression. Methods. 2005;35:126–138. doi: 10.1016/j.ymeth.2004.08.003. [DOI] [PubMed] [Google Scholar]
- 41.van de Klundert F.A., de Jong W.W. The small heat shock proteins Hsp20 and alphaB-crystallin in cultured cardiac myocytes: Differences in cellular localization and solubilization after heat stress. Eur. J. Cell Biol. 1999;78:567–572. doi: 10.1016/S0171-9335(99)80022-3. [DOI] [PubMed] [Google Scholar]
- 42.Arrigo A.P. sHsp as novel regulators of programmed cell death and tumorigenicity. Pathol. Biol. (Paris) 2000;48:280–288. [PubMed] [Google Scholar]
- 43.Zhong B.-X., Li J.-K., Lin J.-R., Liang J.-S., Su S.-K., Xu H.-S., Yan H.-Y., Zhang P.-B., Fujii H. Possible Effect of 30K Proteins in Embryonic Development of Silkworm Bombyx mori. Planta. 2003;217:813–819. doi: 10.1111/j.1745-7270.2005.00044.x. [DOI] [PubMed] [Google Scholar]
- 44.Shi X.F., Li Y.N., Yi Y.Z., Xiao X.G., Zhang Z.F. Identification and Characterization of 30 K Protein Genes Found in Bombyx mori (Lepidoptera: Bombycidae) Transcriptome. J. Insect Sci. 2015;15 doi: 10.1093/jisesa/iev057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Yang J.P., Ma X.X., He Y.X., Li W.F., Kang Y., Bao R., Chen Y., Zhou C.Z. Crystal structure of the 30 K protein from the silkworm Bombyx mori reveals a new member of the β-trefoil superfamily. J. Struct Biol. 2011;175:97–103. doi: 10.1016/j.jsb.2011.04.003. [DOI] [PubMed] [Google Scholar]
- 46.Park T.H. Anti-Apoptosis Engineering for Mammalian Cell Culture using 30K Protein and its Gene Originating from Bombyx mori. Asian Pac. Confed. Chem. Eng. Congr. Progr. 2005;2004:348. doi: 10.11491/apcche.2004.0.348.0. [DOI] [Google Scholar]
- 47.Sima Y.H., Li B., Xu H.M., Chen D.X., Sun D.B., Zhao A.C., Lu C., Xiang Z.H. Study on location of QTLs controlling cocoon traits in silkworm. Yi Chuan Xue Bao. 2005;32:625–632. [PubMed] [Google Scholar]
- 48.Lie Z., Cheng L., Tang Y., Sima Y. Mapping of major quantitative trait loci for economic traits of silkworm cocoon. Genet. Mol. Res. 2010;9:78–88. doi: 10.4238/vol9-1gmr676. [DOI] [PubMed] [Google Scholar]
- 49.Bin L.I., Cheng L.U., Zhao A.C., Xiang Z.H. Multiple Interval Mapping for Whole Cocoon Weight and Related Economically Important Traits QTL in Silkworm (Bombyx mori) Agric. Sci. China. 2006;5:798–804. [Google Scholar]
- 50.Cheng L.U., Bin L.I., Aichun Z., Xiang Z. QTL mapping of economically important traits in Silkworm (Bombyx mori) Sci. China Ser. C Life Sci. 2004;47:477. doi: 10.1360/03yc0260. [DOI] [PubMed] [Google Scholar]
- 51.Li R., Yu C., Li Y., Lam T.W., Yiu S.M., Kristiansen K., Wang J. SOAP2: An improved ultrafast tool for short read alignment. Bioinformatics. 2009;25:1966–1967. doi: 10.1093/bioinformatics/btp336. [DOI] [PubMed] [Google Scholar]
- 52.Duan J., Li R., Cheng D., Fan W., Zha X., Cheng T., Wu Y., Wang J., Mita K., Xiang Z., et al. SilkDB v2.0: A platform for silkworm (Bombyx mori) genome biology. Nucleic Acids Res. 2010;38:D453–D456. doi: 10.1093/nar/gkp801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Kim D., Pertea G., Trapnell C., Pimentel H., Kelley R., Salzberg S.L. TopHat2: Accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14:R36. doi: 10.1186/gb-2013-14-4-r36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Kim K., van de Wiel M.A. Effects of dependence in high-dimensional multiple testing problems. BMC Bioinform. 2008;9:114. doi: 10.1186/1471-2105-9-114. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.