Abstract
Coffee production is a global industry valued at approximately 173 billion US dollars. One of the main challenges facing coffee production is the management of the coffee berry borer (CBB), Hypothenemus hampei, which is considered the primary arthropod pest of coffee worldwide. Current control strategies are inefficient for CBB management. Although biotechnological alternatives, including RNA interference (RNAi), have been proposed in recent years to control insect pests, characterizing the genetics of the target pest is essential for the successful application of these emerging technologies. In this study, we employed RNA-seq to obtain the transcriptome of three developmental stages of the CBB (larva, female and male) to increase our understanding of the CBB life cycle in relation to molecular features. The CBB transcriptome was sequenced using Illumina Hiseq and assembled de novo. Differential gene expression analysis was performed across the developmental stages. The final assembly produced 29,434 unigenes, of which 4,664 transcripts were differentially expressed. Genes linked to crucial physiological functions, such as digestion and detoxification, were determined to be tightly regulated between the reproductive and nonreproductive stages of CBB. The data obtained in this study help to elucidate the critical roles that several genes play as regulatory elements in CBB development.
Subject terms: RNA sequencing, RNA sequencing, Transcriptomics, Transcriptomics, Entomology
Introduction
Coffee (Coffea spp.) is one of the most traded commodities in the world and is the second-most consumed beverage, with more than 9 million tons being consumed annually1. Currently cultivated in over 70 countries, the coffee industry is valued at approximately 173 billion US dollars, representing an important source of employment around the world2. Crop yield is reduced by more than 30 species of insect pests3. Among these pests, the coffee berry borer (CBB), Hypothenemus hampei (Ferrari, 1867) (Coleoptera: Curculionidae), is considered the most damaging pest for the coffee industry, causing annual global losses in excess of 500 million US dollars4.
The CBB life cycle occurs within the coffee seed, with the adult female laying eggs in galleries formed throughout the endosperm. After hatching, the larvae feed on the coffee seed, reducing grain quality and increasing seed susceptibility to pathogen attack5. The endophytic behavior of CBB limits the use of traditional management methods due to poor cost-effectiveness, biosafety concerns and the development of insect resistance4. Biological control and volatile repellents have shown moderate efficiency for CBB management under specific field conditions. However, the success of these strategies remains limited when applied under a wide range of environmental conditions6,7. A number of alternatives have been proposed for CBB control based on biotechnological approaches. Molecules with known entomotoxic activity, such as the Bacillus thuringiensis (Bt) proteins Cry1Ba and Cry3Aa, have been successfully tested against H. hampei first instar larvae8. Furthermore, molecules with inhibitory effects on digestive enzymes have been shown to generate toxic effects in CBB by reducing the enzymatic activity of endogenous alpha-amylases9,10 and inhibiting endopeptidases11. More recent technologies for crop protection, such as the highly specific RNA interference (RNAi) mechanism, have shown promising results for the management of coleopteran pests12.
In addition to the application of RNAi in pest control, the discovery of RNAi has also enabled the identification of gene function with high accuracy13. Crop protection using this technology has been well-studied through the administration of double-stranded RNA (dsRNA) molecules, which trigger specific degradation of homologous mRNA in the target insect. Silencing genes with essential functions has been suggested to be a useful approach to ensure insect mortality, thereby increasing host plant resistance to such pests14–16. For more detailed information about RNAi mechanisms and their application in pest management, we recommend a recent review17.
‘Omic’ technologies are important tools not only for molecular, genetic and cellular research on insects but also for identifying and selecting target genes for RNAi12,18. The availability of sequence databases is very useful for a better understanding of host-pathogen interactions19, feeding mechanisms20, development21, and defense pathways in phytophagous insects22. Additionally, a significant number of studies using transcriptomic-based approaches have focused on identifying genes involved in developmental processes, with expression profiles among different life stages reported in several insect species21,23,24. Recent studies have reported such gene expression analyses across different developmental stages in lepidopteran pests of global economic relevance24–26. Similarly, a number of such studies have been performed on coleopteran pests21,27,28. Advances have also been described with regard to the expression patterns of chemosensory-related proteins during nonreproductive and reproductive phases and the differences associated with sex differentiation26,28. Despite such recent investigations, a gene expression analysis across different life cycle stages has not been performed to date in H. hampei.
The female genome sequence database is a valuable resource for CBB with a genome size of 163 Mb containing more than 19,000 predicted protein-coding genes29. RNA-seq data and genome sequencing of CBB have enabled the identification of important features, including horizontal gene transfer events, a significant repertoire of antimicrobial peptides, noticeable expression of genes encoding glycosidases and characterization of different detoxification pathways, including caffeine detoxification enzymes29. Nevertheless, no studies to date have assessed how the expression of these genes varies across the CBB developmental stages. In this work, we present the sequenced transcriptome of three different developmental stages of H. hampei. Using a de novo assembly method, we obtained a set of 53,978 transcripts clustered in 29,434 unigenes. Ultimately, 4,664 of these transcripts were found to be differentially expressed, with genes with known essential functions for insect fitness being highlighted as potential RNAi targets.
Results
Sequencing and De Novo transcriptome assembly
Illumina sequencing of nine H. hampei libraries (female, male and L2 larva; three replicates per stage) yielded a total of 265.3 million paired-end (PE) reads, totaling ~66.8 Gbp. The library size ranged from 16.3 to 49.2 million PE reads. Reads below a quality score of 28 (~24%) were removed, with a total of 201.8 high-quality PE reads being submitted to digital expression analysis. Additionally, only 95.1 million high-quality PE reads (~47%) were used for transcriptome assembly after digital normalization. A de novo approach was employed for transcriptome assembly, as the percentage of accurately mapped reads on the available reference genome29 was too small (<40%) to enable robust expression analysis. Clean reads were assembled into 116,587 contigs. A total of 56,738 (48.7%) contigs were identified as putatively homologous based on BLAST alignment against the NR database. After filtering noncoding sequences and 2,732 (2.4%) contaminant sequences (Supplementary File S1), a final assembly of ~106.6 Mbp was obtained, containing 53,978 contigs and 29,434 unigenes (longest isoform). The assembly quality showed 98.9% completeness. A more detailed list with the assembly statistics is given in Table 1. This transcriptome assembly project has been deposited at GenBank (NCBI database) under the submission number SUB4491034. In addition to transcriptome data, the genome size of CBB females was estimated by flow cytometry at 2n = 400 Mbp (Supplementary Fig. S1).
Table 1.
Assembly Statistic | Prefiltered assembly* | Postfiltered assembly | ||
---|---|---|---|---|
Number of Unigenes | 80,046 | 29,434 | ||
Number of Contigs | 116,587 | 53,978 | ||
GC % | 34.99 | 35.77 | ||
Completed BUSCOs | 1,054 (98.9%) | |||
Single-copy BUSCOs | 536 (50.3%) | |||
Duplicated BUSCOs | 518 (48.6%) | |||
Contigs | Contigs | Unigenes | Contigs | Unigenes |
Median (bp) | 705 | 573 | 1,549 | 1,114 |
Mean (bp) | 1,258.25 | 970.67 | 1,975.49 | 1,609.92 |
Total assembled bases: | 146,695,858 | 77,697,918 | 106,632,933 | 47,386,471 |
Contig N10 (bp) | 5,626 | 4,739 | 6250 | 5735 |
Contig N30 (bp) | 3,288 | 2,546 | 3948 | 3551 |
Contig N50 (bp) | 2,079 | 1,388 | 2795 | 2427 |
*Prefiltered assembly includes noncoding and contaminant sequences.
Functional annotation
Functional annotation was performed to provide an overview of protein-coding sequences among the assembled CBB contigs and to predict gene functions associated with each contig. All transcripts from the final assembly were annotated against the NR, InterPro, GO, Pfam and KEGG databases. For the 53,978 contigs annotated in the NR database, more than 75% of the hits for top-hit species were against coleopteran species. Furthermore, when comparing our transcripts with the data available in the CBB genome database, 51,048 (94.6%) transcripts had at least one hit with the protein sequences predicted from the genome, and 16,252 (84.5%) of the 19,222 predicted proteins were aligned to at least one of our transcripts. As a result, H. hampei was the organism with most number of hits in blast analysis, Dendroctonus ponderosae was the 2nd top-hit species with 29,502 (54.4%) annotated contigs and Tribolium castaneum was the 3rd top-hit species with 11,245 (20.7%) annotated contigs (Supplementary Fig. S2). For annotation, a total 45,759 GO terms (1,176 single GOs and 1,888 single GOs) were distributed into 98 functional categories according to GO classification, all belonging to the three main GO categories: biological process (50 subcategories), molecular function (21 subcategories), and cellular component (27 subcategories), which comprised 17,036, 20,101 and 14,739 contigs, respectively (Fig. 1a). Additionally, a total of 19,027 (35.2%) contigs within 8,611 (29.3%) unigenes were annotated with at least one Pfam code. The three protein family domains with most associated sequences were protein kinase (PF00069), with 1,683 contigs (8.8%), cytochrome p450 (PF00067), with 1,256 contigs (6.6%) and 7-transmembrane receptors type 1 (PF00001), with 1,029 contigs (5.4%) (Fig. 1b). Finally, 2,685 contigs (1,499 unigenes) were associated with at least one enzyme code following KEGG database annotation. Overall, 344 enzymes were identified and classified into 121 metabolic pathways (Fig. 1c).
Digital gene expression profiling
An average of 73.2% of high-quality filtered reads (ranging from 68.9% to 75.3% across libraries) were successfully mapped against our final assembly to enable FPKM-based analysis of expression values for transcripts for each developmental stage. All three replicate libraries for each developmental stage showed a high correlation (Pearson r = 0.86–0.99) and were used for expression analysis. A total of 4,664 transcripts (3,484 unigenes) were differentially expressed across the three evaluated developmental stages of H. hampei (Fig. 2a). Using the female life stage as a reference, 676 and 625 transcripts were upregulated and downregulated, respectively, in L2 larva. In contrast, 3,069 and 294 transcripts were upregulated and downregulated, respectively, in males. Furthermore, among all the DEGs, a total of 316 transcripts comprising 162 upregulated and 154 downregulated transcripts were differentially expressed in both male and L2 larva compared to expression in the female. Four main sets of DEGs (L2 larvae upregulated, L2 larvae downregulated, male upregulated and male downregulated) were generated (Fig. 2b). The majority of the DEGs were expressed with fold change values >2 and ≤5.5 (36.6). In contrast, DEGs showing values >12.5 and ≤15.6 were less abundant (1.9%). A plot showing the distribution of fold change values is presented in Supplementary Fig. S3. Surprisingly, 803 DEGs (17.2%) were “incompletely” annotated, matching only genes associated with unknown functions, hypothetical proteins and uncharacterized proteins, thereby indicating the need for improved annotation and genetic resources for the CBB.
Gene ontology enrichment analysis
GO enrichment analysis was performed separately for all four sets of DEGs. After filtering redundant GO terms, a total of 117 GO terms were associated with all the DEG sets, of which 32, 24, 61 and 0 were enriched in the L2 larva upregulated, L2 larva downregulated, male upregulated and male downregulated sets, respectively. Additionally, 8 GO terms were identified to be shared between the L2 larva upregulated and male upregulated sets. The most significant GO terms for each set of DEGs across the different developmental stages are shown in Fig. 3a (full list provided in Supplementary File S2). Interestingly, genes related to drug metabolism, chitin metabolism, proteolysis, and cuticle regeneration were upregulated in both L2 larvae and males. Genes related to developmental processes, carbohydrate metabolism, defense responses and cell signaling pathways were upregulated only in L2 larva. Additionally, genes related to transport processes, neuronal activity, sleep behavior and vitamin binding were downregulated only in L2 larva.
Enzyme enrichment
Enzyme enrichment was performed to identify highly specific metabolic pathways associated with each developmental stage. Overall, 15 pathways were associated with 12 enzymes that showed significant enrichment (FDR > 0.05) in our DEG sets. Interestingly, most of the pathways (14) were associated with the male upregulated set, while the remaining pathway was associated with the L2 larva upregulated set (Fig. 3b). On the other hand, no enzyme enrichment was observed for the male downregulated set.
DEGs between males and females
From the total number of unigenes that were differentially expressed, 2,124 unigenes (68.3% of total DEGs) were upregulated in males, of which 1,979 (56.8% of total DEGs) were exclusively upregulated in males and 145 were upregulated in L2 larvae. For a more detailed analysis of this predominant group of DEGs, unigenes associated with enriched GO terms/pathways were grouped by gene function (Fig. 4). Notably, transposable element-related proteins (TEs) and reproductive organ-related proteins were mainly upregulated in males. Among the TE-related proteins, genes encoding 41 transposases (TSs) and 20 TE-derived proteins were identified as DEGs. Among the TEs and TSs, those belonging to the Tc1/mariner superfamily were the most abundant, with 19 and 26 DEGs, respectively, being identified.
DEGs between larva and adult stages
We identified several differentially expressed genes that could be key elements for understanding the physiological and metabolic differences between reproductive and nonreproductive stages of the CBB by comparing L2 larva and female libraries (Supplementary Table S1). Genes annotated with cytochrome p450 (CYP) detoxification family domains were both upregulated (5 unigenes) and downregulated (7 unigenes) in the larva. Interestingly, for cuticle metabolism-related genes, all DEGs were upregulated in larvae and males (25 unigenes each). The DEGs that were annotated with functions related to cuticle and chitin metabolism were grouped according to their predicted proteins and are shown in Fig. 5. Finally, chemosensory candidate genes belonging to the Odorant Binding Protein (OBP) family (Fig. 6) were clustered in different classes using phylogenetic analysis. Most of the OBP genes were upregulated in adults (4), with two of them being exclusively expressed in females and belonging to the minus-C OBP class. Additionally, the only OBP upregulated in larvae was grouped within this class. No OBPs were upregulated in males compared with females.
RNA-seq data validation by real-time quantitative reverse transcription PCR (RT-qPCR)
RT-qPCR was conducted to validate the differentially expressed transcripts identified by DEG analyses of RNA-seq data. Six genes related to developmental processes were validated: juvenile hormone (JH)-related enzymes (JH esterase and JH epoxide hydrolase), cuticular proteins analogous to peritrophins (CPAPs) (CPAP3A1 and CPAP3C5) and proteins involved in chitin metabolism (hexosaminidase I and chitinase 10). Overall, the RT-qPCR results were consistent with the RNA-seq data with a correlation value of 0.79 for the entire set of genes that were validated (Fig. 7).
Discussion
Multiple studies have sought to identify alternative methods for CBB management based on the physiological, ecological and behavioral characteristics of this damaging pest30–32. However, few studies have described the molecular features of CBB (i.e., its genome structure, transcription profiles or proteome)29,33. Transcriptomics is a useful approach for providing complementary data for available genomes, as well as gene expression data, for an organism under a specific condition or developmental stage. Considering that sequencing insect transcriptomes can enable the complete identification of novel genes, with functional characterization of large datasets of genes and transcript abundance, this approach can provide valuable information for insect pest control strategies. In addition to the transcriptomes of well-characterized model organisms, such as T. castaneum34 and D. ponderosae35, the transcriptomes of different developmental stages of other coleopteran insects, including Cylas formicarius21, Batocera horsfieldi36 and Cyrtotrachelus buqueti27, have been recently sequenced, increasing sequencing data that are relevant for research into many other insect targets.
In this study, we used high-throughput sequencing to generate a 47.4-Mbp assembly of 29,434 unigenes, which is considerably larger than the number predicted in the draft genome of the CBB (~28.4 Mbp) with a total of 19,222 predicted genes, of which only 15,546 were expressed in females29. Additionally, the predicted CBB genome size determined by flow cytometry (~200 Mbp) was slightly higher than that described for the CBB draft genome (~163 Mbp)29. Although the number of sequences in our transcriptome is higher, approximately 15% of the CBB genome predicted proteins were not aligned to any of our transcripts. Those genes could be only expressed in phases, such as pupa and eggs, that were not sequenced in this work or even expressed in specific physiological conditions. Otherwise, we found that 2,930 transcripts from our assembly did not have a single hit with the predicted proteins of the CBB genome. It is important to note these transcripts as possible new genes that were not identified in the currently available genome. Another fact supporting the need to have sequenced the presented transcriptome is that the N50 of our assembly (2,427) is almost 1.5 times higher than the current transcriptome (1,638), also published by Vega et al.29. Thus, we provide a transcriptome with larger fragments than currently published29. It is important to highlight that the current versions of both the genome and transcriptome of the CBB were obtained by sequencing RNA from individuals with more than 10 years of meridic diet in the laboratory. It is expected that the variability observed in such a population should be different from the variability observed in the population used in the present work, which was obtained from insects raised in parchment coffee for only one generation. Therefore, when comparing transcriptomes, our data set is a more realistic and comprehensive representation of the gene set and transcript expression patterns in the CBB, since the population used in this work was maintained in conditions closer to the natural conditions of field populations of the CBB.
In this report, we present additional information complementary to the draft genome reported by Vega et al.29, including not only sequences obtained from female but also from male and larval phases. In our dataset, 3.7% and 8.1% of genes were differentially expressed between L2 larvae and females and between females and males, respectively. Previous reports have shown that the percentage of DEGs between nonreproductive and reproductive stages (1–5.5%) is slightly lower than the percentage of DEGs between sexes in the adult stage (2.7–8.8%) in other coleopteran species21,27,36. An important factor to be considered in future studies is that CBB males only exhibit one instar in the larval stage, while females have two instars. We observed greater differences in gene expression when comparing males and females or males and L1 larvae than when comparing males and L2 larvae, since the last larval stage is a female-specific stage37. In this work, we focus not only on identifying target genes specific for larval instars but also for females. Early stages are usually the most appropriate targets for the management of insect pests. However, due to the endophytic behavior of the CBB’s early stages, this approach severely limits the method used for control. In the case of larvae, transgenic plants should be more efficient. Otherwise, nontransformative methods could suit better when applied to females, which is the only stage of the CBB that leaves the endosperm. Hence, only females are exposed to control methods that are not able to permeate the endosperm of the coffee bean. Thus, understanding the expression profile and RNAi target identification in larval and adult stages of the CBB is equally important.
From a functional perspective, we observed that for both the GO and the pathway enrichment analysis, the upregulated genes in males were enriched with more functional categories (80 GO terms and 14 KEGG pathways) than any other DEG set. This finding is critical to improve understanding of sex determination in the CBB, in which the sex ratio is distorted towards females with a 10:1 ratio5. The concept of male haploid (arrhenotoky) has been used to explain sex determination in the CBB, and linkage between haplodiploidy and insecticide resistance has been proposed38. However, the genetics associated with the life history of CBB are poorly understood, and the specific genes or metabolic pathways involved in sex determination remain unknown. In this work, we showed that over 68% of the DEGs were upregulated in males, with DEGs associated with TEs and sex determination highlighted. Hernandez-Hernandez et al.39 characterized the TEs present in the CBB genome, revealing that 11.51% of the TEs belong to the Tc1/mariner superfamily. Furthermore, the authors confirmed that at least 8.2% of the CBB genome is composed of TEs. In this study, we identified 724 transcripts annotated with the TE Pfam family (Fig. 1b) and 1,367 unigenes with at least one match to TE-related proteins in BLAST analysis. Thus, at a minimum, 2.5% of the TE-related proteins are expressed in the CBB transcriptome, of which 4.7% are differentially expressed genes. As discussed by Hernandez-Hernandez et al.39, insect reproductive characteristics may be related to low TE content. Haplodiploidy, in which a chromosome set from males is not transmitted to the next generation, prevents the spread of certain TE genes. This fact may explain the higher abundance of genes encoding TE-related proteins in the male mRNA libraries reported in this work.
In addition to the comparison of expression between males and females, comparing the reproductive and nonreproductive stages also revealed interesting gene expression variation across the CBB life cycle. The larva-to-adult transition and developmental regulation in holometabolous insects are critical, not only from an evolutionary and ecological perspective but also from the perspective of developing sustainable methods for insect pest management40. This study describes expression differences between the adult and immature stages of CBB and identified genes that were previously detected in the genome of this insect that could be linked to insect development29. As shown in both GO and pathway enrichment analyses, several genes related to carbohydrate and protein metabolism were differentially expressed in larvae compared to their expression in females. Overall, most of the predicted proteins involved in cellulose and hemicellulose metabolism, such as endoglucanases, cellulases (glycoside hydrolase family 48) and xylanases, were upregulated in larva. Interestingly, these enzymes have been reported to digest plant cell wall components in many coleopteran species, specifically cellulose, hemicellulose and lignocellulose, as their main carbohydrate sources41.
Moreover, genes encoding enzymes with protease activity were upregulated in larvae (serine proteases and thiol proteases) and females (peptidyl-dipeptidases and metalloproteases). Several studies have shown higher expression of digestive enzyme-coding genes in larval stages, probably because these nonreproductive stages, often called the feeding stages, involve a more active feeding behavior than observed in adults21,25. Digestive enzymes are considered potential targets for insect pest management, as their function in insect nutrition is essential, and their inhibition can cause lethal or deleterious effects in the larval stages42. However, it has been reported that insect genes rapidly evolve in response to inhibitors targeting digestive enzymes43, meaning that the identification of novel variants of these genes could be crucial in designing more precise and efficient strategies for pest control44,45.
Other genes that can be considered potential targets for RNAi and insecticide development are those encoding cuticular proteins (CPs) and enzymes involved in chitin metabolism. In insects, the structure of the cuticle is determined by cuticular proteins, which play roles in many physiological processes, such as protection against dehydration, insecticide penetration, pathogens and physical injury46. Therefore, CPs could be used as potential targets for the management of insect pests47. Some genes encoding cuticular binding proteins and cuticular proteins analogous to peritrophins (CPAPs) were differentially expressed in our data set, with higher expression being observed in larvae and males than in females (Fig. 5). Similarly, certain chitin synthases, chitinases and chitin deacetylases showed higher expression in larvae and males than in females. Interestingly, genes involved in chitin degradation appeared upregulated to a greater extent in males compared to larvae or females. Precise control of chitin synthesis and degradation rates is essential for insect homeostasis. Determining the expression profiles of genes that regulate these rates could be useful in the design of pest control strategies48.
Finally, we highlighted a group of genes encoding proteins with chemosensory functions. The multigenic family of OBPs is thought to be involved in chemical recognition, which regulates pivotal behaviors, including host choice, copulation, and reproduction49. In coleopteran adults, olfaction plays a major role in finding mates, locating oviposition sites, foraging, aggregating and avoiding predators50. Despite the importance of the olfaction mechanism, the molecular basis of chemoreception in coleopterans is still poorly understood compared to lepidopterans and dipterans51,52. The OBP family can be separated into four different phylogenetic groups: the classic OBPs, which are characterized by six cysteine residues at conserved positions; the minus-C class, which lost two of the conserved cysteine residues; the plus-C OBPs, which contain four to six additional cysteines and one characteristic proline; and the highly specific antennal binding proteins (ABPII)53. A few studies conducted in coleopteran pests have evaluated the tissue-specific expression profiles of most of the OBPs described in insects, in some cases showing few differences between male and female tissues. However, little is known about the expression profiles of OBP genes across different developmental stages28,54. In this study, we identified 15 full-length candidate genes encoding OBPs, of which six were differentially expressed during different stages. Most of the candidate OBP genes showed the highest expression in adults, mainly in females, and only one of these genes was upregulated in larvae.
Moreover, when we compared the H. hampei and T. castaneum OBP profiles54 in our phylogenetic analysis (Fig. 6), we observed that no OBP genes were upregulated in T. castaneum larvae. Additionally, of a total of 16 OBPs that were differentially expressed in T. castaneum and H. hampei, eight belonged to the minus-C class. These findings suggest that differences in the expression of OBP genes in Coleoptera species may be determined by the transition to a reproductive stage. Since the CBB males never leave the coffee seeds, it is possible that OBPs with higher expression in females are related to the females’ colonizing role, which involves detecting new and suitable coffee seeds. The role of minus-C OBPs in this process has not been determined to date, but its potential as a target for pest management is of great interest. As proposed by Zhou et al. in 2010, interruption of signal perception by blocking OBPs could prevent mating events or affect normal host selection55. Indeed, knockdown of OBP genes in recently published work has shown lethal effects in different insect pests56,57. In the CBB, OBP knockdown may also show lethal effects because of the CBB’s endophytic lifestyle, in which odor-based communication is considered essential to insect reproduction and survival.
Methods
Insects
Second instar larvae and adult females and males of the CBB were collected in 2015 from established laboratory colonies located at the Universidad de Caldas in Manizales, Caldas, Colombia. Insects were obtained from infested coffee plants in the field and then raised in parchment coffee bean. Only insects with one generation raised in the laboratory were collected in 1.5 ml RNase-free tubes with 1 ml of RNAlater stabilization solution (Qiagen, Germany) and then stored at −80 °C until further processing. Samples of 20 mg from the whole insect body were prepared from each developmental stage and used for RNA extraction.
RNA isolation and Illumina sequencing
Total RNA was isolated from each developmental stage of H. hampei using the RNeasy Mini Kit (Qiagen, Valencia, CA, USA) according to the manufacturer’s instructions and stored at −80 °C until use. RNA extraction was performed on three biological replicates. RNA integrity was assessed using a 2100 Bioanalyzer system (Agilent Technologies, Santa Clara, CA, USA) and through 1% denaturing agarose gel electrophoresis. RNA concentration was measured using a NanoDrop 1000 (Thermo-Fisher Scientific, Wilmington, DE, USA). Stage-specific samples from RNA extractions were used for cDNA library construction with a TruSeq RNA Library Prep Kit v2 (Illumina, San Diego, CA, USA). Finally, cDNA libraries were purified using a Ribo-Zero rRNA Removal Kit and sequenced in a 125 bp PE run on an Illumina HiSeqTM 2500 platform at the University of Minnesota Genomics Center (MN, USA).
De Novo transcriptome assembly and functional annotation
The raw reads were cleaned using Trimmomatic v0.33 software58 for removal of adaptors, sequences containing unknown nucleotides (N over 5%) and low-quality sequences (Phred < 28). Initially, clean reads were mapped against the current CBB genome sequences (https://genome.med.nyu.edu/coffee-beetle/cbb.html). TopHat software was used to map our reads against the genome scaffolds59. For the assembly, clean reads were digitally normalized using KHMER v2.0 software60 with a maximum coverage of 50 bp for each read. Normalized reads were assembled into longer fragments (contigs) using Trinity v2.0.6 software61, which generated a set of components (unigenes) containing a variable number of transcripts (isoforms). Transcripts were BLASTed against the National Center for Biotechnology Information (NCBI) nonredundant protein (NR) database using the blastx algorithm with an e-value cutoff < 1E-5. Additionally, since no RefSeq from the current CBB genome is available in the NCBI database, a BLASTx was performed by aligning our transcripts to the predicted protein sequences in the CBB genome project database (https://genome.med.nyu.edu/coffee-beetle/cbb.html). To remove potential microbial sequences and other contaminants from the assembly, in silico curation was performed by manual removal of transcripts that only matched sequences from plants and/or microorganisms in BLAST analysis. The remaining transcripts were annotated using Blast2GO Basic v4.162 by identifying protein domains from the InterPro database63, protein families from the Pfam database64, GO terms (GOs) from the GO database65 and metabolic pathways from the KEGG (Kyoto Encyclopedia of Genes and Genomes) database66. Finally, to estimate the genome size of female CBB, flow cytometry was performed following the protocol of Gregory et al.67.
Analysis of differentially expressed genes
To compare gene expression profiles across three developmental stages (L2 larva, adult male, and adult female) of H. hampei, clean reads of each library were mapped to our assembly using RSEM software68, and abundance estimation was performed by using the script align_and_estimate_abundance.pl in the Trinity package. Expression levels of the transcripts were calculated based on the FPKM normalization method using the edgeR package69. Comparisons of differentially expressed genes (DEGs) were performed using female libraries as a reference. Female libraries were used as reference with the objective of properly showing the two main comparisons in this work, between sexes (female and male) and between reproductive and nonreproductive phases (female and L2 larva), considering that L2 instar is specific to females in the CBB. The criteria for statistical significance of DEGs were a false discovery rate (FDR) less than 0.001 and a log2 (fold change) >2. Finally, GO enrichment analysis was performed for DEG sets using FUNC software70. Removal of redundant GO terms was performed in REVIGO software with an allowed similarity value of 0.571. Visualization of the expression profiles was performed using ClustVis software72.
Phylogenetic analysis
Multiple alignments were performed with the full-length amino acid sequences of chemosensory candidate genes belonging to the odorant binding protein (OBP) family. To this end, sequences from two well-characterized genomes of coleopteran pests (T. castaneum and D. ponderosae) and those identified in our assembly were used. After the removal of a signal peptide in the N-terminal region (SignalP4.1)73, sequences were aligned with MAFFT v7 software74 and further edited with trimAL software75. Phylogenetic trees were constructed using RAxML v8 software76, using the maximum likelihood method with the LG model and 1000 bootstrap replicates. The accession numbers of the OBP sequences from D. ponderosae and T. castaneum are listed in Supplementary File S3. The candidate OBP-coding genes from the H. hampei assembly, along with their BLAST results and FPKM values, are also listed (Supplementary File S4).
Quantitative real-time PCR (RT-qPCR) validation
To validate data obtained from RNA-seq, total RNA was extracted from different samples of those used in the sequencing but treated under the same experimental conditions. This approach was used to improve the reliability of the expression patterns by considering technical and biological variation. Insect cDNA was synthesized from 1000 ng of RNA using a QuantiTect Reverse Transcription Kit (Qiagen, Valencia, CA, USA) following the manufacturer’s instructions. The RT-qPCR reactions included 2 μl of diluted cDNA (1:50), 5 μl of Fast SYBR® Green Master Mix (Applied Biosystems, Foster City, CA, USA), 0.2 μl of forward and reverse primers at 10 μM, and 2.6 μl of nuclease-free water, for a total volume of 10 μl. Primers were designed using Primer377 and validated by analysis of their PCR amplification efficiencies (E) and correlation coefficients (R2) (Supplementary Table S2). Both primer efficiency tests and RT-qPCR were performed on a 7500 Fast RT-PCR System (Applied Biosystems, Foster City, CA, USA). The RT-qPCR program included a holding stage of 95 °C for 20 s followed by 40 cycles of denaturation at 95 °C for 3 s and annealing/extension at 60 °C for 30 s (cycling stage). A melting curve was generated to confirm the production of a single peak and to rule out the possibility of primer dimers and nonspecific product formation. The expression of the genes was calculated using the comparative 2−ΔΔCT method78. Relative gene expression analysis was performed using β-actin as a normalizer.
Supplementary information
Acknowledgements
The authors thank the Universidad de Caldas (Manizales, Colombia), the Department of Entomology at the University of Nebraska (Lincoln, NE, USA), Embrapa, Consortium Research Coffee, CAPES, CNPq, and FAPDF (Brazil) for supporting this research. The Colombian National Fund for Science, Technology and Innovation, Francisco Jose de Caldas-COLCIENCIAS (Colombia), also supported this research work (code: 112771250888, contract 253–2016). Special thanks are due to Dr. Robert Miller (professor at the University of Brasilia, Brazil) for his helpful contribution in correcting the English language. This work provides new data and relevant scientific information about the genetics of the Coffee berry borer (H. hampei), the main pest of coffee crops worldwide. As omics sciences are becoming useful tools for study biology of different organisms, we believe this work suits perfectly in Scientific Reports journal due to the contribution to understanding the biology of a species with high importance for biology and agriculture.
Author Contributions
Conceptualization: Maria F. Grossi-de-Sa, Maria C.M Silva, Arnubio Valencia-Jiménez. Data curation & Formal Analysis: Daniel D. Noriega, Paula L. Arias, Helena R. Barbosa, Fabricio B.M. Arraes, Roberta R. Coelho, Roberto C. Togawa, Priscila Grynberg, Maria C.M Silva, Arnubio Valencia-Jiménez. Funding acquisition: Bernardo Villegas, Erika S. Albuquerque, Maria F. Grossi-de-Sa, Maria C.M Silva, Arnubio Valencia-Jiménez. Investigation: Daniel D. Noriega, Paula L. Arias, Helena R. Barbosa, Gustavo A. Ossa, Fabricio B.M. Arraes, Erika S. Albuquerque, Roberto C. Togawa, Priscila Grynberg, Haichuan Wang, Ana M. Vélez, Jorge W. Arboleda, Maria C.M Silva, Arnubio Valencia-Jiménez. Methodology: Daniel D. Noriega, Paula L. Arias, Helena R. Barbosa, Fabricio B.M. Arraes, Erika S. Albuquerque, Roberto C. Togawa, Priscila Grynberg, Haichuan Wang, Ana M. Vélez, Maria F. Grossi-de-Sa, Maria C.M Silva, Arnubio Valencia-Jiménez. Project administration: Bernardo Villegas, Maria C.M Silva, Arnubio Valencia-Jiménez. Resources: Bernardo Villegas, Erika S. Albuquerque, Ana M. Vélez, Maria F. Grossi-de-Sa, Maria C.M Silva, Arnubio Valencia-Jiménez. Supervision: Maria C.M Silva, Arnubio Valencia-Jiménez. Validation: Daniel D. Noriega, Paula L. Arias, Helena R. Barbosa, Gustavo A. Ossa, Maria C.M Silva, Arnubio Valencia-Jiménez. Visualization: Daniel D. Noriega, Paula L. Arias, Helena R. Barbosa, Gustavo A. Ossa, Fabricio B.M. Arraes, Roberta R. Coelho, Roberto C. Togawa, Priscila Grynberg, Arnubio Valencia-Jiménez. Writing ± original draft: Daniel D. Noriega, Paula L. Arias, Helena R. Barbosa, Fabricio B.M. Arraes, Erika S. Albuquerque, Roberta R. Coelho, Maria F. Grossi-de-Sa, Maria C.M Silva, Arnubio Valencia-Jiménez. Writing ± review & editing: Daniel D. Noriega, Fabricio B.M. Arraes, Ana M. Vélez, Maria F. Grossi-de-Sa, Maria C.M Silva, Arnubio Valencia-Jiménez.
Data Availability
The datasets generated during the current study are available in supplementary information. Raw sequencing and transcriptome assembly data were deposited in NCBI databases, but restrictions apply to the availability of these data. Data will be released for public access once this paper is published.
Competing Interests
The authors declare no competing interests.
Footnotes
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Daniel D. Noriega, Paula L. Arias and Helena R. Barbosa contributed equally.
Maria C. M. Silva and Arnubio Valencia-Jiménez jointly supervised this work.
Contributor Information
Daniel D. Noriega, Email: daniel.nv07@gmail.com
Maria F. Grossi-de-Sa, Email: fatima.grossi@embrapa.br
Arnubio Valencia-Jiménez, Email: arnubio.valencia@ucaldas.edu.co.
Supplementary information
Supplementary information accompanies this paper at 10.1038/s41598-019-49178-x.
References
- 1.ICO, International Coffee Organization, http://www.ico.org/ accessed on April (2018).
- 2.Vega, F. E., Infante, F. & Johnson, A. J. The genus Hypothenemus, with emphasis on H. hampei, the coffee berry borer, in Bark Beetles 427–494 (Elsevier, 2015).
- 3.Barrera, J. F. Coffee pests and their management in Encyclopedia Of Entomology (ed. Capinela, J.) 961–998 (Springer, 2008).
- 4.Infante F. Pest Management Strategies Against the Coffee Berry Borer (Coleoptera: Curculionidae: Scolytinae) Journal of agricultural and food chemistry. 2018;21:5275–5280. doi: 10.1021/acs.jafc.7b04875. [DOI] [PubMed] [Google Scholar]
- 5.Vega FE, Infante F, Castillo A, Jaramillo J. The coffee berry borer, Hypothenemus hampei (Ferrari) (Coleoptera: Curculionidae): a short review, with recent findings and future research directions. Terrestrial Arthropod Reviews. 2009;2:129. doi: 10.1163/187498209X12525675906031. [DOI] [Google Scholar]
- 6.Vega FE, et al. A Potential Repellent Against the Coffee Berry Borer (Coleoptera: Curculionidae: Scolytinae) Journal of Insect Science. 2017;17:122. doi: 10.1093/jisesa/iex095. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Rodríguez D, Cure JR, Gutierrez AP, Cotes JM. A coffee agroecosystem model: III. Parasitoids of the coffee berry borer (Hypothenemus hampei) Ecological Modelling. 2017;363:96–110. doi: 10.1016/j.ecolmodel.2017.08.008. [DOI] [Google Scholar]
- 8.López-Pazos SA, Rojas Arias AC, Ospina SA, Cerón J. Activity of Bacillus thuringiensis hybrid protein against a lepidopteran and a coleopteran pest. FEMS microbiology letters. 2009;302:93–98. doi: 10.1111/j.1574-6968.2009.01821.x. [DOI] [PubMed] [Google Scholar]
- 9.Valencia A, Bustillo AE, Ossa GE, Chrispeels MJ. α-Amylases of the coffee berry borer (Hypothenemus hampei) and their inhibition by two plant amylase inhibitors. Insect biochemistry and molecular biology. 2000;30:207–213. doi: 10.1016/S0965-1748(99)00115-0. [DOI] [PubMed] [Google Scholar]
- 10.Barbosa AE, et al. α-Amylase inhibitor-1 gene from Phaseolus vulgaris expressed in Coffea arabica plants inhibits α-amylases from the coffee berry borer pest. BMC biotechnology. 2010;10:44. doi: 10.1186/1472-6750-10-44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Pereira RA, et al. Effect of a Bowman-Birk proteinase inhibitor from Phaseolus coccineus on Hypothenemus hampei gut proteinases in vitro. Journal of agricultural and food chemistry. 2007;55:10714–10719. doi: 10.1021/jf072155x. [DOI] [PubMed] [Google Scholar]
- 12.Zhang J, Khan SA, Heckel DG, Bock R. Next-generation insect-resistant plants: RNAi-mediated crop protection. Trends in biotechnology. 2017;35:871–882. doi: 10.1016/j.tibtech.2017.04.009. [DOI] [PubMed] [Google Scholar]
- 13.Fire A, et al. Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. Nature. 1998;391:806–811. doi: 10.1038/35888. [DOI] [PubMed] [Google Scholar]
- 14.Niu X, et al. Control of Western Corn Rootworm (Diabrotica virgifera virgifera) Reproduction through Plant-Mediated RNA Interference. Scientific reports. 2017;7:12591. doi: 10.1038/s41598-017-12638-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Baum JA, et al. Control of coleopteran insect pests through RNA interference. Nature biotechnology. 2007;25:1322. doi: 10.1038/nbt1359. [DOI] [PubMed] [Google Scholar]
- 16.Zhu F, Xu J, Palli R, Ferguson J, Palli SR. Ingested RNA interference for managing the populations of the Colorado potato beetle, Leptinotarsa decemlineata. Pest management science. 2011;67:175–182. doi: 10.1002/ps.2048. [DOI] [PubMed] [Google Scholar]
- 17.Zotti M, et al. RNA interference technology in crop protection against arthropod pests, pathogens and nematodes. Pest management science. 2018;74:1239–1250. doi: 10.1002/ps.4813. [DOI] [PubMed] [Google Scholar]
- 18.Firmino AAP, et al. Transcriptome analysis in cotton boll weevil (Anthonomus grandis) and RNA interference in insect pests. PLoS One. 2013;8:e85079. doi: 10.1371/journal.pone.0085079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Vogel H, Musser RO, Celorio-Mancera M-L. Transcriptome responses in herbivorous insects towards host plant and toxin feeding. Annual plant reviews. 2014;47:197–233. doi: 10.1002/9781118829783.ch6. [DOI] [Google Scholar]
- 20.Yi, X., Qi, J., Zhou, X., Hu, M. Y. & Zhong, G. H. Differential expression of chemosensory-protein genes in midguts in response to diet of Spodoptera litura. Scientific Reports7 (2017). [DOI] [PMC free article] [PubMed]
- 21.Ma L, Wang R, Li X, Gao B, Chen S. Transcriptome and gene expression analysis of Cylas formicarius (Coleoptera: Brentidae) during different development stages. Journal of Insect Science. 2016;16:63. doi: 10.1093/jisesa/iew053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Crava CM, Brütting C, Baldwin IT. Transcriptome profiling reveals differential gene expression of detoxification enzymes in a hemimetabolous tobacco pest after feeding on jasmonate-silenced Nicotiana attenuata plants. BMC genomics. 2016;17:1005. doi: 10.1186/s12864-016-3348-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Leal L, et al. Gene expression profiling across ontogenetic stages in the wood white (Leptidea sinapis) reveals pathways linked to butterfly diapause regulation. Molecular ecology. 2018;27:935–948. doi: 10.1111/mec.14501. [DOI] [PubMed] [Google Scholar]
- 24.Chang JC, Ramasamy S. Transcriptome analysis in the beet webworm, Spoladea recurvalis (Lepidoptera: Crambidae) Insect science. 2018;25:33–44. doi: 10.1111/1744-7917.12375. [DOI] [PubMed] [Google Scholar]
- 25.Tang PA, et al. Characterization of transcriptome in the Indian meal moth Plodia interpunctella (Lepidoptera: Pyralidae) and gene expression analysis during developmental stages. Gene. 2017;622:29–41. doi: 10.1016/j.gene.2017.04.018. [DOI] [PubMed] [Google Scholar]
- 26.Zhang S, et al. Identification and Expression profiling of chemosensory genes in Dendrolimus punctatus Walker. Frontiers in physiology. 2017;8:471. doi: 10.3389/fphys.2017.00471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Yang H, et al. The developmental transcriptome of the bamboo snout beetle Cyrtotrachelus buqueti and insights into candidate pheromone-binding proteins. PloS one. 2017;12:e0179807. doi: 10.1371/journal.pone.0179807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Li L, Zhou Y-T, Tan Y, Zhou X-R, Pang B-P. Identification of odorant-binding protein genes in Galeruca daurica (Coleoptera: Chrysomelidae) and analysis of their expression profiles. Bulletin of entomological research. 2017;107:550–561. doi: 10.1017/S0007485317000402. [DOI] [PubMed] [Google Scholar]
- 29.Vega FE, et al. Draft genome of the most devastating insect pest of coffee worldwide: the coffee berry borer, Hypothenemus hampei. Scientific reports. 2015;5:12525. doi: 10.1038/srep12525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Mariño YA, Verle Rodrigues JC, Bayman P. Wolbachia Affects Reproduction and population Dynamics of the Coffee Berry Borer (Hypothenemus hampei): Implications for Biological Control. Insects. 2017;8:8. doi: 10.3390/insects8010008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Morris JR. & Perfecto, I. Testing the potential for ant predation of immature coffee berry borer (Hypothenemus hampei) life stages. Agriculture, Ecosystems & Environment. 2016;233:224–228. doi: 10.1016/j.agee.2016.09.018. [DOI] [Google Scholar]
- 32.Celestino F, et al. Toxicity of castrol oil to coffee berry borer [Hypothenemus hampei (Ferrari) (Coleoptera: Curculiondae: Scolytinae)] Coffee Science. 2015;10:329–336. [Google Scholar]
- 33.Bezerra C, et al. Molecular cloning and characterization of an α-amylase cDNA highly expressed in major feeding stages of the coffee berry borer, Hypothenemus hampei. Gene. 2014;553:7–16. doi: 10.1016/j.gene.2014.09.050. [DOI] [PubMed] [Google Scholar]
- 34.Park Y, et al. Analysis of transcriptome data in the red flour beetle, Tribolium castaneum. Insect biochemistry and molecular biology. 2008;38:380–386. doi: 10.1016/j.ibmb.2007.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Keeling CI, et al. Transcriptome and full-length cDNA resources for the mountain pine beetle, Dendroctonus ponderosae Hopkins, a major insect pest of pine forests. Insect biochemistry and molecular biology. 2012;42:525–536. doi: 10.1016/j.ibmb.2012.03.010. [DOI] [PubMed] [Google Scholar]
- 36.Yang H, et al. Transcriptome analysis in different developmental stages of Batocera horsfieldi (Coleoptera: Cerambycidae) and comparison of candidate olfactory genes. PloS one. 2018;13:e0192730. doi: 10.1371/journal.pone.0192730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Gómez J, Chávez BY, Castillo A, Valle FJ, Vega FE. The coffee berry borer (Coleoptera: Curculionidae): how many instars are there? Annals of the Entomological Society of America. 2015;108:311–315. doi: 10.1093/aesa/sav009. [DOI] [Google Scholar]
- 38.Brun L-O, Stuart J, Gaudichon V, Aronstein K, French-Constant R. Functional haplodiploidy: a mechanism for the spread of insecticide resistance in an important international insect pest. Proceedings of the National Academy of Sciences. 1995;92:9861–9865. doi: 10.1073/pnas.92.21.9861. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hernandez-Hernandez EM, et al. Genome-wide analysis of transposable elements in the coffee berry borer Hypothenemus hampei (Coleoptera: Curculionidae): description of novel families. Molecular genetics and genomics. 2017;292:565–583. doi: 10.1007/s00438-017-1291-7. [DOI] [PubMed] [Google Scholar]
- 40.Schmitt-Engel C, et al. The iBeetle large-scale RNAi screen reveals gene functions for insect development and physiology. Nature communications. 2015;6:7822. doi: 10.1038/ncomms8822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Pauchet Y, Wilkinson P, Chauhan R. Diversity of beetle genes encoding novel plant cell wall degrading enzymes. PloS one. 2010;5:e15635. doi: 10.1371/journal.pone.0015635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Terra, W., Ferreira, C., Jordao, B. & Dillon, R. Digestive enzymes in Biology Of The Insect Midgut (ed. Lehane, M.)153–194 (Springer, 1996).
- 43.Zhu-Salzman K, Zeng R. Insect response to plant defensive protease inhibitors. Annual review of entomology. 2015;60:233–252. doi: 10.1146/annurev-ento-010814-020816. [DOI] [PubMed] [Google Scholar]
- 44.de Assis Fonseca FC, et al. Sugarcane giant borer transcriptome analysis and identification of genes related to digestion. PloS one. 2015;10:e0118231. doi: 10.1371/journal.pone.0118231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Valencia A, et al. Pyrosequencing the Midgut Transcriptome of the banana weevil Cosmopolites Sordidus (Germar) (Coleoptera: Curculionidae) reveals multiple protease-like transcripts. PloS one. 2016;11:e0151001. doi: 10.1371/journal.pone.0151001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Willis JH. Structural cuticular proteins from arthropods: annotation, nomenclature, and sequence characteristics in the genomics era. Insect biochemistry and molecular biology. 2010;40:189–204. doi: 10.1016/j.ibmb.2010.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Chen E-H, et al. Genome-wide annotation of cuticular proteins in the oriental fruit fly (Bactrocera dorsalis), changes during pupariation and expression analysis of CPAP3 protein genes in response to environmental stresses. Insect biochemistry and molecular biology. 2018;97:53–70. doi: 10.1016/j.ibmb.2018.04.009. [DOI] [PubMed] [Google Scholar]
- 48.Cohen E. Chitin synthesis and degradation as targets for pesticide action. Archives of insect biochemistry and physiology. 1993;22:245–261. doi: 10.1002/arch.940220118. [DOI] [PubMed] [Google Scholar]
- 49.Pelosi P, Calvello M, Ban L. Diversity of odorant-binding proteins and chemosensory proteins in insects. Chemical senses. 2005;30:i291. doi: 10.1093/chemse/bjh229. [DOI] [PubMed] [Google Scholar]
- 50.Wang Y, et al. Identification and evolution of olfactory genes in the small poplar longhorn beetle Saperda populnea. Comparative Biochemistry and Physiology Part D: Genomics and Proteomics. 2018;26:58–68. doi: 10.1016/j.cbd.2018.03.003. [DOI] [PubMed] [Google Scholar]
- 51.Li X, et al. Chemosensory gene families in adult antennae of Anomala corpulenta Motschulsky (Coleoptera: Scarabaeidae: Rutelinae) PloS one. 2015;10:e0121504. doi: 10.1371/journal.pone.0121504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Andersson MN, et al. Antennal transcriptome analysis of the chemosensory gene families in the tree killing bark beetles, Ips typographus and Dendroctonus ponderosae (Coleoptera: Curculionidae: Scolytinae) BMC genomics. 2013;14:198. doi: 10.1186/1471-2164-14-198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Zhou, J.-J. Odorant-binding proteins in insects in Vitamins & Hormones (ed. Litwack, G.) 241–272 (Elsevier, 2010). [DOI] [PubMed]
- 54.Dippel S, et al. Tissue-specific transcriptomics, chromosomal localization, and phylogeny of chemosensory and odorant binding proteins from the red flour beetle Tribolium castaneum reveal subgroup specificities for olfaction or more general functions. BMC genomics. 2014;15:1141. doi: 10.1186/1471-2164-15-1141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Zhou J-J, Field LM, He XL. Insect odorant-binding proteins: Do they offer an alternative pest control strategy? Outlooks on Pest Management. 2010;21:31–34. doi: 10.1564/21feb08. [DOI] [Google Scholar]
- 56.Antony B, Johny J, Aldosari SA. Silencing the Odorant Binding Protein RferOBP1768 Reduces the Strong Preference of Palm Weevil for the Major Aggregation Pheromone Compound Ferrugineol. Frontiers in physiology. 2018;9:252. doi: 10.3389/fphys.2018.00252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Dong K, et al. RNAi-induced electrophysiological and behavioral changes reveal two pheromone binding proteins of Helicoverpa armigera involved in the perception of the main sex pheromone component Z11–16: Ald. Journal of chemical ecology. 2017;43:207–214. doi: 10.1007/s10886-016-0816-6. [DOI] [PubMed] [Google Scholar]
- 58.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25:1105–1111. doi: 10.1093/bioinformatics/btp120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Crusoe, M. R. et al. The khmer software package: enabling efficient nucleotide sequence analysis. F1000Research4 (2015). [DOI] [PMC free article] [PubMed]
- 61.Grabherr MG, et al. Nature biotechnology. 2011. Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data; p. 644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Conesa A, et al. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21:3674–3676. doi: 10.1093/bioinformatics/bti610. [DOI] [PubMed] [Google Scholar]
- 63.Zdobnov EM, Apweiler R. InterProScan–an integration platform for the signature-recognition methods in InterPro. Bioinformatics. 2001;17:847–848. doi: 10.1093/bioinformatics/17.9.847. [DOI] [PubMed] [Google Scholar]
- 64.Bateman A, et al. The Pfam protein families database. Nucleic acids research. 2004;32:D138–D141. doi: 10.1093/nar/gkh121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Consortium GO. The Gene Ontology (GO) database and informatics resource. Nucleic acids research. 2004;32:D258–D261. doi: 10.1093/nar/gkh036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic acids research. 2000;28:27–30. doi: 10.1093/nar/28.1.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Gregory T, Nathwani P, Bonnett T, Huber DP. Sizing up arthropod genomes: an evaluation of the impact of environmental variation on genome size estimates by flow cytometry and the use of qPCR as a method of estimation. Genome. 2013;56:505–510. doi: 10.1139/gen-2013-0044. [DOI] [PubMed] [Google Scholar]
- 68.Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC bioinformatics. 2011;12:323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Prüfer K, et al. FUNC: a package for detecting significant associations between gene sets and ontological annotations. BMC bioinformatics. 2007;8:41. doi: 10.1186/1471-2105-8-41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Supek F, Bošnjak M, Škunca N, Šmuc T. REVIGO summarizes and visualizes long lists of gene ontology terms. PloS one. 2011;6:e21800. doi: 10.1371/journal.pone.0021800. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Metsalu T, Vilo J. ClustVis: a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmap. Nucleic acids research. 2015;43:W566–W570. doi: 10.1093/nar/gkv468. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nature methods. 2011;8:785. doi: 10.1038/nmeth.1701. [DOI] [PubMed] [Google Scholar]
- 74.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular biology and evolution. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–1973. doi: 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Untergasser S, et al. Primer3—new capabilities and interfaces. Nucleic acids research. 2012;40:115. doi: 10.1093/nar/gks596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Livak K, Schmittgen T. Analysis of relative gene expression data using real-time quantitative PCR and the 2 (-Delta Delta C (T)) CAS PubMed Article. 2001;4:402–408. doi: 10.1006/meth.2001.1262. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets generated during the current study are available in supplementary information. Raw sequencing and transcriptome assembly data were deposited in NCBI databases, but restrictions apply to the availability of these data. Data will be released for public access once this paper is published.