Abstract
Dragon fruit, pitahaya or pitaya are common names for the species in the Hylocereus group of Selenicereus that produce edible fruit. These Neotropical epiphytic cacti are considered promising underutilized crops and are currently cultivated around the world. The most important species, S. undatus, has been managed in the Maya domain for centuries and is the focus of this article. Transcriptome profiles from stems of wild and cultivated plants of this species were compared. We hypothesized that differences in transcriptomic signatures could be associated with genes related to drought stress. De novo transcriptome assembly and the analysis of differentially expressed genes (DEGs) allowed us to identify a total of 9,203 DEGs in the Hunucmá cultivar relative of wild Mozomboa plants. Of these, 4,883 represent up-regulated genes and 4,320, down-regulated genes. Additionally, 6,568 DEGs were identified from a comparison between the Umán cultivar and wild plants, revealing 3,286 up-regulated and 3,282 down-regulated genes. Approximately half of the DEGs are shared by the two cultivated plants. Differences between the two cultivars that were collected in the same region could be the result of differences in management. Metabolism was the most representative functional category in both cultivars. The up-regulated genes of both cultivars formed a network related to the hormone-mediated signaling pathway that includes cellular responses to auxin stimulus and to hormone stimulus. These cellular reactions have been documented in several cultivated plants in which drought-tolerant cultivars modify auxin transport and ethylene signaling, resulting in a better redistribution of assimilates.
Keywords: Transcriptomics, Domestication, Wild vs cultivars, Cacti, Dragon fruit, Drought tolerance, Pitaya, Maya, Epiphyte
Introduction
Pitahayas are recognized as promising crops in areas with dry climates because they possess adaptations for occupying arid and semi-arid areas, and are creating fresh interest in view of global climate warming (Nerd, Tel-Zur & Mizrahi, 2002; Le Bellec, Vaillant & Imbert, 2006; Mizrahi, 2014; Sosa et al., 2020). The fruit of five species of Neotropical epiphytic cacti in the Hylocereus group of Selenicereus are known as dragon fruit, pitahaya or pitaya (S. costaricencis, S. megalanthus, S. monacanthus, S. ocamponis and S. undatus) (Nerd, Tel-Zur & Mizrahi, 2002; Le Bellec, Vaillant & Imbert, 2006; Mizrahi, 2014). Two species, S. monacanthus and S. undatus, were introduced to Southeast Asia and China and are currently widely cultivated around the world with Vietnam as the leading producer (Mizrahi, 2014; Rodríguez-Canto, 2015; Dahlin et al., 2017).
Pitahayas are considered valuable species, not only as edible fruits but also as a source of compounds such as betalains with beneficial antioxidant properties (Le Bellec, Vaillant & Imbert, 2006; Kamairudin et al., 2014). These components are derived from the pulp and pericarp, and also employed as colorants for food and cosmetics; moreover, the pulp contains abundant pectin (Nerd, Tel-Zur & Mizrahi, 2002; Jiménez-García et al., 2022; Pérez-Orozco & Sosa, 2022). Selenicereus undatus (Haw.) D.R. Hunt is the most widely cultivated species. It develops fruits with a reddish-purple pericarp and whitish pulp (Nerd, Tel-Zur & Mizrahi, 2002; Le Bellec, Vaillant & Imbert, 2006). This species is native to Mexico and Central America and some islands of the Lesser Antilles (Bauer, 2003; Barthlott et al., 2015; Sosa et al., 2020).
Historical evidence indicates that S. undatus was cultivated in pre-Columbian times on the Yucatan Peninsula, documented primarily by chroniclers who came to Mexico in the XVI century such as Diego de Landa who visited the Yucatan in 1560. Diego de Landa in his characterization of pitahayas described the reddish-purple color of the pericarp and the white pulp with small black seeds (Marcus, 1982; Rodríguez-Canto, 2015). Furthermore, it has been proposed that S. undatus was subjected to human selection before 3400 BC in the Maya domain (Colunga-García Marín & Zizumbo-Villarreal, 2004). Paleoethnobotanical analyses in the area of Chunchucmil from the Classic Maya period (A.D. 250–900) suggest that “solares” or homegardens comprised native fruit-bearing trees over which S. undatus was likely grown (Dahlin et al., 2017). Furthermore, pitahayas continue to be cultivated in homegardens or “solares” on the Yucatan Peninsula (Rico-Gray et al., 1990; De Clerck & Negreros-Castillo, 2000; Castro et al., 2018), as well as on small or large plantations (Cálix de Dios, Castillo-Martínez & Caamal-Canché, 2014).
Our focus is to identify genetic divergence among wild and cultivated plants of S. undatus by conducting a transcriptome screen of the stem. Our field work discovered several wild populations of this species in the lowlands of the Gulf of Mexico and on the Yucatan Peninsula. Transcriptomic characterization along with comparative genomics has brought about a revolution in the study of the processes involved in management and domestication in plants and animals (see review by Barrera-Redondo, Piñero & Eguiarte (2020)). Transcriptome research has been carried out in Selenicereus megalanthus, S. monacanthus and S. costaricensis to understand the effect of lighting at night on flowering, the effect of trypsin during storage of fruits (Pang et al., 2020), the response of roots to salt stress (Nong et al., 2019) and the response of plants to cold stress (Zhou et al., 2020) in addition to characterizing the metabolic pathways in betalain biosynthesis in fruits (Quingzhu et al., 2016; Cheng et al., 2020; Li et al., 2020; Hie et al., 2020; Zhang et al., 2021). Moreover, only a few genomes of species in Cactaceae have been assembled; in the case of S. undatus chromosome-scale or plastid genomes were based on cultivars from China such like “Yunnan” and “Guanhuabai” (Chen et al., 2021; Liu et al., 2021). These cultivars are the product of selection focusing on betalain biosynthesis and size of fruits (Chen et al., 2021; Liu et al., 2021). However transcriptomic research in cultivated vs wild S. undatus has not yet been carried out.
Mesoamerica, i.e. southeastern Mexico to the Northwest of Costa Rica, is an important center of plant domestication (Casas et al., 2007; Álvarez-Ríos et al., 2020). In this region, where S. undatus has been cultivated since pre-Columbian times, cultures have manipulated numerous plants using methodical selection, reproductive management techniques and other methods to select important attributes to be maintained in the population (Neto et al., 2016). Furthermore, a number of fruit species have been domesticated by the Maya (e.g. Annona: Larranaga et al., 2017; Caimito: Petersen, Parker & Potter, 2012; Gourd tree: Aguirre-Dugua et al., 2012; Huaya India: Jiménez-Rojas et al., 2019; Mexican plum: Fortuny-Fernández, Ferrer & Ruenes-Morales, 2017). Thus, in this study we explore whether there are unique transcriptomic signatures in the stems of wild and cultivated plants of S. undatus to gain insight into its management. Ecological-based niche modeling forecasted that S. undatus would have the largest predicted distribution, able to withstand dry and semi-dry climates (Sosa et al., 2020). Previous research discovered that variation in organs such as the roots and shoots of useful plants can be attributed to management for adaptation to intermittent drought (e.g. Agave: Huang et al., 2018; Phaseolus: Berny Mier y Teran et al., 2019; Rice: Han et al., 2022).
The objective of this article is to identify the transcriptomic signatures characteristic of wild and cultivated plants in stems of S. undatus, one of the most widely cultivated pitahaya species. Our hypothesis is that cultivated plants exhibit increased transcript levels of genes related to stress tolerance, in particular those involved in the response to hydric stress.
Materials and Methods
Plant materials
In total, eight samples were collected: four wild and four cultivated plants. Wild plants of S. undatus were collected in a ravine in Mozomboa, Veracruz in central Mexico, in the lowlands of the Gulf of Mexico. Plants were growing over legume trees in xerophytic shrubby vegetation, on a steep cliff (C. Ruiz, J. Ornelas & I Acosta 455 XAL) (MO1-, MO1-4, MO2A, MO2D) (Fig. 1). Cultivated plants were collected in two homegardens “solares” on the Yucatan Peninsula, in Hunucmá (HNC-2, HNC-3) and Umán (UMN1, UMN2), Yucatán (Ruiz-Domínguez, Ornelas & V. Sosa 498, 501, XAL) (Fig. 1). Plants were cultivated either on the stone walls that surround these agroecosystems or over legume trees. All collections were cultivated under the same greenhouse conditions for 8 weeks prior to RNA isolation, in a standard substrate (Fig. S1).
Figure 1. Localities of wild and cultivated plants.
Map displaying the locations of the studied wild and cultivated plants of pitahaya (Selenicereus undatus). Wild plants were collected in Veracruz in the lowlands of the Gulf of Mexico in Mozomboa, Veracruz. Plants were growing over legume trees, the fruits are shown. The cultivated plants were collected in two homegardens located in Umán and Hunucmá in the Yucatan Peninsula. They were grown over trees in these agro-ecosystems; flower and fruits are displayed.
Total RNA extraction
Stem tissue (200 mg) from four wild plants (Mozomboa) and two cultivated plants from the two different locations (Hunucmá and Umán) was used. The tissue was ground into a powder using a precooled mortar with liquid nitrogen. The ground powder mix was collected into a microcentrifuge tube and 1 ml TRizol Reagent (Thermo Fisher, Waltham, MA, USA) added. The tubes were vortexed for 30 s and centrifuged at 12,000g for 10 min at 4 °C. The supernatant was transferred into clean microcentrifuge tubes and added was an equal volume of phenol-chloroform-isoamyl alcohol (24:25:1 v/v). The solution was homogenized by vortexing for 30 s and centrifuged 13,000g for 15 min at 4 °C. The aqueous phase was collected in a clean tube and an equal volume of chloroform-isoamyl alcohol (24:25 v/v) added. The tubes were mixed thoroughly by inversion and centrifuged at 13,000g 10 min at 4 °C. The supernatant was recovered in a clean tube, and 750 μl of isopropyl alcohol was added to each tube. Then, the samples were incubated at room temperature for 10 min. After incubation, the tubes were centrifuged at 13,000g for 10 min at 4 °C to obtain pellets of RNA. Finally, the pellets were washed with 75% of EtOH in ultrapure RNase free deionized water treated with DEPC (diethyl pyrocarbonate) and centrifuged at 12,000g for 5 min at 4 °C. RNA was dissolved in RNase-free ddH2O and stored at −80 °C. Total RNA from five samples of each plant was used for library construction. RNA samples were processed by Labsergen (http://labsergen.langebio.cinvestav.mx/en/) for library preparation and sequencing. Libraries were sequenced using the Illumina NextSeq 500 system and paired-end sequencing.
De novo transcriptome assembly and identification of differentially expressed genes (DEGs)
The Illumina NextSeq platform generated approximately 1,200 million reads. Before assembling the raw paired-end reads, we utilized Trimmomatic (ver. 0.40) (Bolger, Lohse & Usadel, 2014) for removal of adaptors and quality filtering (options: SLIDINGWINDOW:4:5 LEADING:5 TRAILING:5 MINLEN:25) (Bolger, Lohse & Usadel, 2014). The quality of raw reads was assessed by FastQC tools (http://www.bioinformatics.babraham.ac.uk/projects/fastqc). After cleaning and filtering the raw reads, they were assembled using Trinity (ver 2.13.1) (Haas et al., 2013) using default parameters for the paired-end assembly method to obtain the differentially expressed genes (Haas et al., 2013). Bowtie was used for reading the alignment against the de novo assembled reference transcriptome and RSEM (ver.1.3.3) (Li & Dewey, 2011) was used to estimate expression level values of alignments from each sample and among biological replicates (Li & Dewey, 2011). We used edgeR (R Bioconductor tools, Reilingen, Germany) (McCarthy, Chen & Smyth, 2012) for the identification of differentially expressed genes (DEGs). Normalization expression values were obtained by TMM normalization (Oshlack & Robinson, 2010). The sequencing data was deposited into NCBI database under ID BioProject PRJNA853281. We used the RnaSeqSampleSize method (Zhao et al., 2018), which is based on the distribution of average gene reads and dispersion from real RNA-seq data, to estimate power and determine if the sample size was sufficient.
Functional annotation and classification
Functional annotation of the transcripts was obtained after the Trinity assembly was carried out with the Trinotate v.3.1.1 pipeline (http://trinotate.github.io). We used the set of open reading frames (ORFs) generated by TransDecoder v5.5.0 (TransDecoder. https://transdecoder.github.io/) to perform a protein-protein and transcript-protein search, BLASTP and BLASTX (e-value 1e−5) respectively, using public databases UniProt/Swiss-Prot protein (http://www.uniprot.org/). Protein domains were identified and annotated using HMMER v3.3 against the Pfam dataset (Finn, Clements & Eddy, 2011). Additionally, transmembrane regions were predicted using TMHMM v2.0 (Krogh et al., 2001); signal peptide prediction was determined using signalP v5.0 (Petersen et al., 2011). RNAmmer v1.2 was used to detect ribosomal RNA genes (Lagesen et al., 2007). Annotation outputs were reported using Trinotate SQLite database. The enrichment analysis was performed using the Cytoscape V3.8.0 (Shannon et al., 2003) and BiNGO (Biological Network Gene Ontology) (Maere, Heymans & Kuiper, 2005), with a hypergeometric statistical test and a p-value of 0.05. We also used the complete annotation file previously prepared with Trinotate as a custom reference. KEGG pathway analysis was performed by KASS (KEGG Automatic Annotation Server https://www.genome.jp/kaas-bin/kaas_main) using the BBH method (Moriya et al., 2007).
Transcriptome completeness was assessed using BUSCO v.3.0.1 (Benchmarking Universal Single-Copy Orthologs) to obtain the percentage of single-copy orthologues from two different data sets: viridiplantae_odb10 and eudicotyledons_odb10 (created in 2017).
We used the Plant Transcription Factor Database V5.0 (http://planttfdb.gao-lab.org/index.php) to assign DEGs to different transcription factor (TF) families by BLASTP and BLASTX (e-value 1e−5) against databases from three different plants belonging to the order Caryophyllales—the same order as pitahayas—(i.e. Amaranthus hypochondriacus, Beta vulgaris and Dianthus caryophyllus) and also from Arabidopsis thaliana, since it represents the best annotated angiosperm genome. The Transcriptome Shotgun Assembly project was deposited at DDBJ/EMBL/GenBank under the accession GKDI00000000. Annotations including gene ontology (GO) are included in Supplemental Material.
Results
Transcriptome sequencing, de novo assembly and functional annotation
A total 36,476,238 raw reads were obtained from eight libraries. After trimming the adaptors and filtering low-quality reads we obtained 29,662,120 (~81%) high-quality paired reads. A total of 406,766 transcripts and 230,996 assembled genes were identified with a median length of 240 base pairs (bp) and an N50 length of 941 bp. The assembled reference transcriptome was 204 Mb with a 43% of GC content. To evaluate the completeness of the assembly, we compared it against two different plant databases (viridiplantae and eudicotyledons) using BUSCO (Simão et al., 2015). Our results show that 56.1% of BUSCO genes are complete when compared against the viridiplantae set, 32.8% are fragmented and 11.1% are missing. From the comparison against the eudicot set, 55.2% are complete, 24.8% are fragmented and 20.0% are missing (Table S1). Based on the Trinotate pipeline, we were able to identify a total of 58,574 nucleotide sequences and 34,767 protein sequences that are related to different biological processes and molecular functions. In the full transcriptome assembly, the two most overrepresented terms within the category of molecular function are ATP binding and metal ion binding. While in the category of biological processes the four most overrepresented terms are photosynthesis, tricarboxylic acid cycle, glycolytic process and transcription DNA-templated (Fig. S2A). Two domains were found with higher frequency in silico translated transcripts: the protein kinase domain (PF00069.25/Pkinase) and the pentatricopeptide repeats (PF13041.6/PPR2), followed by leucine-rich repeat domain (PF13855.6/LRR_8, PF12799.7/LRR_4), RNA-binding domains (PF00076.22/RRM_1), WD domains (PF00400.32/WD40), as well as a p450 domain (PF00067.22/p450) (Fig. S2B).
Analysis of DEGs
A total of 9,203 differentially expressed genes (DEGs) with a p-value (p) < 0.001 and a false discovery rate (FDR) < 0.01 were identified in the Hunucmá cultivar compared with wild plants of Mozomboa. DEGs were distributed as follows: 4,883 up-regulated and 4,320 down-regulated genes. We identified 6,568 DEGs in the Umán cultivar relative to wild plants, and of these, 3,286 correspond to up-regulated genes and 3,282 to down-regulated genes. Gene expression profiles from each sample are shown in a heatmap plot (Fig. 2A). These results show differences between the two cultivated plants. The heatmap and the Pearson’s correlation showed clustering between the replicas for each of the samples and the groups of samples distributed between the two PCs; PC1 represents 36.15% of the variance and PC2 represents an additional 17.78%. When we compared the two sets of DEGs to detect the genes shared among the cultivated plants, we identified 1,246 up-regulated and 1,960 down-regulated genes (Figs. 2B, 2C). The expression profile of DEGs among cultivated plants is very similar (Fig. 2D). Similarity among wild and cultivated plants in a matrix heatmap is shown in Fig. S2. Distribution of DEGs in the cultivars is shown in Fig. S3. The statistical power of this experimental design calculated in RnaSeqSampleSize is 0.57.
Figure 2. Overview of the differentially expressed transcripts from stems.
Overview of the differentially expressed transcripts from stems of Selenicereus undatus. They were estimated in cultivated plants (Hunucmá and Umán) and in wild plants (Mozonboa). (A) Hierarchical clustering analysis of differentially expressed transcripts. Heatmap of relative expression levels for each transcript. The color key indicates the expression levels quantified as log2-FPKM transformation. The wild plants from Mozomboa (MO1-1, MO1-4, MO2A, MO2D), cultivated plants from Hunucmá (HNC-2, HNC-3) and Umán (UMN1, UMN2). (B) Venn diagram showing the number of up-regulated genes in Hunucmá and Umán cultivars. (C) Venn diagram showing the number of down-regulated genes in Hunucmá and Umán cultivars. (D) Heatmap of shared differentially expressed genes (1,246 up-regulated genes and 1,960 down-regulated genes) in cultivated plants from Hunucmá and Umán.
DEGs: functional classification
A total of 2,469 genes from Hunucmá and 1,716 genes from Umán were assigned to the categories: metabolism, environmental information processing, genetic information processing and cellular processes. The profiles of down-regulated genes are very similar in both cultivated plants; Hunucmá has 1,076 genes and Umán 1,042 genes distributed in different categories. However, the profiles of up-regulated genes differ for the cultivated plants: Hunucmá has 1,393 genes, while Umán only has 674 genes distributed across all categories. The main differences can be observed in: cell growth and death and signal transduction. The most overrepresented category in both cultivated plants is metabolism with 1,750 genes from Hunucmá and 1,334 genes from Umán, and within this category 858 and 665 genes from Hunucmá and Umán respectively, are located in global and overview maps, where the most representative modules are metabolic pathways and biosynthesis of secondary metabolites. For the carbohydrate metabolism pathway, there are 265 and 200 Hunucmá and Umán genes, respectively (Fig. 3).
Figure 3. Functional classification of differentially expressed genes (DEGs) in the Hunucmá and Umán cultivated plants.
Barplots indicate the KEGG metabolic pathway annotation of DEGs across four different categories.
GO functional annotation analysis of DEGs showed that in the biological process category the response to stress and response to stimulus are the most significant term for down-regulated genes in both cultivated plants, whereas in the up-regulated genes the enriched GO terms in Hunucmá and Umán are related with cellular process and signaling. This last term is connected to signaling pathway and hormone-mediated signaling pathway, which in turn interact with cellular response to hormone stimulus and auxin mediated signaling pathway. In the molecular function category, the down-regulated genes are related to catalytic activity, enzyme regulator activity and hydrolase activity in both cultivated plants. Nevertheless, the up-regulated genes are involved in binding, a term which is composed of a subset of terms like binding nucleoside and binding nucleotide. The hydrolase activity pathway is also present in up-regulated genes connected by catalytic activity term (Figs. S4 and S5).
DEGs: annotation of transcription factors families
Transcription factors (TFs) are proteins that play an important role during the regulation of gene expression. A total of 221 and 121 DEGs of Hunucmá and Umán, respectively, were identified as TFs (Figs. S6, S7): 38 families were identified in Hunucmá and 30 families in Umán. Most of the DEGs identified as TFs are members of the bHLH (basic helix-loop-helix), MYB (myeloblastosis), ERF (ethylene response factor), NAC [NAM (no apical meristem, Petunia), ATAF1–2 (Arabidopsis thaliana activating factor), and CUC2 (cup-shaped cotyledon, Arabidopsis)], C2H2 (Cys2-His2, zinc finger proteins), HD-ZIP (homeodomain-leucine zipper), WRKY (WRKYGQK domain), bZIP (basic leucine zipper) and ARF (auxin response factor), CAMTA (calmodulin-binding transcription activator) families, among others. The TF family with the highest number of DEGs in Hunucmá is the bHLH family with 28 up-regulated genes and in Umán with 11 up-regulated and four down-regulated genes. The bHLH proteins play an important role in the regulation of growth and development, as well as the response to stress in plants. The second family of TFs with the highest number of DEGs is the MYB family with 20 up-regulated and four down-regulated genes in Hunucmá, while in Umán there were eight up-regulated and four down-regulated MYB genes. Hunucmá has 100 more DEGs than Umán, identified as TFs and due to this result other TF families with more than 10 DEGs can be observed in Hunucmá. ERF family has the third most DEGs with 13, followed by NAC and HD-ZIP with 13 and C2H2 with 11 DEGs (Fig. 4).
Figure 4. Transcription factor families differentially expressed in cultivated plants.
Boxplots show of log2FC value distribution of differentially expressed TFs with pVal < 0.001. The box are proportional to square-roots of the numbers of TFs per family; the crosses indicate the mean of the samples; the box limits indicate the 25th and 75th percentiles as determined by R software (http://shiny.chemgrid.org/boxplotr/). Tukey test to define the whiskers extended in 1.5 times the interquartile range from the 25th and 75th percentiles; the open circles represent the data points and the number of sample points are on the y-axis.
The DEGs shared between the two cultivated plants identified a total of 39 associated with 22 different families of TFs. Twenty-five genes are up-regulated and eight down-regulated, while six have different expression profiles (Fig. 5A). The two families with the highest number of DEGs are bHLH with five and ERF with four. All the genes identified as bHLH are up-regulated, while among the four genes in the ERF family, two are up-regulated and two are down-regulated (Fig. 5B). Different members of the ERF family have been reported as key regulators of various stress responses in plants (Mizoi, Shinozaki & Yamaguchi-Shinozaki, 2012; Xie et al., 2019). Other TF families involved in the response to different types of stress identified with DEGs shared between the two cultivated pitaya plans are WRKY, NAC, MYB and CAMTA.
Figure 5. Differentially expressed transcription factors (TFs) shared between Hunucmá and Umán plants.
(A) UpSet diagram showing shared and unique differentially expressed TFs between Hunucmá and Umán plants, the diagram shows the unique TFs in each cultivated plants. (B) Expression levels of the TFs that are shared in both cultivated plants. The right side bar shows the rows corresponding to the genes per TFs family identified in both cultivated plants.
Discussion
DEGs: differentially expressed genes
Transcriptomic analyses of this project found differences in gene expression levels from the stems of two cultivated plants compared to wild-type plants of S. undatus. This data introduces basic transcriptional information for further research on cacti that produce edible fruit. Interestingly, 2.11% of the genes increased their expression in the stems of cultivated plants from Hunucmá and 1.42% of the Umán cultivar from homegardens, compared to the stems of wild plants. In contrast, 1.87% of the genes decreased their expression in Hunucmá and 1.42% in Umán plants. A number of studies of edible fruit have shown that using either transcriptomic screening or other molecular methods it is possible to understand divergence in genes from cultivated and wild plants, as has this study (e.g. Chichipe: Otero-Arnaiz et al., 2005; Stenocereus: Parra et al., 2010; Ruán-Tejeda et al., 2014; Loquat: Jiang et al., 2016; Strawberry: Qiao et al., 2016, Breadfruit: Laricchia et al., 2018; Prickle pear: López-Palacios et al., 2018; Chia: Peláez et al., 2019; Eggplant: Page et al., 2019; Persimmon: Guan et al., 2017).
DEGs: functional classification and family genes
Approximately half of the DEGs are shared between the two cultivated plants and probably the rest of the differences between the cultivars might be related to human management over time. Our results indicate that metabolism was the most representative category according to the functional classification of KEGG (the database for understanding high-level functions and utilities of the biological system: https://www.genome.jp/kegg/) and similar patterns of down-regulated genes were found between wild and cultivated plants. In contrast, expression levels of genes related to cellular processes are higher in Hunucmá than in Umán plants.
Interestingly, one group of shared genes shows a very similar pattern of expression. We identified that the shared up-regulated genes in the category of biological process are enriched in terms of signaling pathway, which in turn are connected with the hormone-mediated signaling pathway that includes cellular responses to auxin stimulus and cellular responses to hormone stimulus. These cellular reactions have been documented in several cultivated plants such as barley, in which drought-tolerant cultivars modify auxin transport and ethylene signaling, resulting in a better redistribution of assimilates (Hong, Ni & Zhang, 2020; Qiu et al., 2020). Furthermore, evidence of selection of drought-resistance genes has been documented in cultivated plants such as weedy rice (Han et al., 2022), cotton (Shim, Bandillo & Angeles-Shim, 2021) and moth bean (Yundaeng et al., 2019). Thus, the enrichment of genes found in the cultivated plants of pitahaya might be related to the response to drought.
Worldwide, abiotic stress is one of the main limitations for growth and development of crops (Dresselhaus & Hückelhoven, 2018). Moreover, plants can respond to abiotic stress using different strategies that include transcriptional control of different gene networks through the expression of transcription factors (He et al., 2016; Arisha et al., 2020). Dragon fruits are epiphytic cacti which can grow under different environmental conditions, including areas under dry conditions, due to their high efficiency in water consumption and reduction of water loss due to their CAM photosynthesis, which allows them to assimilate carbon with the stomata closed during the day (Nerd, Tel-Zur & Mizrahi, 2002). In plants with CAM photosynthesis, some families of transcription factors that change their expression under drought stress have been recorded. For example, Agave sisalana subjected to drought stress, shows an increase in the expression of the transcription factor families MYB, AP2/ERF and bHLH (Sarwar et al., 2018). Also, in Cynanchum thesioides a plant from the arid and semi-arid areas of Asia, hundreds of DEGs were identified under severe drought stress including transcription factors belonging to the MYC and ERF families (Zhang et al., 2019).
DEGs: transcription factor families
TFs such as bHLH, NAC, B21P, BH21P related to abiotic stress were overexpressed in the cultivated plants from the homegardens of the Yucatan in comparison with the wild type plants from Veracruz. Furthermore, there were differences in expression of these TFs between the cultivars from Umán and Hunucmá, which are located in the same region of the Yucatan. TF divergences in domesticated plants have been encountered in many species, in particular to plants grown under abiotic stress such as chickpea (Moenga et al., 2020) or the ramie (Li et al., 2014). Therefore, it can be hypothesized that the origin and management of the plants in Hunucmá and Umán has been different. TFs related to biotic stress are adaptive, and moreover from our observations of the cultivated collections under greenhouse conditions prior to extracting RNA, more secondary roots and secondary growth in Hunucmá plants were observed in comparison with the Umán plants. Additional samples of cultivated pitahayas on the Yucatan Peninsula and additional transcriptomic research can bring a deeper understanding of the management of S. undatus by the Maya.
Importance and conclusions
De novo transcriptome assembly has evolved as a resourceful and effective approach for answering evolutionary questions (Hölzer, 2020). Functional genomics resources broaden the knowledge of the genes linked to horticultural and agronomically important traits (Rai & Shekhawat, 2014). Transcriptomic research has been conducted on S. megalanthus, S. costaricensis and S. monacanthus to address a number of physiological processes. For instance, transcriptomic research has investigated the effect of lighting at night, the trypsin effect in fruit storage (Pang et al., 2020), the response of roots to salt stress (Nong et al., 2019), the response of plants to cold stress (Zhou et al., 2020), and to characterize metabolic pathways in betalain biosynthesis in fruits (Quingzhu et al., 2016; Cheng et al., 2020; Li et al., 2020; Hie et al. 2020). Recently, for S. undatus, a chromosome level genome found genome duplication of betacyanin biosynthetic genes (Zheng et al., 2021).
The totality of the transcriptomic and genomic research in S. costaricensis, S. monacanthus, and S. undatus has been conducted in plants cultivated in Asia, mostly in China, where large plantations provide produce for regional markets (Yu et al., 2021). The aim of the research conducted here was to compare the transcriptomic signal in wild vs cultivated plants of S. undatus in their native range with confidence in the origin and identification of plants.
Our analyses were performed before the release of the genome of S. undatus (Zheng et al., 2021), therefore we used de novo transcript assembly. This research is the first to investigate transcriptomic signal in stems of wild plants of S. undatus. Plants were collected in the north of the Gulf of Mexico Province, in ravines with tropical dry forests, growing over several legume trees in areas with annual average temperature of 30 °C and annual precipitation of 1,500 mm (Sosa et al., 2020). In contrast, cultivated plants were collected in the Maya domain, in the Yucatan Peninsula, in “solares” or homegardens where they have been managed and cultivated for centuries (Colunga-García Marín & Zizumbo-Villarreal, 2004). Pitahayas are commonly grown by the Maya in the Peninsula over stone walls without shade of trees where annual temperature is higher, up to 35 °C (Castro et al., 2018). S. undatus is planted as well in semiarid regions in the north of Mexico in zones with annual precipitation of 690 mm (Osuna-Enciso et al., 2016). Wild and cultivated plants utilized here were maintained under the same greenhouse conditions for 3 months before conducting our research to identify transcriptomic signatures. Therefore, we expected to find differences in genes related to the plants’ ability to withstand drier habitats comparing wild and cultivated plants. Our results indicate that there are differences in TFs related to biotic stress, in wild and even between cultivated plants. Root and fruit transcriptomics will be further investigated to gain a broader perspective on how people in Mexico have managed this pitahaya, and to be able to propose new strategies and germplasm for cultivating this economically important species.
Fruit crops can provide health and nutrition, they are a rich source of carbohydrates, fats, proteins, energy, vitamins and minerals, as well as dietary fiber. They can prevent various diseases, including diabetes, anemia, and hypertension. They have the potential to provide a source of income and can be sold fresh or in value-added products (Meldrum et al., 2018). There are many neglected and underutilized species that have potential to diversify not only the human diet, but also to increase food production levels, and thus make more sustainable and resilient agro-systems possible (Baldermann et al., 2016). Therefore, research on plants such as pitahayas reveals alternatives to healthy and functional diets with increased value since not only the fruit, but also the by-products of the peels have potential phytochemical benefits.
Supplemental Information
Acknowledgments
We thank Israel Acosta, Catalina Ruiz-Domínguez and Diego F. Angulo their help in the field. The authors also thank the Institute of Ecology A.C. (INECOL) for the access to the high-performance computing system (HUITZILIN) required to carry out this research, and Emanuel Villafán for his assistance as the administrator of this resource. Bianca Delfosse reviewed the English of our manuscript.
Funding Statement
This work was supported by Consejo Nacional de Ciencia y Tecnología, Mexico, CONACYT (PNDP2015/1023) (PNDP2015/1023). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Additional Information and Declarations
Competing Interests
Victoria Sosa is an Academic Editor for PeerJ.
Author Contributions
Omar Oltehua-López conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.
Mario A. Arteaga-Vázquez conceived and designed the experiments, authored or reviewed drafts of the article, and approved the final draft.
Victoria Sosa conceived and designed the experiments, performed the experiments, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.
DNA Deposition
The following information was supplied regarding the deposition of DNA sequences:
The sequencing data is available at NCBI PRJNA853281.
The Transcriptome Shotgun Assembly project is available at DDBJ/EMBL/GenBank: GKDI00000000.
Data Availability
The following information was supplied regarding data availability:
The molecular data is available at figshare: Oltehua-López, Omar; Sosa, Victoria; A. Arteaga-Vazquez, Mario (2022): data_VS_SUn. figshare. Dataset. https://doi.org/10.6084/m9.figshare.20173444.v1
References
- Aguirre-Dugua et al. (2012).Aguirre-Dugua X, Eguiarte LE, González-Rodríguez A, Casas A. Round and large: morphological and genetic consequences of artificial selection of the gourd tree Crescentia cujete by the Maya of the Yucatan Peninsula. Annals of Botany. 2012;109(7):1297–1306. doi: 10.1093/aob/mcs068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Álvarez-Ríos et al. (2020).Álvarez-Ríos GD, Pacheco-Torres F, Figueredo-Urbina CJ, Casas A. Management, morphological and genetic diversity of domesticated agaves in Michoacán, México. Journal of Ethnobiology and Ethnomedicine. 2020;16(1):3. doi: 10.1186/s13002-020-0353-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arisha et al. (2020).Arisha MH, Aboelnasr H, Ahmad MQ, Liu Y, Tang W, Gao R, Yan H, Kou M, Wang X, Zhang Y, Li Q. Transcriptome sequencing and whole genome expression profiling of hexaploidy sweet potato under salt stress. BMC Genomics. 2020;21(1):197. doi: 10.1186/s12864-020-6524-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baldermann et al. (2016).Baldermann S, Blagojevic L, Frede K, Klopsch R, Neugart S, Neumann B, Ngwene B, Norkeweit J, Schöter D, Schöter A, Schweigert FK, Wiesner M, Schreiner M. Are neglected plants the food for the future? Critical Reviews in Plant Sciences. 2016;35(2):106–119. doi: 10.1080/07352689.2016.1201399. [DOI] [Google Scholar]
- Barrera-Redondo, Piñero & Eguiarte (2020).Barrera-Redondo J, Piñero D, Eguiarte LE. Genomic, transcriptomic and epigenomic tools to study the domestication of plants and animals: a field guide to beginners. Frontiers in Genetics. 2020;11:742. doi: 10.3389/fgene.2020.00742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barthlott et al. (2015).Barthlott W, Burstedde K, Geffert JL, Ibisch PL, Korotkova N, Miebach A, Rafiqpoor MD, Stein A, Mutke J. Distribution maps of Cactaceae. Schumannia. 2015;7:82–90. [Google Scholar]
- Bauer (2003).Bauer R. A synopsis of the tribe Hylocereeae F.B. Cactaceae systematics initiatives. Cactaceae Systematics Initiatives. 2003;17:29. [Google Scholar]
- Berny Mier y Teran et al. (2019).Berny Mier y Teran JC, Konzen ER, Medina V, Palkovic A, Ariani A, Tsai SM, Gilbert ME, Gepts P. Root and shoot variation in relation to potential intermittent drought adaptation of Mesoamerican wild common bean (Phaseolus vulgaris L.) Annals of Botany. 2019;124(6):917–932. doi: 10.1093/aob/mcy221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bolger, Lohse & Usadel (2014).Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cálix de Dios, Castillo-Martínez & Caamal-Canché (2014).Cálix de Dios H, Castillo-Martínez R, Caamal-Canché HJ. Caracterización de la producción de pitahayas (Hylocereus spp.) en la zona Maya de Quintana Roo. Agroecología. 2014;9:123–132. [Google Scholar]
- Casas et al. (2007).Casas A, Otero-Arnaiz A, Pérez-Negrón E, Valiente-Banuet A. In situ management and domestication of plants in Mesoamerica. Annals of Botany. 2007;100(5):1101–1115. doi: 10.1093/aob/mcm126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Castro et al. (2018).Castro A, Lascurain-Rangel M, Gómez-Díaz JA, Sosa V. Mayan homegardens in decline: the case of the pitahaya (Hylocereus undatus), a vine cactus with edible fruit. Tropical Conservation Science. 2018;11(3):1–10. doi: 10.1177/1940082918808730. [DOI] [Google Scholar]
- Chen et al. (2021).Chen J, Xie F, Cui Y, Chen C, Lu W, Hu X, Hua Q, Wu Z, Gao D, Zhang Z, Jiang W, Sun Q, Hu H, Qin Y. A chromosome-scale genome sequence of pitaya (Hylocereus undatus) provides novel insights into the genome evolution and regulation of betalain biosynthesis. Horticulture Research. 2021;8(1):164. doi: 10.1038/s41438-021-00612-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng et al. (2020).Cheng C, Xie F, Hua Q, Tel Zur N, Zhang L, Zhang Z, Zhang R, Zhao J, Hu G, Qin Y. Integrated sRNAome and RNA-Seq analysis reveals miRNA effects on betalain biosynthesis in pitaya. BMC Plant Biology. 2020;20:437. doi: 10.21203/rs.2.24037/v3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Colunga-García Marín & Zizumbo-Villarreal (2004).Colunga-García Marín P, Zizumbo-Villarreal D. Domestication of plants in Maya lowlands. Economic Botany. 2004;58:S101–S110. doi: 10.1663/00130001(2004)58[S101:DOPIML]2.0.CO;2. [DOI] [Google Scholar]
- Dahlin et al. (2017).Dahlin BH, Ardren T, Hixson DR, Andrews AP. Perishable resources produced for exchange in the Chunchucmil region. In: Hutson SR, editor. Ancient Maya Commerce. Boulder: University Press of Colorado; 2017. pp. 221–239. [Google Scholar]
- De Clerck & Negreros-Castillo (2000).De Clerck FA, Negreros-Castillo P. Plant species of traditional Mayan homegardens of Mexico as analogs for multistrata agroforests. Agroforestry Systems. 2000;48(11):303–317. doi: 10.3390/agronomy8110267. [DOI] [Google Scholar]
- Dresselhaus & Hückelhoven (2018).Dresselhaus T, Hückelhoven R. Biotic and abiotic stress response in crop plants. Agronomy. 2018;8(11):267. doi: 10.3390/agronomy8110267. [DOI] [Google Scholar]
- Finn, Clements & Eddy (2011).Finn RD, Clements J, Eddy SR. HMMER web server: interactive sequence similarity searching. Nucleic Acids Research. 2011;39:W29–W37. doi: 10.1093/nar/gkr367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fortuny-Fernández, Ferrer & Ruenes-Morales (2017).Fortuny-Fernández NM, Ferrer M, Ruenes-Morales MR. Origin domestication and genetic diversity centers of the Mexican plum, Spondias purpurea (Anacardiaceae) Acta Botanica Mexicana. 2017;121:7–38. doi: 10.21829/abm121.2017.1289. [DOI] [Google Scholar]
- Guan et al. (2017).Guan C, Du X, Zhang Q, Ma F, Luo Z, Yang Y. DkPK genes promote natural deastringency in C-PCNA persimmon by up-regulating DkPCC and DkADH expression. Frontiers in Plant Science. 2017;8:149. doi: 10.3389/fpls.2017.00149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haas et al. (2013).Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Philip D, Bowden J, Couger MB, Eccles D, Li B, Macmanes MD, Ott M, Orvis J, Pochet N, Strozzi F, Weeks N, Westerman R, William T, Dewey CN, Henschel R, Leduc RD, Friedman N, Regev A. De novo transcript sequence reconstruction from RNA-Seq: reference generation and analysis with Trinity. Nature Protocols. 2013;8(8):1494–1512. doi: 10.1038/nprot.2013.084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Han et al. (2022).Han B, Ma XD, Cui D, Wang YJ, Gen LY, Cao GL, Zhang H, Koh HJ, Han LZ. Analysis of evolutionary relationships provides new clues to the origins of weedy rice. Ecology and Evolution. 2022;10(2):891–900. doi: 10.1002/ece3.5948. [DOI] [PMC free article] [PubMed] [Google Scholar]
- He et al. (2016).He Q, Jones DC, Li W, Xie F, Ma J, Sun R, Wang Q, Zhu S, Zhang B. Genome-wide identification of R2R3-MYB genes and expression analyses during abiotic stress in Gossypium raimondii. Scientific Reports. 2016;6(1):22980. doi: 10.1038/srep22980. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hie et al. (2020).Hie F, Hua Q, Chen C, Zhang L, Zhang Z, Chen J, Zhang R, Zhao J, Hu G, Zhao J, Qin Y. Transcriptomics-based identification and characterization of glucosyltransferases involved in betalain biosynthesis in Hylocereus megalanthus. Plant Physiology and Biochemistry. 2020;152(39):112–124. doi: 10.1016/j.plaphy.2020.04.023. [DOI] [PubMed] [Google Scholar]
- Hölzer (2020).Hölzer M. A decade of de novo transcriptome assembly: are we there yet? Molecular Ecology Resources. 2020;21(1):11–13. doi: 10.1111/1755-0998.13268. [DOI] [PubMed] [Google Scholar]
- Hong, Ni & Zhang (2020).Hong Y, Ni SJ, Zhang G-P. Transcriptome and metabolome analysis reveals regulatory networks and key genes controlling barley malting quality in response to drought stress. Plant Physiology and Biochemistry. 2020;152:1–11. doi: 10.1016/j.plaphy.2020.04.029. [DOI] [PubMed] [Google Scholar]
- Huang et al. (2018).Huang X, Wang B, Xi J, Zhang Y, He C, Zheng J, Gao J, Chen H, Zhang S, Wu W, Liang Y, Yi K. Transcriptome comparison reveals distinct selection in domesticated and wild Agave species, the important CAM plants. International Journal of Genomics. 2018;2018:5716518. doi: 10.1155/2018/5716518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiang et al. (2016).Jiang S, Luo J, Xu F, Zhang X. Transcriptome analysis reveals candidate genes involved in gibberellin-induced fruit setting in triploid loquat (Eryobotrya japonica) Frontiers in Plant Science. 2016;7:1924. doi: 10.3389/fpls.2016.01924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiménez-García et al. (2022).Jiménez-García SN, Garcia-Mier L, Ramirez-Gomez XS, Aguirre-Becerra H, Escobar-Ortiz A, Contreras-Medina LM, Garcia-Trejo JF, Feregrino-Perez AA. Pitahaya peel: a by-product with great phytochemical potential, biological activity, and functional application. Molecules. 2022;27:5339. doi: 10.3390/molecules271653339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiménez-Rojas et al. (2019).Jiménez-Rojas MI, Martínez-Castillo J, Potter D, Dzib GR, Ballina-Gómez HS, Latournerie-Moreno L, Andueza-Noh RH. Morphological diversity of Huaya India fruits (Melicoccus oliviformis Kunth) in the Maya lowlands. Genetic Resources and Crop Evolution. 2019;66(2):513–522. doi: 10.1007/s10722-018-00731-z. [DOI] [Google Scholar]
- Kamairudin et al. (2014).Kamairudin N, Gani SSAG, Masoumi HRF, Hashim P. Optimization of natural lipstick formulation based on pitaya (Hylocereus polyrhizus) seed oil using D-optimal mixture experimental design. Molecules. 2014;19(10):16672–16683. doi: 10.3390/molecules191016672. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krogh et al. (2001).Krogh A, Larsson B, Von Heijne G, Sonnhammer ELL. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. Journal of Molecular Biology. 2001;305(3):567–580. doi: 10.1006/jmbi.2000.4315. [DOI] [PubMed] [Google Scholar]
- Lagesen et al. (2007).Lagesen K, Hallin P, Rødland EA, Stærfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Research. 2007;35(9):3100–3108. doi: 10.1093/nar/gkm160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Laricchia et al. (2018).Laricchia K, Johnson MG, Ragone D, Williams EW, Zerega NJC, Wickett NJ. A transcriptome screen for positive selection in domesticated breadfruit and its wild relatives (Artocarpus spp.) American Journal of Botany. 2018;105(5):915–926. doi: 10.1002/ajb2.1095. [DOI] [PubMed] [Google Scholar]
- Larranaga et al. (2017).Larranaga N, Albertazzi FJ, Fontecha G, Palmieri M, Rainier H, van Zonneveld M, Hormaza JI. A Mesoamerican origin of cherimoya (Annona cherimola Mill.): implications for the conservation of plant genetic resources. Molecular Ecology. 2017;26(16):4116–4130. doi: 10.1111/mec.14157. [DOI] [PubMed] [Google Scholar]
- Le Bellec, Vaillant & Imbert (2006).Le Bellec F, Vaillant F, Imbert E. Pitahaya (Hylocereus spp.): a new fruit crop, a market with future. Fruits. 2006;61(4):237–250. doi: 10.1051/fruits:2006021. [DOI] [Google Scholar]
- Li & Dewey (2011).Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12(1):323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li et al. (2020).Li X, Liu X, Pang X, Yin Y, Yu Y, Yu H, Yuan Y, Li B. Transcriptomic analysis reveals hub genes and subnetworks related to ROS metabolism in Hylocereus undatus through novel superoxide scavenger trypsin treatment during storage. BMC Genomics. 2020;21:437. doi: 10.21203/rs.2.14059/v3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li et al. (2014).Li T, Tang S, Zhu S, Tanb Q, Zheng X. Transcriptome comparison reveals the patterns of selection in domesticated and wild ramie (Boehmeria nivea L. Gaud) Plant Molecular Biology. 2014;86(1–2):85–92. doi: 10.1007/s11103-014-0214-9. [DOI] [PubMed] [Google Scholar]
- Liu et al. (2021).Liu J, Liu ZY, Zheng C, Niu YF. Complete chrloroplast genome sequence and phylogenetic analysis of dragon fruit (Selenicereus undatus (Haw.) D.R. Hunt) Mitochondrial DNA B Resources. 2021;24(3):1154–1156. doi: 10.1080/23802359.2021.1903356. [DOI] [PMC free article] [PubMed] [Google Scholar]
- López-Palacios et al. (2018).López-Palacios C, Reyes-Agüero JA, Peña-Valdivia CB, Aguirre-Rivera JR. Physical characteristics of fruits and seed of Opuntia sp. as evidence of changes though domestication in the Southern Mexican Plateau. Genetic Resources and Crop Evolution. 2018;66(2):349–362. doi: 10.1007/s10722-018-0712-8. [DOI] [Google Scholar]
- Maere, Heymans & Kuiper (2005).Maere S, Heymans K, Kuiper M. BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics. 2005;21(16):3448–3449. doi: 10.1093/bioinformatics/bti551. [DOI] [PubMed] [Google Scholar]
- Marcus (1982).Marcus J. The plant world of the sixteenth- and seventeenth-century Lowland Maya. In: Flannery KV, editor. Maya Subsistence. New York: Academic Press; 1982. pp. 239–273. [Google Scholar]
- McCarthy, Chen & Smyth (2012).McCarthy DJ, Chen Y, Smyth GK. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Research. 2012;40(10):4288–4297. doi: 10.1093/nar/gks042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meldrum et al. (2018).Meldrum G, Padulosi S, Lochetti G, Roobitaille R, Dlulgheroff S. Issues and prospects for the sustainable use and conservation of cultivated vegetable diversity for more nutrition-sensitive agriculture. Agriculture. 2018;8(7):112. doi: 10.3390/agriculture8070112. [DOI] [Google Scholar]
- Mizoi, Shinozaki & Yamaguchi-Shinozaki (2012).Mizoi J, Shinozaki K, Yamaguchi-Shinozaki K. AP2/ERF family transcription factors in plant abiotic stress. Biochimica et Biophysica Acta (BBA) Gene Regulatory Mechanisms. 2012;1819:86–96. doi: 10.1016/j.bbagrm.2011.08.004. [DOI] [PubMed] [Google Scholar]
- Mizrahi (2014).Mizrahi Y. Vine-cacti pitayas: the new crops of the world. Revista Brasileira de Fruticultura. 2014;36(1):124–138. doi: 10.1590/0100-2945-452/13. [DOI] [Google Scholar]
- Moenga et al. (2020).Moenga SM, Gai Y, Carrasquilla-García N, Perilla-Henao LM, Cook DR. Gene co-expression analysis reveals transcriptomic divergence between wild and cultivated chickpea under drought stress. The Plant Journal. 2020;104(5):1195–1214. doi: 10.1111/tpj.14988. [DOI] [PubMed] [Google Scholar]
- Moriya et al. (2007).Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Research. 2007;35:182–185. doi: 10.1093/nar/gkm321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nerd, Tel-Zur & Mizrahi (2002).Nerd A, Tel-Zur N, Mizrahi Y. New fruits for arid climates. In: Janick J, Whipley A, editors. Perspectives on New Crops and New Uses. Alexandria: ASHS Press; 2002. pp. 349–358. [Google Scholar]
- Neto et al. (2016).Neto EMFL, Sousa JR, Casas A, Alburquerque UP. Plant domestication. In: Albuquerque UP, Alves RRN, editors. Introduction to Ethnobiology. Heidelberg: Springer; 2016. pp. 213–221. [Google Scholar]
- Nong et al. (2019).Nong Q, Zhang M, Chen J, Zhang M, Cheng H, Jian S, Lu H, Xia K. RNA-Seq de novo assembly of red pitaya (Hylocereus polyrhizus) roots and differential transcriptome analysis in response to salt stress. Tropical Plant Biology. 2019;12(2):55–66. doi: 10.1007/s12042-019-09217-3. [DOI] [Google Scholar]
- Oshlack & Robinson (2010).Oshlack A, Robinson MD. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biology. 2010;11(3):R25. doi: 10.1186/gb-2010-11-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Osuna-Enciso et al. (2016).Osuna-Enciso T, Valdez-Torres JB, Sañudo-Barajas JA, Muy-Rangel MD, Hernández-Verdugo S, Villarreal-Romero M, Souna.Rodríguéz JM. Reproductive phenology, yield and fruit quality of pitahaya (Hylocereus undatus (How.) Birtton and Rose) in Culiacan Valley, Sinaloa, Mexico. Agrociencia. 2016;50:61–78. [Google Scholar]
- Otero-Arnaiz et al. (2005).Otero-Arnaiz A, Casas A, Hamrick JL, Cruse-Sanders J. Genetic variation and evolution of Polaskia chichipe (Cactaceae) under domestication in the Tehuacán Valley, central Mexico. Molecular Ecology. 2005;14(4):1603–1611. doi: 10.3732/ajb.90.4.593. [DOI] [PubMed] [Google Scholar]
- Page et al. (2019).Page A, Gibson J, Meyer RS, Chapman MA. Eggplant domestication: pervasive gene flow, feralization, and transcriptomic divergence. Molecular Biology and Evolution. 2019;36(7):1359–1372. doi: 10.1093/molbev/msz062. [DOI] [PubMed] [Google Scholar]
- Pang et al. (2020).Pang X, Li X, Liu X, Cai L, Li B, Li X. Transcriptomic analysis reveals CU/ZN SODs acting as hub genes of SODs in Hylocereus undatus induced by trypsin during storage. Antioxidants. 2020;9(2):162. doi: 10.3390/antiox9020163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Parra et al. (2010).Parra F, Casas A, Peñaloza-Ramírez JM, Cortés-Palomec AC, Rocha-Ramírez V, González-Rodríguez A. Evolution under domestication: ongoing artificial selection and divergence of wild and managed Stenocereus pruinosus (Cactaceae) in the Tehuacán Valley, Mexico. Annals of Botany. 2010;106(3):483–496. doi: 10.1093/aob/mcq143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peláez et al. (2019).Peláez P, Orona-Tamayo D, Montes-Hernández S, Valverde ME, Paredes-López O, Cibrián-Jaramillo A. Comparative transcriptomic analysis of cultivated and wild seeds of Salvia hispanica (chia) Scientific Reports. 2019;9(1):9761. doi: 10.1038/s41598-019-45895-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Petersen et al. (2011).Petersen TN, Brunak S, Von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nature Methods. 2011;8(10):785–786. doi: 10.1038/nmeth.1701. [DOI] [PubMed] [Google Scholar]
- Petersen, Parker & Potter (2012).Petersen JJ, Parker IM, Potter D. Origins and close relatives of a semi-domesticated neotropical fruit tree: Chrysophyllum caimito (Sapotaceae) American Journal of Botany. 2012;99:585–604. doi: 10.3732/ajb.1100326. [DOI] [PubMed] [Google Scholar]
- Pérez-Orozco & Sosa (2022).Pérez-Orozco AF, Sosa V. Comparative estimations of betalains and sugars in fruits of five species of Selenicereus (Cactaceae) Acta Botanica Mexicana. 2022;129:e1991. doi: 10.21829/abm129.2022.1991. [DOI] [Google Scholar]
- Qiao et al. (2016).Qiao Q, Xue L, Wang Q, Sun H, Zhong Y, Huang JL, Lei JJ, Zhang TC. Comparative transcriptomics of strawberries (Fragaria spp.) provides insights into evolutionary patterns. Frontiers in Plant Science. 2016;7:1839. doi: 10.3389/fpls.2016.01839. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qiu et al. (2020).Qiu CW, Liu L, Feng X, Peng-Fei H, Xiaoyan H, Fangbin C, Wu F. Genome-wide identification and characterization of drought stress responsive microRNAs in Tibetan wild barley. International Journal of Molecular Sciences. 2020;21:2795. doi: 10.3390/jms21082795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quingzhu et al. (2016).Quingzhu S, Chengjie H, Zhe C, Pengkun C, Yuewen M, Jingyu W, Jian Z, Guibing S, Jietang Z, Yonghua Q. Transcriptomic analysis reveals key genes related to betalain biosynthesis in pulp coloration in Hylocereus polyrhizus. Frontiers in Plant Sciences. 2016;6(499):1179. doi: 10.3389/fpls.2015.01179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rai & Shekhawat (2014).Rai MK, Shekhawat NS. Recent advances in genetic engineering for improvement of fruit crops. Plant Cell, Tissue and Organ Culture. 2014;116(1):1–15. doi: 10.1007/s11240-013-0389-9. [DOI] [Google Scholar]
- Rico-Gray et al. (1990).Rico-Gray V, García-Franco JG, Chemas A, Puch A, Sima P. Species composition, similarity and structure of Mayan homegardens in Tixpehual and Tixcacaltuyub, Yucatan, Mexico. Economic Botany. 1990;44:470–487. [Google Scholar]
- Rodríguez-Canto (2015).Rodríguez-Canto A. Pitayas (Stenocereus spp.) and Pitahayas (Hylocereus spp.): history and literature. Acta Horticulturae. 2015;1067:335–342. doi: 10.17660/ActaHortic.2015.1067.46. [DOI] [Google Scholar]
- Ruán-Tejeda et al. (2014).Ruán-Tejeda I, Santerre A, Huerta-Martínez FM, Iñíguez-Dávalos LI, Castro-Félix P. Genetic diversity and relationships among wild and cultivated Stenocereus queretaroensis populations in western Mexico. Biochemical Systematics. 2014;55(2):125–130. doi: 10.1016/j.bse.2014.03.005. [DOI] [Google Scholar]
- Sarwar et al. (2018).Sarwar MB, Ahmad Z, Rashid B, Hassan S, Gregersen PL, Leyva MDO, Nagy I, Asp T, Husnain T. De novo assembly of Agave sisalana transcriptome in response to drought stress provides insight to the tolerance mechanisms. Scientific Reports. 2018;9(1):396. doi: 10.1038/s41598-018-35891-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shannon et al. (2003).Shannon P, Marklel A, Ozier O, Gallga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Research. 2003;13(11):2498–24504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shim, Bandillo & Angeles-Shim (2021).Shim J, Bandillo NB, Angeles-Shim RB. Finding needles in a haystack: using geo-references to enhance the selection and utilization of land races in breeding resilient cultivars of upland cotton (Gossypium hirsutum L.) Plants (Basel) 2021;10(7):1300. doi: 10.3390/plants10071300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simão et al. (2015).Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31(19):3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
- Sosa et al. (2020).Sosa V, Guevara R, Gutiérrez-Rodríguez BE, Ruiz-Domínguez C. Optimal areas and climate change effects on dragon fruit cultivation in Mesoamerica. The Journal of Agricultural Science. 2020;158(6):461–470. doi: 10.1017/S0021859620000775. [DOI] [Google Scholar]
- Xie et al. (2019).Xie Z, Nolan TM, Jiang H, Yin Y. AP2/ERF Transcription factor regulatory networks in hormone and abiotic stress responses in Arabidopsis. Frontiers in Plant Science. 2019;10:228. doi: 10.3389/fpls.2019.00228. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu et al. (2021).Yu Z-H, Li J-Q, He S-C, Zhou X-C, Wu J-S, Wang Q, Huang M-Z, Zhu X, Liu X-H, Gog X, Tang W-Y, Xu C-B, Jiang X-L, Hardie WH. Winemaking characteristics of red-fleshed dragon fruit in three locations in Gyizhour Province, China. Food Science & Nutrition. 2021;9(5):2508–2516. doi: 10.1002/fsn3.2196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yundaeng et al. (2019).Yundaeng C, Sompta P, Amkul AK, Kongjaimun A, Kaga A, Tomooka N. Construction of linkage map and genome dissection of domesticated related traits of moth bean (Vigna aconitifolia), a legume crop of arid areas. Molecular Genetics and Genomics. 2019;295:621–635. doi: 10.1007/s00438-019-1536-0. [DOI] [PubMed] [Google Scholar]
- Zhang et al. (2021).Zhang L, Chen C, Xie F, Hua Q, Zhang Z, Zhang R, Chen J, Zhao J, Hu G, Qin Y. A novel WRKY transcription factor HmoWRKY40 associated with betalain biosynthesis in pitaya (Hylocereus monacanthus) through regulating HmoCYP76AD1. International Journal of Molecular Sciences. 2021;22(4):2171. doi: 10.3390/ijms22042171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang et al. (2019).Zhang X, Yang Z, Li Z, Zhang F, Hao L. De novo transcriptome assembly and co-expression network analysis of Cynanchum thesioides: identification of genes involved in drought stress. Gene. 2019;710:375–386. doi: 10.1016/j.gene.2019.05.055. [DOI] [PubMed] [Google Scholar]
- Zhao et al. (2018).Zhao S, Li CI, Guo Y, Sheng Q, Shyr Y. RnaSeqSampleSize: real data based sample size estimation for RNA sequencing. BMC Bioinformatics. 2018;19(1):911. doi: 10.1186/s12859-018-2191-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zheng et al. (2021).Zheng J, Meinhardt LW, Goenaga R, Zhang D, Yin D. The chromosome-level genome of dragon fruit reveals whole-genome duplication and chromosomal co-localization of betacyanin biosynthetic genes. Horticulture Research. 2021;8(1):63. doi: 10.1038/s41438-021-00501-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou et al. (2020).Zhou J, Wang Z, Mao Y, Wang L, Xiao T, Hu Y, Zhang Y, Ma Y. Proteogenomic analysis of pitaya reveals cold stress-related molecular signature. PeerJ. 2020;8(4):e8540. doi: 10.7717/peerj.8540. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The following information was supplied regarding data availability:
The molecular data is available at figshare: Oltehua-López, Omar; Sosa, Victoria; A. Arteaga-Vazquez, Mario (2022): data_VS_SUn. figshare. Dataset. https://doi.org/10.6084/m9.figshare.20173444.v1





