Version Changes
Revised. Amendments from Version 1
Two citations (20, 21) were added as suggested by Reviewer 2 to document prior uses of sRNA libraries as confirmatory evidence of secondary sRNAs as products of miRNA targeting. A software analysis was added as suggested by Reviewer 2: “strucVis” for visualization of predicted RNA secondary structures with overlaid sRNA depths display of ShortStack output. Extended data: Figure S1 was revised to include the strucVis analysis and the legend modified as requested by Reviewer 2. The titles of Extended data: Tables S1 and S2 were changed for clarity as requested by Reviewer 1. The legend to Figure 1 was modified as requested by Reviewer 2 to describe the images as T plots of 5’ ends of sRNA alignments, which I previously (citation 7) named as pseudo-degradome analyses and referenced in the software availability description. The legend of Figure 2 was modified to direct readers to Extended data Tables where underlying evidence is documented and file rows are referenced to substantiate claims, as requested by Reviewer 2. Minor changes to the text were made as requested by Reviewers to improve clarity and articulate comparisons made across Figure 2 panels for interpretations underlying claims. Reviewer 2’s suggestion to elaborate on methods was addressed by a statement added at the end of the Software availability section: “The options parameters used for various algorithms are detailed in Extended data: Tables S1- S3.” No changes were made to data in Figures or Extended data Tables in this revision.
Abstract
Chen et al. ( Nature Genet. 51: 1549–1558; Oct. 2019) sequenced Ananas comosus var. bracteatus accession CB5, cultivated for its bright pink-to-red colored fruit, and yellow-fleshed A. comosus accession F153, reporting an improved F153 reference assembly while annotating MICRORNA (MIRNA) loci and gene family expressions relevant to lignin and anthocyanin biosynthesis. An independent article (Xiong et al. Sci. Rep. 8: 1947; 2018) reported var. bracteatus MIRNAs but not MIR828, a negative regulator of anthocyanin and polyphenolics biosynthesis by targeting MYB transcription factors associated with UV light- and sugar-signaling in dicots. MIR828 has been reported in gymnosperms, Amborella (sister to flowering plants), and basal monocot orders Liliales, Asparagales, Zingiberales, Arecales, but not in the Poales, a sister order comprising grasses and ~3,000 species of bromeliads including pineapple. Here I show MIR828 exists in pineapple and directs post-transcriptional gene silencing of mRNAs encoding MYB family members with inferred function to regulate the conspicuous red fruit trait in var. bracteatus. MIR828 plesiomorphy (an ancient basal trait) may shed light on monocot apomorphic fruit development, postulated for 21 monocot families with fleshy fruits as due to homoplasy/convergence driven by tropical climate and/or enticements to vertebrate endozoic seed dispersers.
Keywords: anthocyanins, microRNAs, fruit development, RNA interference, evolution
Chen et al. 1 sequenced Ananas comosus var . bracteatus accession CB5, cultivated for its bright pink-to-red colored fruit, and yellow-fleshed A. comosus accession F153, reporting an improved F153 reference assembly 2 while annotating MICRORNA ( MIRNA) loci 2– 4 and gene family expressions relevant to lignin and anthocyanin biosynthesis. An independent article reported var. bracteatus MIRNAs 5 but not MIR828 6, a negative regulator of anthocyanin and polyphenolics biosynthesis by targeting MYB transcription factors associated with UV light- and sugar-signaling in dicots 7, 8. MIR828 has been reported in gymnosperms 8, Amborella (sister to flowering plants) 9, and basal monocot orders Liliales 8, 10, Asparagales 11, Zingiberales 12, 13, Arecales 14, but not in the Poales, a sister order comprising grasses and ~3,000 species of bromeliads including pineapple 15. Here I show MIR828 exists in pineapple and directs post-transcriptional gene silencing of mRNAs encoding MYB family members with inferred function to regulate the conspicuous red fruit trait in var. bracteatus. MIR828 plesiomorphy (an ancient basal trait) may shed light on monocot apomorphic fruit development, postulated for 21 monocot families with fleshy fruits as due to homoplasy/convergence driven by tropical climate and/or enticements to vertebrate endozoic seed dispersers 16.
The astronomer Carl Sagan popularized the quip (a corollary of Occam’s Razor) “ absence of evidence is NOT evidence of absence.” Taking a conservative approach applies especially to quantitative transcriptomics. For example, the existence of regulatory and post-transcriptional processes increase the complexity of non-coding RNA space, where annotation is sparse. Analysis of Chen et al’s A. comosus MD-2 cultivar leaf small RNA (sRNA) libraries and stranded RNA-seq libraries 4 from flowers and fruits of F153 and CB5 genotypes establish the existence of aco-miR828 ( Extended data: Figure S1 17) and pri-MIR828 expressions, with the novel observation that pri-MIR828 is properly transcribed in leaves, flowers and fruits yet ~50% of mature miR828 species abundance (0.3 reads per million in leaves from 264.7 million reads; Extended data: Table S1 17) are 21 nt, while 93% of the equally abundant 22 nt species appear to have undergone non-templated 3’ uridylation of the 21 nt species 18 ( Extended data: Table S1 17). Analysis of independent bracteatus cultivar leaf sRNA libraries (NCBI SRA SRR5677552-7; 113.2 million reads) 5 of unknown provenance relative to the subject CB5 genotype failed to identify any MIR828 reads.
Fortuitously, a unique aspect of miR828 is that despite its very low abundance, it has very high activity 7, 19 that serves as diagnostic. miR828 guides ARGONAUTE slicing of target MYB mRNAs within the deeply conserved SANT domain region 8 by Watson-Crick complementarity, with consequent knock-on production of easily quantified DICER-mediated sense- and antisense 21 nt phased small-interfering RNAs (phasiRNAs) mapping downstream (3’) on target MYB transcripts. The improved F153 reference assembly 1 contains two candidate miR828-targeted MYBs: Aco017254.1 (LG4), with two introns of 1113 and 1340 nts, and Aco020986.1 (LG14) without RNA-seq evidence of intron splicing, whereas the CB5 bracteatus genome only contains one Aco017254.1 homologous gene with RNA-seq splicing evidence for conserved introns of 1109 and 1332 nt ( Extended data: Table S2 17). When phasiRNA expressions from leaf sRNA libraries 4, 5 are respectively mapped to the F153 and CB5 candidate MYB target mRNAs, it is apparent that F153 MYB transcripts clearly undergo miR828-guided slicing, evidenced by D1(+) phased siRNA reads mapping to the 10 th nucleotide position of miR828 homology to the mRNA target, and unique sense and antisense secondary phasi-RNAs mapping precisely in multiples of 21 nt downstream from the detected slice sites ( Figure 1; Extended data: Figure S2, Table S3 17). The CB5 reference genome target MYB locus Aco017254.1 homolog (contig tig00012294, CABWKS010000088.1:25590-28868rc) encodes two missense codons at residues 41 (M➔R) and 200 (N➔K compared to F153), four silent codon substitutions, and RNA-seq analysis reveals mis-annotation of the CB5 genome which lacks two Gs (at contig residues 28780 and 27566) that exist and result in a CB5 open reading frame of the same size as Aco017254.1 in F153, including six instead of seven trinucleotide GGC glycine codon repeats templated in the CB5 genome at residue 204 ( Extended data: Table S2 17).
In contrast to demonstrated post-transcriptional silencing activity of miR828 on target MYB abundance in F153 leaf samples ( Figure 1), analysis of six bracteatus leaf sRNA libraries from an independent study 5 did not provide any evidence of target MYB Aco017254.1 mRNA slicing or target phasiRNA accumulation ( Extended data: Table S3 17). Taken together, subject to the caveat that the provenance of the bracteatus cultivar used for the sRNA analysis 5 may be different than subject CB5 genotype, the data suggest there may be differences in expression and/or regulation of pri-MIR828, and/or target MYB Aco017254.1 between F153 yellow-fleshed versus CB5 red-fleshed genotypes. Figure 2 shows this is indeed the case in various tissues examined, with the evidence supporting higher pri-MIR828 expression in F153 ovules concordant with lower target MYB Aco017254.1 mRNA levels ( Figure 2A), significant decreases over time from stage 1 early fruit development for target MYBs Aco017254.1 and Aco020986.1 in yellow-fleshed MD-2 cultivar ( Figure 2B), whereas in contrast there is a trend of lower expression of CB5 pri-MIR828 concordant with sustained higher abundance of Aco017254.1 target MYB ( Figure 2C stage 7 versus stage 1) than seen in MD-2 during the ripening stage of red-fleshed CB5 genotype (compare Figure 2C stage 7 showing high CB5 Aco017254.1 abundance to Figure 2B stage 7 showing very low MD-2 Aco017254.1 abundance).
Plant development gives rise to an astounding complexity of shapes, colors, and functions that Darwin called ‘an abominable mystery’ in his efforts to integrate species complexity with the unifying theory of evolution. The observations reported here potentially offer insight into the conservation of a developmental control mechanism whereby miR828 in pineapple, like in dicots silences MYBs inferred to act as positive effectors of anthocyanin biosynthesis that could give rise to the red-fleshed trait in the bracteatus variety. Consistent with this view is the finding from genome-wide functional phylogenomic approaches that ARGONAUTE1, RNA-DEPENDENT RNA POLYMERASE6, and mRNA export factor homolog SILENCING DEFECTIVE5, each required for trans-acting siRNA accumulation, played significant roles in the evolution of monocot metabolic and developmental traits 22, 23. Also of interest in this context is the claim of bracteatus genome authors 24 that convergent expansion in several Crassulacean Acid Metabolism (CAM) bromeliad lineages of XAP5 CIRCADIAN TIMEKEEPER/XCT, a nuclear-localized regulator of blue light responses 25 and sRNA production 26, supports evolution of the myriad metabolic and physiological transitions from C3 to CAM photosynthesis by duplication/differentiation of a highly pleiotropic effector 24.
Data availability
Underlying data
RNA-seq data 1, CB5 reference genome GCA_902506285.1, and F153 improved assembly genome GCA_902162155.1 were downloaded from NCBI via BioProject accession code PRJEB33121. sRNA raw data 5 for A. comosus var. bracteatus leaf libraries were downloaded from NCBI BioProject accession code PRJNA389361. Processed data 4 for MD-2 leaf sRNA libraries Pn_Gr2am_1, Pn_Gr3pm_3, Pn_Gr4pm_1, Pn_Gr4pm_3, Pn_Gr6am_2, Pn_GrMid_3, Pn_Wh10am_2, Pn_Wh10pm_1, Pn_Wh1pm_1, Pn_Wh2am_1, Pn_Wh2am_2, Pn_Wh3pm_1, Pn_Wh3pm_2, Pn_Wh4am_1, Pn_Wh4pm_1, Pn_Wh6am_1, Pn_Wh6am_2, Pn_Wh6pm_1, Pn_Wh6pm_2, Pn_Wh8am_2, Pn_Wh8pm_2, Pn_WhMid_1, and Pn_WhMid_2 were downloaded from https://mpss.danforthcenter.org/dbs/index.php?SITE=pineapple_sRNA.
The original A. comosus F153 reference genome and cDNA fasta file ‘Acomosus_321_v3.cds.fa’ can be browsed online and downloaded at https://phytozome.jgi.doe.gov/ 27.
Extended data
Figshare: A role for MIR828 in pineapple fruit development, https://doi.org/10.6084/m9.figshare.11388051.v2 17
This project contains the following extended data:
Figure S1. strucVis graphical linear output of Aco- MIR828 hairpin structure and sRNA abundance evidence from 23 MD-2 leaf sRNA libraries.
Figure S2. PhaseTank alignment output of leaf MD-2 sRNA libraries mapped to candidate miR828 target MYB cDNAs, F153 improved reference genome.
Table S1: Evidences for pri-MIR828 abundance in F153 and CB5 varieties.
Table S2: RNA-seq evidence for miR828 target MYB abundance.
Table S3: bowtie mapping of sRNAs to miR828 target MYB mRNAs
Data are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).
Software availability
Fastx-toolkit version 0.0.14 for trimming Illumina universal adapterAGATCGGAAGAGCACACGTCTGAACTCCAGTCA (fastx_clipper option -l 18; discards short reads) and fasta file manipulations (fastx_uncollapser to expand the pre-processed data 4; fastq_to_fasta for inputs to CleaveLand) is available at http://hannonlab.cshl.edu/fastx_toolkit/. FastQC version 0.11.5 for quality control of fastq raw sequence data is available at https://www.bioinformatics.babraham.ac.uk/projects/fastqc/. ShortStack 28 version 3.8.5 (options --mincov 15 --foldsize 340; Extended data: Figure S1, Table S1 17) for comprehensive annotation and quantitation of MIRNAs and sRNA cluster phasing is available at https://github.com/MikeAxtell/ShortStack. strucVis for visualization of predicted RNA secondary structures with overlaid sRNA depths display of ShortStack output ( Extended data: Figure S1, page 2) is available at https://github.com/MikeAxtell/strucVis. CleaveLand 29 version 4.4 for plotting 5’ ends of sRNA libraries as pseudo-degradome inputs (amplified sense and anti-sense siRNAs in phase and derived from miRNA-sliced mRNAs 7, 20, 21) to find sliced miRNA targets is available at https://github.com/MikeAxtell/CleaveLand4. PhaseTank 30 version 1.0 for quantifying and aligning phasiRNAs to miRNA target mRNAs ( Extended data: Figure S2 17) is available at http://phasetank.sourceforge.net/; the linux command sed -i ‘s/-/_x/’ was used to reconfigure fastx_collapser output from concatenated sRNA fasta library files to PhaseTank input style “>t1_xN” where N is number of collapsed reads). Magic-BLAST 31 version 1.5.0 for RNA-seq fastq read alignment ( Extended data: Table S2 row 4 for options parameters 17) to reference genomes is available at https://ncbi.github.io/magicblast/. Blastn 32 version 2.6.0 is available at ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/. Bowtie 33 version 1.1.2 for short read alignment (option -v 1 to allow one mismatch) is available at http://bowtie-bio.sourceforge.net/manual.shtml. RNAfold 34 web server was used for generating miR828 hairpin graphics ( Extended data: Figure S1 17) at http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi. The options parameters used for various algorithms are detailed in Extended data: Tables S1–S3
Acknowledgements
The author thanks the TTU High Performance Computer Center for support in use of the Quanah supercluster, and Sunitha Sukumaran for help with perl scripting.
Funding Statement
Current work in the author’s laboratory is funded by the California Department of Food and Agriculture Pierce’s Disease Board, Agreement Number 19-0259-000-SA.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
[version 2; peer review: 2 approved]
References
- 1. Chen LY, VanBuren R, Paris M, et al. : The bracteatus pineapple genome and domestication of clonally propagated crops. Nat Genet. 2019;51(10):1549–1558. 10.1038/s41588-019-0506-8 [DOI] [PubMed] [Google Scholar]
- 2. Ming R, VanBuren R, Wai CM, et al. : The pineapple genome and the evolution of CAM photosynthesis. Nat Genet. 2015;47(12):1435–1442. 10.1038/ng.3435 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Zheng Y, Li T, Xu Z, et al. : Identification of microRNAs, phasiRNAs and their targets in pineapple. Trop Plant Biol. 2016;9:176–186. 10.1007/s12042-016-9173-4 [DOI] [Google Scholar]
- 4. Wai CM, VanBuren R, Zhang J, et al. : Temporal and spatial transcriptomic and microRNA dynamics of CAM photosynthesis in pineapple. Plant J. 2017;92(1):19–30. 10.1111/tpj.13630 [DOI] [PubMed] [Google Scholar]
- 5. Xiong YY, Ma J, He YH, et al. : High-throughput sequencing analysis revealed the regulation patterns of small RNAs on the development of A. comosus var. bracteatus leaves. Sci Rep. 2018;8(1):1947. 10.1038/s41598-018-20261-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Rajagopalan R, Vaucheret H, Trejo J, et al. : A diverse and evolutionarily fluid set of microRNAs in Arabidopsis thaliana. Genes Dev. 2006;20(24):3407–3425. 10.1101/gad.1476406 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Sunitha S, Loyola R, Alcalde JA, et al. : The role of UV-B light on small RNA activity during grapevine berry development. G3 (Bethesda). 2019;9(3):769–787. 10.1534/g3.118.200805 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Luo QJ, Mittal A, Jia F, et al. : An autoregulatory feedback loop involving PAP1 and TAS4 in response to sugars in Arabidopsis. Plant Mol Biol. 2012;80(1):117–129. 10.1007/s11103-011-9778-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Amborella Genome Project: The Amborella genome and the evolution of flowering plants. Science. 2013;342(6165):1241089. 10.1126/science.1241089 [DOI] [PubMed] [Google Scholar]
- 10. Suzuki K, Suzuki T, Nakatsuka T, et al. : RNA-seq-based evaluation of bicolor tepal pigmentation in Asiatic hybrid lilies ( Lilium spp.). BMC Genomics. 2016;17(1):611. 10.1186/s12864-016-2995-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Harkess A, Zhou J, Xu C, et al. : The asparagus genome sheds light on the origin and evolution of a young Y chromosome. Nat Comm. 2017;8(1):1279. 10.1038/s41467-017-01064-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Wen JZ, Liao JY, Zheng LL, et al. : A contig-based strategy for the genome-wide discovery of microRNAs without complete genome resources. PLoS One. 2014;9(2):e88179. 10.1371/journal.pone.0088179 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Roy S, Tripathi AM, Yadav A, et al. : Identification and expression analyses of miRNAs from two contrasting flower color cultivars of Canna by deep sequencing. PLoS One. 2016;11(1):e0147499. 10.1371/journal.pone.0147499 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Xin C, Liu W, Lin Q, et al. : Profiling microRNA expression during multi-staged date palm ( Phoenix dactylifera L.) fruit development. Genomics. 2015;105(4):242–251. 10.1016/j.ygeno.2015.01.004 [DOI] [PubMed] [Google Scholar]
- 15. Barrett CF, Baker WJ, Comer JR, et al. : Plastid genomes reveal support for deep phylogenetic relationships and extensive rate variation among palms and other commelinid monocots. New Phytol. 2016;209(2):855–870. 10.1111/nph.13617 [DOI] [PubMed] [Google Scholar]
- 16. Givnish TJ, Pires JC, Graham SW, et al. : Repeated evolution of net venation and fleshy fruits among monocots in shaded habitats confirms a priori predictions: evidence from an ndhF phylogeny. Proc Biol Sci. 2005;272(1571):1481–1490. 10.1098/rspb.2005.3067 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Rock C: A role for MIR828 in pineapple fruit development. figshare.Online resource.2019. 10.6084/m9.figshare.11388051.v2 [DOI] [PMC free article] [PubMed]
- 18. Li J, Yang Z, Yu B, et al. : Methylation protects miRNAs and siRNAs from a 3'-end uridylation activity in Arabidopsis. Curr Biol. 2005;15(16):1501–1507. 10.1016/j.cub.2005.07.029 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Cuperus JT, Carbonell A, Fahlgren N, et al. : Unique functionality of 22-nt miRNAs in triggering RDR6-dependent siRNA biogenesis from target transcripts in Arabidopsis. Nat Struct Mol Biol. 2010;17(8):997–1003. 10.1038/nsmb.1866 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Allen E, Xie Z, Gustafson AM, et al. : microRNA-directed phasing during trans-acting siRNA biogenesis in plants. Cell. 2005;121(2):207–221. 10.1016/j.cell.2005.04.004 [DOI] [PubMed] [Google Scholar]
- 21. Shahid S, Kim G, Johnson NR, et al. : MicroRNAs from the parasitic plant Cuscuta campestris target host messenger RNAs. Nature. 2018;553(7686):82–85. 10.1038/nature25027 [DOI] [PubMed] [Google Scholar]
- 22. Lee EK, Cibrian-Jaramillo A, Kolokotronis SO, et al. : A functional phylogenomic view of the seed plants. PLoS Genet. 2011;7(12):e1002411. 10.1371/journal.pgen.1002411 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Cibrián-Jaramillo A, De la Torre-Bárcena JE, Lee EK, et al. : Using phylogenomic patterns and gene ontology to identify proteins of importance in plant evolution. Genome Biol Evol. 2010;2:225–239. 10.1093/gbe/evq012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. De La Harpe M, Paris M, Hess J, et al. : Genomic footprints of repeated evolution of CAM photosynthesis in tillandsioid bromeliads. BioRxiv.preprint.2018. 10.1101/495812 [DOI] [PubMed] [Google Scholar]
- 25. Martin-Tryon EL, Harmer SL: XAP5 CIRCADIAN TIMEKEEPER coordinates light signals for proper timing of photomorphogenesis and the circadian clock in Arabidopsis. Plant Cell. 2008;20(5):1244–1259. 10.1105/tpc.107.056655 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Fang X, Shi Y, Lu X, et al. : CMA33/XCT regulates small RNA production through modulating the transcription of Dicer-Like genes in Arabidopsis. Mol Plant. 2015;8(8):1227–1236. 10.1016/j.molp.2015.03.002 [DOI] [PubMed] [Google Scholar]
- 27. Goodstein DM, Shu S, Howson R, et al. : Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2011;40(Database issue):D1178–D1186. 10.1093/nar/gkr944 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Johnson NR, Yeoh JM, Coruh C, et al. : Improved placement of multi-mapping small RNAs. G3 (Bethesda). 2016;6(7):2103–2111. 10.1534/g3.116.030452 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Addo-Quaye C, Miller W, Axtell MJ: CleaveLand: a pipeline for using degradome data to find cleaved small RNA targets. Bioinformatics. 2009;25(1):130–131. 10.1093/bioinformatics/btn604 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Guo Q, Qu X, Jin W: PhaseTank: genome-wide computational identification of phasiRNAs and their regulatory cascades. Bioinformatics. 2014;31(2):284–286. 10.1093/bioinformatics/btu628 [DOI] [PubMed] [Google Scholar]
- 31. Boratyn GM, Thierry-Mieg J, Thierry-Mieg D, et al. : Magic-BLAST, an accurate RNA-seq aligner for long and short reads. BMC Bioinformatics. 2019;20(1):405. 10.1186/s12859-019-2996-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Zhang Z, Schwartz S, Wagner L, et al. : A greedy algorithm for aligning DNA sequences. J Comput Biol. 2000;7(1–2):203–214. 10.1089/10665270050081478 [DOI] [PubMed] [Google Scholar]
- 33. Langmead B, Trapnell C, Pop M, et al. : Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10(3):R25. 10.1186/gb-2009-10-3-r25 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Lorenz R, Bernhart SH, Höner Zu Siederdissen C, et al. : ViennaRNA Package 2.0. Algorithms Mol Biol. 2011;6:26. 10.1186/1748-7188-6-26 [DOI] [PMC free article] [PubMed] [Google Scholar]