Skip to main content
Frontiers in Genetics logoLink to Frontiers in Genetics
. 2020 Feb 28;11:140. doi: 10.3389/fgene.2020.00140

First Genome of the Brown Alga Undaria pinnatifida: Chromosome-Level Assembly Using PacBio and Hi-C Technologies

Tifeng Shan 1,2,3,*, Jianbo Yuan 1,2,3, Li Su 1,2,3, Jing Li 1,2,3, Xiaofei Leng 4, Yan Zhang 4, Hongtao Gao 4, Shaojun Pang 1,2,3,*
PMCID: PMC7058681  PMID: 32184805

Introduction

The brown alga Undaria pinnatifida (Harvey) Suringar is an economically important kelp species native to the Northwest Pacific and has been extensively farmed as human food in East Asia for more than half a century (Yamanaka and Akiyama, 1993). It is also an important resource for extracting biologically active compounds such as fucoidans which have diverse applications in pharmaceutical and cosmetic industries (Zhao et al., 2018; Yoo et al., 2019). Its annual yield worldwide has been more than two million tons since 2012 (http://www.fao.org/fishery/species/2777/en). Nowadays U. pinnatifida has become a cosmopolitan species due to its worldwide spread in recent decades, attracting increasing public attention (South et al., 2017). It has been listed as one of the world's 100 worst invasive species (Lowe et al., 2000), and in Europe has been regarded as one of the top 10 worst invasive species (Gallardo, 2014).

As a member of Laminariales, U. pinnatifida has a life history involving the alternation between two heteromorphic stages, namely the macroscopic sporophyte and the microscopic gametophyte. The haploid gametophyte was preliminarily determined to possess 30 chromosomes (Yabu et al., 1988). Sexual reproduction occurs in the gametophytic phase, in which the eggs discharged by female gametophytes are fertilized by sperms released by male gametophytes. In addition to this major reproductive pattern, parthenogenesis and apogamy have long since been revealed to be important components of the life history (Fang and Dai, 1959; Fang et al., 1979; Nakahara, 1984; Shan et al., 2013). Recently, an unusual monoecious phenomenon has been observed in zoospore-derived gametophytes, which are able to form oogonia and antheridia simultaneously and give rise to sporophytes by selfing (Li et al., 2014; Li et al., 2017). The sporophytes can become mature and release zoospores. All these spores grow into male gametophytes at first, and monoecious phenomena will be observed in some of them under developmental conditions. This finding makes the life cycle of U. pinntifida more complicated than we have traditionally thought. On one hand novel breeding methods have been developed based on these findings (Shan et al., 2013; Li et al., 2017), and on the other hand the versatile reproductive ways are suggested to be beneficial for its worldwide spread. However, the molecular mechanisms underlying the various reproduction means remains unknown. Lack of genomic sequence information hinders such fundamental study in U. pinnatifida. Herein, we report for the first time the complete genome of a male gametophyte of U. pinnatifida at the chromosomal level.

Value of the Data

The genomic sequence data can be used for genetic breeding applications, and elucidation of sex-determination and invasion mechanisms in U. pinnatifida. It has been the first reference genome of the family Alariaceae, which can be used in comparative genomics and evolutionary studies of Laminariales (kelp) species.

Materials and Methods

Sample Collection and DNA Extraction

One male gametophyte clone (designated as M23) of U. pinnatifida, which was established from one zoospore originating from a cultivated mature sporophyte (Pang et al., 2008) in Dalian, China, was used for genome sequencing ( Figure 1A ). Genomic DNA was extracted using the cetyl trimethyl ammonium bromide (CTAB) method according to Shan and Pang (2009). The DNA quantity and quality was assessed with Qubit 3.0 (Thermo Fisher Scientific Inc., Carlsbad, CA, USA) and agarose gel electrophoresis, respectively.

Figure 1.

Figure 1

Genome assembly of Undaria pinnatifida. (A) Photo of sporophyte (left) and male gametophyte (right). (B) Hi-C interaction heatmap showing interactions among 30 chromosomes (the color bar on the right showing the interaction intensity). (C) Circos plot of 30 chromosome-level scaffolds (01–30) showing the genomic features. The color scales from the left (blue) to the right (red) show an increase in repeat coverage/gene density. (D) A Venn diagram showing the number of genes annotated in five databases. (E) A Venn diagram demonstrating orthologous gene families across six species. (F) Phylogenetic tree of the six species (bootstrap values higher than 50% were given).

Genome Survey

A survey of the genome was conducted using the Illumina sequencing. A short-insert DNA library (~280 bp) was established and sequenced by Illumina Novaseq6000 platform (Illumina Inc., San Diego, CA, USA). After discarding low-quality and redundant reads, we obtained 24.3 Gb high-quality paired-end (150 bp) clean reads ( Table 1 ). These reads were employed in the distribution analysis of k-mer (k=19) frequency (Marçais and Kingsford, 2011). The peak of 19-mer was at a depth of 34X, and the genome size was predicted to be ~539 Mb. The heterozygosity and the percentage of repeated sequences were estimated to be 0.48% and 34.2%, respectively. Pilot assembly of the clean reads resulted in a genome of 551 Mb, similar to that estimated by k-mer method.

Table 1.

Summary statistics of genome assembly of Undaria pinnatifida.

Sequencing platform Clean data size (Gb) Application
Illumina Novaseq6000 (short reads) 24.3 Genome survey and correction
PacBio Sequel (long reads) 62.3 Genome assembly
Illumina Novaseq6000 (Hi-C reads) 57.1 Assisted assembly at the chromosomal level
Genome assembly and scaffolding at chromosomal level
Contig number 515
Contig length (bp) 511,028,173
GC% 50.14
Contig N50 (bp) 1,707,374
Scaffold number 114
Scaffold N50 (bp) 16,510,065
Scaffold length (bp) 511,280,173
Chromosome length (bp) 502,827,406
Hi-C mapping percent 98.4%
The predicted repeated sequences
Type Number Length (bp)
DNA 30,267 10,285,717 (2.0%)
LINE 62,383 17,880,936 (3.5%)
LTR 73,036 44,549,843 (8.7%)
RC 2,784 1,247,182 (0.2%)
SINE 326 17,815 (0.003%)
Unknown 785,477 184,368,999 (36.1%)
Low complexity 2,121 370,709 (0.07%)
Satellite 1,041 578,618 (0.1%)
Simple repeat 73,931 16,728,946 (3.3%)
Total 1,031,366 276,028,765 (54.0%)

LINE, long interspersed element; LTR, long terminal repeat; RC, rolling circle; SINE, short interspersed element.

Long-Read Genome Sequencing With PacBio Technology

Genomic DNA was mechanically fragmented into sizes of ~20 kb using a Covaris g-tube (Covaris Inc., Woburn, MA, USA). The Pacific Biosciences single-molecule real-time (SMRT) Bell™ sequencing library was constructed using a Template Prep Kit (PacBio, Menlo Park, CA, USA). After DNA damage and end repair, SMRT adaptors were ligated to generate SMRT Bell™ templates. The Blue Pippin (Sage Science Inc., Beverly, MA, USA) was used to select sizes of the fragments (> 15 Kb). After a second round of DNA end repair, the SMRT Bell™ templates were purified for final sequencing with the PacBio Sequel system (PacBio, Menlo Park, CA, USA). Ten SMRT cells were used to obtain a total of 62.6 Gb (~120X) of raw polymerase reads.

De Novo Genome Assembly

After removal of short and low-quality reads and the adaptor sequences, the raw polymerase reads were converted to 62.3 Gb subreads data, with an N50 length of 11,463 bp. Preliminary assembly was conducted using Falcon v1.2.4 (Chin et al., 2016). All the clean sequencing data was aligned to the assembled contigs with BLASR (Chaisson and Tesler, 2012), and errors in the contigs were corrected using Arrow (SMRT link v6). The Illumina sequencing data was aligned to the contigs using BWA v0.7.15 (Li and Durbin, 2009) for further correction by using Pilon v1.22 (Walker et al., 2014). A draft genome of 616.6 Mb which consisted of 807 contigs was obtained, with an N50 length of 1.8 Mb. The gametophytes of kelp species are known to contain symbiotic bacteria and thus a filtration procedure was conducted to remove potential bacterial contamination (Ye et al., 2015). The contigs were searched against the non-redundant nucleotide (NT) database of the National Center for Biotechnology Information (NCBI; https://www.ncbi.nlm.nih.gov/) with BLASTN and those with the best-hit matches to bacteria were discarded from the genome. The final draft genome was 511.0 Mb and consisted of 515 contigs with an N50 length of 1.71 Mb ( Table 1 ).

High-Throughput Chromosome Conformation Capture (Hi-C) Library Construction and Chromosome-Level Assembly

Gametophytic cells were fixed with formaldehyde (1.44%) and lysed with tissue lysis (40 mM CaCl2, 1 mg mL−1 collagenase). The cross-linked DNA was digested with the restriction enzyme DpnII. Biotinylated residues were added during repair of the sticky ends and the resulting blunt-end fragments were ligated under dilute conditions (Lieberman-Aiden et al., 2009). The DNA was extracted and randomly sheared to fragments of 250–500 bp. The biotin labeled fragments were isolated with magnetic beads, and end repair, dA tailing, adaptor ligation, PCR amplification, and purification were conducted for final construction of Hi-C library. The DNA quantity was preliminarily estimated by Qubit 3.0 and the insert size was tested by Agilent 2100 (Agilent Technologies, Santa Clara, CA, USA). The library concentration was accurately quantified by quantitative PCR. The qualified library was sequenced to produce 150 bp paired-end reads using Illumina Novaseq6000 platform. A total of 57.1 Gb clean dataset was obtained, containing 190,379,612 reads ( Table 1 ).

The Hi-C sequence data was aligned against the draft genome using JUICER v1.6.2 (Durand et al., 2016). Totally 161,141,067 (84.6%) reads were mapped to the genome and 110,009,243 (57.8%) of them were uniquely mapped. The uniquely mapped sequences were analyzed with 3D-DNA software to assist genomic assembly (Dudchenko et al., 2017). The algorithms “misjoin” and “scaffolding” were used to remove the misjoins and obtain scaffolds at the chromosomal level. The algorithm “seal” was employed to find the scaffolds that had been incorrectly removed by the “misjoin”. The heatmap of chromosome interactions was constructed to visualize the contact intensity among chromosomes using JUICER v1.6.2 ( Figure 1B ). As a result 114 scaffolds were assembled with an N50 length of 16.5 Mb. Finally a total of 502.8 Mb genomic sequences were located on 30 chromosomes, accounting for 98.4% of the whole assembled length ( Table 1 ).

Genome Annotation

Tandem repeats and interspersed repeats were identified using TRF v407b (Benson, 1999) and RepeatModeler v1.0.11, respectively. RepeatMasker v4.0.7 was used to mask the predicted and known repeated sequences (Tarailo-Graovac and Chen, 2009) and RepeatProteinMask v4.0.7 was used to mask the known repeated protein sequences. Circos plot of 30 chromosome-level scaffolds (01–30) was constructed to show the genomic features (Krzywinski et al., 2009). Totally 276.0 Mb repeated sequences were masked, accounting for 54.0% of the whole genome length ( Table 1 , Figure 1C ). We used tRNAscan-SE v1.4alpha (Lowe and Eddy, 1997) to predict tRNAs and identified other types of non-coding RNAs (ncRNA) through searching against the Rfam database v11.0 (Griffiths-Jones et al., 2005). A total of 507 ncRNAs were detected, including 367 tRNAs, 93 rRNAs, 43 sn RNAs, and 4 mi RNAs.

Protein-coding genes were predicted through the combination of homology-based, transcriptome-based, and ab initio predictions. The protein sequences of five related species, namely Ectocarpus siliculosus, Thalassiosira pseudonana, Phaeodactylum tricornutum, Aureococcus anophagefferens, and Phytophthora sojae were downloaded from NCBI and aligned against the assembled genome using GeMoMa v1.4.2 (Keilwagen et al., 2016). The full-length transcriptome, which was obtained from a pooled sample of gametophytes and sporophytes of U. pinnatifida in our previous study (unpublished data), were used to predict the open reading frames (ORFs) with PASA v2.0.2. For ab initio prediction, Augustus v3.2.2, GlimmerHMM v3.0.4 (Majoros et al., 2004), SNAP v1.0, and GeneMark-ES v4.33 (Besemer and Borodovsky, 2005) were used to predict the gene structure. All the prediction results were integrated with EVidenceModeler (EVM) (Haas et al., 2008), and the untranslated regions and alternative splicing were predicted with PASA. A total of 14,178 genes were predicted with an average length of 2,089 bp ( Figure 1C ).

Functional annotation of the predicted protein-coding genes were conducted through aligning them against the non-redundant protein (NR), SwissProt, evolutionary genealogy of genes: Non-supervised Orthologous Groups (eggNOG) (Huerta-Cepas et al., 2015) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) databases using the BLASTX with an E value cutoff of 10−5. Annotation by the Gene Ontology (GO) database was performed using Blast2GO software. Totally 12,402 (87.4%) genes were annotated in at least one database ( Figure 1D ).

Completeness and Accuracy of the Assembly

The previously obtained full-length transcripts was aligned against the assembled genome with Gmap and it was found 87.7% of the transcripts could be mapped.

The paired-end reads obtained in genome survey were also aligned against the assembled genome with BWA and 92.2% of them were successfully mapped. The analysis of Benchmarking Universal Single-Copy Orthologs (BUSCO) v3.0.2 (Simão et al., 2015), in combination with TBLASTN, Augustus, and HMMER v3.1b2 (Finn et al., 2011) software, was used to evaluate the completeness of the assembled genome based on eukaryota_odb9 database. The percentage of the identified complete BUSCOs was 82.9% at the protein level, with the fragmented and missing BUSCOs accounting for 8.9% and 8.2%, respectively.

Ortholog and Phylogenetic Analysis

OrthoMCL v2.0.5 was used for ortholog analysis based on the protein datasets from U. pinnatifida, Ectocarpus siliculosus, Phaeodactylum tricornutum, Thalassiosira pseudonana, Phytophthora sojae, and Aureococcus anophagefferens. Totally 15,414 clusters were formed, and 874 of them were single-copy gene clusters ( Figure 1E ). The single-copy gene clusters were aligned using MAFFT v7.45 and the phylogenetic tree was constructed with maximum likelihood method using PhyML (http://www.atgc-montpellier.fr/phyml/binaries.php) ( Figure 1F ).

Data Availability Statement

The raw genome sequencing data have been deposited in the NCBI SRA database under the BioProject accession number PRJNA575605. Genome assembly and annotation data has been deposited at Figshare (https://figshare.com/s/94aebbd77f374b9c6faf). The raw SMRT sequencing data for full-length transcriptome analysis is available in NCBI SRA database with accession numbers SRR8083207, SRR8083208, and SRR8083209.

Author Contributions

SP and TS conceived the study. TS cultured and maintained the gametophyte samples. XL, YZ, and HG cultured the sporophytes. TS, JY, LS, and JL extracted the DNA and performed genome assembly and data analysis. TS, JY, and SP wrote the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (No. 41676128), the Special Research Fund of Key Laboratory of Experimental Marine Biology, Chinese Academy of Sciences (No. KLEMB-SR01), the Biological Resources Program from Chinese Academy of Sciences (KFJ-BRP-017-27), China Agriculture Research System (CARS-50), the Taishan Scholar Program of Shandong Province, and the Foundation for Huiquan Scholar of Institute of Oceanology, Chinese Academy of Sciences.

Conflict of Interest

Authors XL, YZ and HG were employed by company Dalian Haibao Fishery Co., Ltd., China.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Benson G. (1999). Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27 (2), 573–580. 10.1093/nar/27.2.573 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Besemer J., Borodovsky M. (2005). GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Res. 33 (suppl_2), W451–W454. 10.1093/nar/gki487 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Chaisson M. J., Tesler G. (2012). Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory. BMC Bioinf. 13 (1), 238.   10.1186/1471-2105-13-238 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Chin C. S., Peluso P., Sedlazeck F. J., Nattestad M., Concepcion G. T., Clum A., et al. (2016). Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Methods 13, 1050.   10.1038/nmeth.4035 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Dudchenko O., Batra S. S., Omer A. D., Nyquist S. K., Hoeger M., Durand N. C., et al. (2017). De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356 (6333), 92–95.   10.1126/science.aal3327 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Durand N. C., Shamim M. S., Machol I., Rao S. S., Huntley M. H., Lander E. S., et al. (2016). Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3 (1), 95–98.   10.1016/j.cels.2016.07.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Fang T., Dai J. (1959). Genetics in multicellular marine algae. J. Shandong Coll. Oceanol. 14, 16–19. [Google Scholar]
  8. Fang Z. X., Dai J. X., Chen D. Q. (1979). Parthenogenesis and the genetic properties of parthenosporophytes of Undaria pinnatifida . Acta Oceanologia Sin. 1, 112–119. [Google Scholar]
  9. Finn R. D., Clements J., Eddy S. R. (2011). HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 39 (suppl_2), W29–W37.   10.1093/nar/gkr367 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Gallardo B. (2014). Europe's top 10 invasive species: relative importance of climatic, habitat and socio-economic factors AU - Gallardo, Belinda. Ethol. Ecol. Evol. 26 (2-3), 130–151.   10.1080/03949370.2014.896417 [DOI] [Google Scholar]
  11. Griffiths-Jones S., Moxon S., Marshall M., Khanna A., Eddy S. R., Bateman A. (2005). Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 33 (suppl_1), D121–D124. 10.1093/nar/gki081 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Haas B. J., Salzberg S. L., Zhu W., Pertea M., Allen J. E., Orvis J., et al. (2008). Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 9 (1), R7.   10.1186/gb-2008-9-1-r7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Huerta-Cepas J., Szklarczyk D., Forslund K., Cook H., Heller D., Walter M. C., et al. (2015). eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Res. 44 (D1), D286–D293.   10.1093/nar/gkv1248 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Keilwagen J., Wenk M., Erickson J. L., Schattat M. H., Grau J., Hartung F. (2016). Using intron position conservation for homology-based gene prediction. Nucleic Acids Res. 44 (9), e89–e89.   10.1093/nar/gkw092 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Krzywinski M., Schein J., Birol I., Connors J., Gascoyne R., Horsman D., et al. (2009). Circos: an information aesthetic for comparative genomics. Genome Res. 19 (9), 1639–1645. 10.1101/gr.092759.109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Li H., Durbin R. (2009). Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25 (14), 1754–1760.   10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Li J., Pang S., Shan T., Liu F., Gao S. (2014). Zoospore-derived monoecious gametophytes in Undaria pinnatifida (Phaeophyceae). Chin. J. Oceanol. Limnol. 32 (2), 365–371.   10.1007/s00343-014-3139-x [DOI] [Google Scholar]
  18. Li J., Pang S. J., Shan T. F. (2017). Existence of an intact male life cycle offers a novel way in pure-line crossbreeding in the brown alga Undaria pinnatifida . J. Appl. Phycol. 29 (2), 993–999.   10.1007/s10811-016-1005-2 [DOI] [Google Scholar]
  19. Lieberman-Aiden E., Van Berkum N. L., Williams L., Imakaev M., Ragoczy T., Telling A., et al. (2009). Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326 (5950), 289–293.   10.1126/science.1181369 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Lowe T. M., Eddy S. R. (1997). tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25 (5), 955–964. 10.1093/nar/25.5.955 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Lowe S., Browne M., Boudjelas S., De Poorter M. (2000). 100 of the world's worst invasive alien species: a selection from the global invasive species database. Aliens 12, 1–12. [Google Scholar]
  22. Majoros W. H., Pertea M., Salzberg S. L. (2004). TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics 20 (16), 2878–2879. 10.1093/bioinformatics/bth315 [DOI] [PubMed] [Google Scholar]
  23. Marçais G., Kingsford C. (2011). A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27 (6), 764–770.   10.1093/bioinformatics/btr011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Nakahara H. (1984). Alternation of generations of some brown algae in unialgal and axenic cultures. Sci. Papers Inst. Algol. Res. Faculty Sci. Hokkaido Univ. 7 (2), 77–194. [Google Scholar]
  25. Pang S. J., Shan T. F., Zhang Z. H. (2008). Responses of vegetative gametophytes of Undaria pinnatifida to high irradiance in the process of gametogenesis. Phycol. Res. 56 (4), 280–287.   10.1111/j.1440-1835.2008.00509.x [DOI] [Google Scholar]
  26. Shan T. F., Pang S. J. (2009). Assessing genetic identity of sporophytic offspring of the brown alga Undaria pinnatifida derived from mono-crossing of gametophyte clones by use of amplified fragment length polymorphism and microsatellite markers. Phycol. Res. 57 (1), 36–44.   10.1111/j.1440-1835.2008.00519.x [DOI] [Google Scholar]
  27. Shan T. F., Pang S. J., Gao S. Q. (2013). Novel means for variety breeding and sporeling production in the brown seaweed Undaria pinnatifida (Phaeophyceae): crossing female gametophytes from parthenosporophytes with male gametophyte clones. Phycol. Res. 61 (2), 154–161.   10.1111/pre.12014 [DOI] [Google Scholar]
  28. Simão F. A., Waterhouse R. M., Ioannidis P., Kriventseva E. V., Zdobnov E. M. (2015). BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31 (19), 3210–3212.   10.1093/bioinformatics/btv351 [DOI] [PubMed] [Google Scholar]
  29. South P. M., Floerl O., Forrest B. M., Thomsen M. S. (2017). A review of three decades of research on the invasive kelp Undaria pinnatifida in Australasia: an assessment of its success, impacts and status as one of the world's worst invaders. Mar. Environ. Res. 131, 243–257.   10.1016/j.marenvres.2017.09.015 [DOI] [PubMed] [Google Scholar]
  30. Tarailo-Graovac M., Chen N. (2009). Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. In Bioinf. 25 (1), 4.10.   10.1002/0471250953.bi0410s25 [DOI] [PubMed] [Google Scholar]
  31. Walker B. J., Abeel T., Shea T., Priest M., Abouelliel A., Sakthikumar S., et al. (2014). Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PloS One 9 (11), e112963.   10.1371/journal.pone.0112963 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Yabu H., Yasui H., Notoya M. (1988). Chromosome numbers of Undaria pinnatifida f. distans. Bull. Fac. Fish Hokkaido Univ. 39 (6), 13. [Google Scholar]
  33. Yamanaka R., Akiyama K. (1993). Cultivation and utilization of Undaria pinnatifida (wakame) as food. J. Appl. Phycol. 5 (2), 249–253. 10.1007/BF00004026 [DOI] [Google Scholar]
  34. Ye N., Zhang X., Miao M., Fan X., Zheng Y., Xu D., et al. (2015). Saccharina genomes provide novel insight into kelp biology. Nat. Commun. 6, 6986.   10.1038/ncomms7986 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Yoo H. J., You D.-J., Lee K.-W. (2019). Characterization and immunomodulatory effects of high molecular weight fucoidan fraction from the sporophyll of Undaria pinnatifida in cyclophosphamide-induced immunosuppressed mice. Marine Drugs 17 (8), 447.   10.3390/md17080447 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Zhao Y., Zheng Y., Wang J., Ma S., Yu Y., White W., et al. (2018). Fucoidan extracted from Undaria pinnatifida: source for nutraceuticals/functional foods. Marine Drugs 16 (9), 321.   10.3390/md16090321 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The raw genome sequencing data have been deposited in the NCBI SRA database under the BioProject accession number PRJNA575605. Genome assembly and annotation data has been deposited at Figshare (https://figshare.com/s/94aebbd77f374b9c6faf). The raw SMRT sequencing data for full-length transcriptome analysis is available in NCBI SRA database with accession numbers SRR8083207, SRR8083208, and SRR8083209.


Articles from Frontiers in Genetics are provided here courtesy of Frontiers Media SA

RESOURCES