Skip to main content
GigaScience logoLink to GigaScience
. 2018 Sep 25;7(11):giy119. doi: 10.1093/gigascience/giy119

The genome of common long-arm octopus Octopus minor

Bo-Mi Kim 1,#, Seunghyun Kang 1,#, Do-Hwan Ahn 1,#, Seung-Hyun Jung 2,#, Hwanseok Rhee 3,#, Jong Su Yoo 2, Jong-Eun Lee 3, SeungJae Lee 3, Yong-Hee Han 4, Kyoung-Bin Ryu 4, Sung-Jin Cho 4,, Hyun Park 1,5,, Hye Suck An 2,
PMCID: PMC6279123  PMID: 30256935

Abstract

Background

The common long-arm octopus (Octopus minor) is found in mudflats of subtidal zones and faces numerous environmental challenges. The ability to adapt its morphology and behavioral repertoire to diverse environmental conditions makes the species a promising model for understanding genomic adaptation and evolution in cephalopods.

Findings

The final genome assembly of O. minor is 5.09 Gb, with a contig N50 size of 197 kb and longest size of 3.027 Mb, from a total of 419 Gb raw reads generated using the Pacific Biosciences RS II platform. We identified 30,010 genes; 44.43% of the genome is composed of repeat elements. The genome-wide phylogenetic tree indicated the divergence time between O. minor and Octopus bimaculoides was estimated to be 43 million years ago based on single-copy orthologous genes. In total, 178 gene families are expanded in O. minor in the 14 bilaterian species.

Conclusions

We found that the O. minor genome was larger than that of closely related O. bimaculoides, and this difference could be explained by enlarged introns and recently diversified transposable elements. The high-quality O. minor genome assembly provides a valuable resource for understanding octopus genome evolution and the molecular basis of adaptations to mudflats.

Keywords: octopus genome, cephalopods, adaptation and evolution, long-read sequencing

Background

Cephalopods (e.g., cuttlefish, nautilus, octopus, and squid) belong to the phylum Mollusca, which is one of the most diverse phylum within Lophotrochozoa. Regardless of their evolutionary, biological, and economic significance, their genome information is still limited to a few species [1–4].

Cephalopods have interesting biological characteristics such as an extraordinary life-history plasticity, rapid growth, short life span, large brain, and sophisticated sense organs with a complex nervous system [5]. The ability to adapt their morphology and behavioral repertoire to diverse environmental conditions and their capacity for learning and memory are common traits in cephalopods but have rarely been observed in other invertebrates [6]. Many cephalopod species have been considered for fisheries and are promising candidates for aquaculture. There are an estimated 1,000 cephalopod species (∼700 known marine-living species), and octopods are among the most well-known representatives of the class, including more than 150 species worldwide [7]. Studies have evaluated the biological machinery underlying the fundamental nervous system functions, strong behavioral plasticity, and learning ability in octopods [8, 9].

Octopus minor (Sasaki, 1920) (NCBI taxon ID:515824), also known as the common long-arm octopus, is a benthic littoral species and a major commercial fishery product with a high annual yield [10]. Octopusminor is relatively small and possesses a shorter life cycle (approximately 1 year), thinner arms, and a lower ratio between head size and arm length compared to those of other Octopus species (Fig. 1a and b). The species is widely distributed in northeast Asia, particularly in coastal regions of South Korea, China, and Japan (Fig. 1c). Most O. minor habitats are mud and mud-sand in well-developed mudflats of coastal regions; they spawn in holes on the mudflat by digging with the whole body. Thus, they are subjected to the harsh environmental conditions of mudflats, including diurnal temperature changes, steep salinity and pH gradients, desiccation, wave action and tides, oxygen availability, and interrupted feeding. Owing to the ability of O. minor to tolerate environmental fluctuations, it is a promising organism for studies of the molecular basis of plasticity and mechanisms underlying adaptation to harsh environmental conditions, although relevant information is scarce. To make full use of this emerging cephalopod model system and to understand the interesting features of O. minor, including its plasticity in mudflats and genetic evolution, a high-quality reference genome is required.

Figure 1:

Figure 1:

Common long-arm octopus (Octopus minor). (a) Photograph of O. minor. (b) Habitat structure of mudflats and phenotypic differences between O. minor and Octopusbimaculoides. Octopusminor has a smaller body size and possesses longer, thinner arms than those of O. bimaculoides. (c) The distribution of O. minor is shown in red. The distribution map was updated from Roper et al. (1984)11.

The published genome and multiple transcriptomes of the California two-spot octopus Octopus bimaculoides have provided valuable information on genomic traits (e.g., gene family expansion, genome rearrangements, and transposable element activity) related to the evolution of neural complexity and morphological innovations [3]. In this study, we report a high-quality genome assembly and annotation for O. minor. We compare the genomes of O. minor and O. bimaculoides and provide evidence that the expansion of genes and/or gene families is related to adaptation to the harsh environmental conditions of mudflats.

Data Description

Genome sequencing and annotation

Octopusminor genomic DNA was extracted from leg muscle tissues. The average coverage of Single-Molecular Real-Time (SMRT) sequences was ∼76-fold using P6-C4 sequence chemistry from genomic DNA libraries that was sequenced using the Pacific Biosciences (PacBio) RS II platform. The average subread length was 9.2 kb (Supplementary Table S1). For genome size estimation, k-mer analysis was performed using Jellyfish ver. 2.1.3 (Jellyfish, RRID:SCR_005491) [12] with paired-end sequences of the genomic DNA libraries. The O. minor genome was estimated to be 5.1 Gb (Supplementary Figs. S1 and S2). The de novo assembly generated using FALCON-Unzip assembler ver. 0.4 was 5.09 Gb with 41,584 contigs (Falcon, RRID:SCR_016089) [13]. Finally, evaluation of the genome completeness was checked using Benchmarking Universal Single-Copy Orthologs (BUSCO) ver. 1.22 (BUSCO, RRID:SCR_015008) [14] (Table 1).

Table 1:

BUSCO evaluated for the completeness of the O. minor genome assembly

Eukaryote Metazoa
Count % Count %
Complete BUSCOs (C) 224 73.9 745 76.2
Complete and single-copy BUSCOs (S) 193 63.7 628 64.2
Complete and duplicated BUSCOs (D) 31 10.2 117 12
Fragmented BUSCOs (F) 26 8.6 82 8.4
Missing BUSCOs (M) 53 17.5 151 15.4
Total BUSCO groups searched 303 978

Total RNA was extracted from 13 tissues (brain, branchial heart, buccal mass, eye, heart, kidney, liver, ovary, poison gland, siphon, skin, and suckers) using the RNeasy Mini Kit (Qiagen, Hilden, Germany) according to the manufacturer's instructions. RNA quality was confirmed using an Agilent Bioanalyzer. Isoform sequencing was performed using pooled RNA from 13 organs. Library construction and sequencing were performed using PacBio RS II (Supplementary Table S2). The SMRTbell library for Iso-seq was sequenced using 16 SMRT cells (1–2 kb, three cells; 2–3 kb, six cells; and 3–6 kb, seven cells). Reads were identified using the SMRT Analysis ver. 2.3 RS_IsoSeq.1 classification protocol. All full-length reads derived from the same isoform were clustered, and consensus sequences were polished using the TOFU pipeline (isoseq-tofu) [15]. Additionally, chimeras of consensus sequences generated during experiments and TOFU pipeline were removed using in-house script.

MAKER ver. 2.28 was used for genome annotation (MAKER, RRID:SCR_005309) [16]. First, repetitive elements were identified using RepeatMasker ver. 4.0.7 (RepeatMasker, RRID:SCR_012954) [17]. A de novo repeat library was constructed using RepeatModeler ver. 1.0.3 (RepeatModeler, RRID:SCR_015027) [18], including RECON ver. 1.08 [19] and RepeatScout ver. 1.0.5 (RepeatScout, RRID:SCR_014653) [20], with default parameters. Consensus sequences and classification information for each repeat family were generated, and tandem repeats, including simple repeats, satellites, and low-complexity repeats, were predicted using Tandem Repeats Finder [15]. This masked genome sequence was used for ab initio gene prediction with SNAP software (SNAP—SNP Annotation and Proxy Search, RRID:SCR_002127) [21]. Subsequently, alignments of expressed sequence tags with the Basic Local Alignment Search Tool n ver. 2.2.28+ (BLASTN, RRID:SCR_001598) and protein information from tBLASTx ver. 2.2.28+ (TBLASTX, RRID:SCR_011823) were included. The de novo repeat library of O. minor from RepeatModeler was used for RepeatMasker. Proteins from sequenced molluscs (Lottia gigantea, Crassostrea gigas, and Aplysia californica) and an octopus species (O. bimaculoides) were included in the analysis. Transcriptome assembly results were used for expressed sequence tags. Next, MAKER polished the alignments using Exonerate, which provided integrated information for SNAP annotation. Using MAKER, the final gene model was selected and revised considering all information. A total of 30,010 O. minor genes were predicted using MAKER. The Infernal software package ver. 1.1 (Infernal, RRID:SCR_011809) [22] and covariance models from the Rfam (Rfam, RRID:SCR_007891) [23] database were used to identify other noncoding RNAs in the O. minor scaffold. Putative tRNA genes were identified using tRNAscan-SE ver. 1.4 (tRNAscan-SE, RRID:SCR_010835) [24]. tRNAscan-SE uses a covariance model that scores candidates based on their sequence and predicted secondary structures.

The mean size of O. minor genes was 23.6 kb, with an average intron length of 5.4 kb (4.2 introns per gene) (Supplementary Table S3). The O. minor genome contained 30,010 protein-coding genes (Table 2), of which 96% were annotated based on known proteins in public databases, and 79% were similar to O. bimaculoides genes (Supplementary Table S4).

Table 2:

Overview of the assembly and annotation of the Octopus minor genome

Total length (bp) 5,090,349,614
Number of contigs 41,584
Contig N50 (bp) 196,941
Largest contigs (bp) 3,027,443
GC (Guanine-Cytosine) content (%) 36.33
Number of protein-coding genes 30,010

Comparative genomic analyses and duplicate genes

To resolve gene family evolution in the O. minor genome, we classified orthologous gene clusters (Supplementary Table S5) from 14 species and found evidence for the recent expansion of low-copy gene duplicates and the expansion of large gene families. Orthologous groups were identified using both OrthoMCL ver. 2.0.9 [25] and Pfam (Pfam, RRID:SCR_004726) [26] domain assignments. OrthoMCL generated a graphical representation of sequence relationships, which was then divided into subgraphs using the Markov clustering algorithm from multiple eukaryotic genomes [25]. The default parameters and options of OrthoMCL were used for all steps, together with the genomes of 14 species (Supplementary Table S5). For O. minor, the coding sequence from the MAKER annotation pipeline was used. To construct a phylogenetic tree and estimate the divergence time, 202 one:one single-copy orthologous genes were used. Using the Probabilistic Alignment Kit ver.140603 [27], protein-coding genes were aligned with the codon alignment option, and poorly aligned regions with gaps were eliminated using Gblocks ver. 0.91b [28] with a codon model. A maximum-likelihood tree was built using RAxML ver. 8.2.4 (RAxML, RRID:SCR_006086) [29] with 1,000 bootstrap replicates, and the divergence time was calibrated using TimeTree [30]. The average gene gain-loss was identified using CAFE ver. 4.0 [31] with P value < 0.05.

Sequence divergence was estimated by calculating dS values using the yn00 program from the PAML package ver. 4.7a (PAML, RRID:SCR_014932) [32]. The Jukes–Cantor distances were adjusted using the Jukes–Cantor formula dXY = -(3/4)ln(1–4/3D), where D is the proportion of nucleotide differences between the sequences. The time estimation was calibrated by assuming dS of ∼1 is 135 million years [7].

Gene family analyses of specific genes of interest were manually curated using manual gene search methods. Gene or gene family targets identified in the genomes of O. bimaculoides, C. gigas, L. gigantea, Capitella teleta, and Homo sapiens were directly mapped to the O. minor genome database by a local BLAST analysis. Alignments were generated using Clustal Omega ver. 1.2.4 [33] and Multiple Sequence Comparison by Log-Expectation ver. 3.8.31 (MUSCLE, RRID:SCR_011812) [34], and phylogenetic trees were built using FastTree [35] or RAxML with 1,000 bootstrap replicates.

Gene gain-loss analysis indicated significantly greater gene family expansion in O. minor (178 gene families) compared to other species, e.g., interleukin-17, G protein-coupled receptor proteins, zinc-finger of C2H2 type, heat shock protein (HSP) 70 proteins, right and cadherin-like domains (Supplementary Tables S6–S8). The divergence time between O. minor and O. bimaculoides was estimated to be 43 million years ago (Mya) based on single-copy orthologous genes (Fig. 2a) Further, Pfam domain and EggNOG metazoan database searches consistently showed the expansion of gene families, including the cadherin and protocadherin domains and interleukin-17 (Fig. 2b, Supplementary Tables S9 and S10).

Figure 2:

Figure 2:

Gene family analysis for 14 bilaterian species. (a) Divergence times estimated from genome sequences of 14 bilaterian species. (b) Heat map of expanded Pfam domains in the O. minor genome. OM, Octopus minor; OB, Octopus bimaculoides; LG, Lottia gigantea; CG, Crassostrea gigas; PF, Pinctada fucata; LA, Lingula anatina; CT, Capitella teleta; HR, Helobdella robusta; CE, Caenorhabditis elegans; DM, Drosophila melanogaster; DP, Daphnia pulex; SP, Strongylocentrotus purpuratus; MM, Mus musculus; HS, Homo sapiens.

Previously, 168 protocadherin (pcdhs) genes were annotated in the genome of O. bimaculoides, which is the largest number among sequenced metazoan genomes [3]. In the case of the C2H2 zinc-finger gene family, approximately 1,800 C2H2 genes were annotated in the O. bimaculoides genome. The drastic expansions were also observed in the genome of O. minor, as 303 and 2,289 genes were annotated for pcdhs and the C2H2 zinc-finger gene family, respectively. We assume that the expansion at

patterns are unique to the genus Octopus, as the expansion pattern was not detected in squid and the pcdhs seem to have expanded after octopuses diverged from squid (≈135 Mya) [3]. Since we estimated that O. minor diverged from the genus Octopus, the extraordinary expansions of both gene families are presumably Octopus specific.

Transposable element annotation and expansions

The O. minor genome (5.1 Gb) is composed of 44% repetitive sequences and 0.68% coding sequences, while the O. bimaculoides genome (2.7 Gb) made up of 35% repetitive sequences and 1.08% coding sequences. Repeats were dominated by simple repeats (14.7% of genome) and transposable elements (TEs), especially DNA transposons and long interspersed elements (LINEs), which were more abundant in the O. minor genome than in the O. bimaculoides genome (Supplementary Tables S11–S13). In an analysis of genes (i.e., exons and introns) and intergenic sequences, TEs were highly distributed in the intergenic sequence regions in both species (Supplementary Fig. S4). In particular, TE accumulation in intergenic sequence regions was significantly greater in O. minor than in O. bimaculoides. The larger number of gene size and higher repeat content may explain the larger genome of O. minor compared with O. bimaculoides.

TEs are components of animal genomes, with major roles in genome rearrangements and evolution. Based on the mechanism of transposition, TEs are grouped into two main classes, class I retrotransposons, which are subdivided into long terminal repeats (LTRs) and non-LTR retrotransposons (e.g., LINEs and short interspersed elements [SINEs]), and class II DNA transposons [36]. We detected more TEs in the larger genome of O. minor than in the smaller genome of O. bimaculoides. Approximately half of the O. minor genome was composed of TEs (11,547,325 TEs; 44% of the genome), while one-third of the O. bimaculoides genome was composed of TEs (3,887,025 TEs; 35%) (Supplementary Table S11). The majority of class I retrotransposons in the O. minor genome were LINEs (10%), as was also the case in O. bimaculoides (9%), and the proportion of DNA transposons in O. minor (13%) was comparable to that in O. bimaculoides (12%). Interestingly, the O. minor genome had fewer SINEs (1,540 copies; 0.01%) and more rolling-circle (RC) helitrons (121,101 copies; 3.7%) than the O. bimaculoides genome (SINEs: 115,169 copies, 1.8%; RC helitron: 43,735 copies, 0.7%). A Kimura distance analysis revealed that the most frequent TE sequence divergence relative to the TE consensus sequence was ∼7%–10%, with an additional peak at 3% (Fig. 3a) compared to 16%–17% in the O. bimaculoides genome (Fig. 3b and Supplementary Table S11).

Figure 3:

Figure 3:

Transposable element (TE) accumulation history in the Octopus genomes. Kimura distance-based copy divergence analysis of TEs for (a)O. minor and (b)O. bimaculoides. x-axis, K-value; y-axis, genome coverage for each type of TE.

A more recent expansion of LINEs, without an increase in SINEs, was detected in the O. minor genome, while ancient copies of all four types of TEs and an ancient transposition burst of DNA transposons were observed in O. bimaculoides. Using the recent TE expansion in the O. minor genome, we correlated Jukes–Cantor distance measures with dS and identified two unique expansion waves at 0.04 and 0.09 compared to the distribution of O. bimaculoides TEs (Supplementary Figs. S5 and S6). This suggests that a major expansion of TEs in the O. minor genome occurred 11 to 25 Mya, which is after the divergence of O. minor and O. bimaculoides.

Conclusions

Octopus minor has developed morphological and physiological adaptations to match their unique mudflat habitats. In summary, we generated a high-quality sequence assembly for O. minor to elucidate the molecular mechanisms underlying their adaptations. In a direct comparison between the genomes of O. minor and O. bimaculoides, we discovered that they evolved recently and independently from the octopus lineage during the successful transition from an aquatic habitat to mudflats. We also found evidence suggesting that speciation in the genus Octopus is closely related to the gene family expansion associated with environmental adaptation. Finally, in addition to providing insights into the genome size increase via gene family expansion, the O. minor genome sequence also provides an essential resource for studies of Cephalopoda evolution.

Availability of supporting data

The octopus (O. minor) genome project was deposited at the National Center for Biotechnology Information under BioProject number PRJNA421033. The whole-genome sequence was deposited in the Sequence Read Archive (SRA) database under accession number SRX3462978, and isoform sequence from PacBio sequencing data was deposited in the SRA database under accession numbers SRX3478495 and SRX3478496. Other supporting data, including annotations, alignments, and BUSCO results, are available in the GigaScience repository, GigaDB [37].

Additional files

Fig. S1. Estimation of genome size of O. minor based on distribution of 17 k-mer frequency in raw sequencing reads.

Fig. S2. Genome size determination by flow cytometry. The flow cytometry analysis provides as estimation of Propidium iodide (PI) staining. Accepting a haploid genome size estimate of 2.81 Gb for Mouse (Assembly; GRCm38.p6), we estimate the genome size of O. minor to be 5.38 Gb.

Fig. S3. Blast top hit distribution.

Fig. S4. Composition of transposable elements in the regions of gene and intergenic sequence.

Fig. S5. Transposable elements Juke-cantor distance distribution.

Fig. S6. Transposable elements Juke-cantor distance distribution of O. minor.

Table S1. Statistics for SMRT sequencing for the O. minor genome sequencing.

Table S2. Isoform sequencing summary of transcriptome analysis of O. minor using PacBio RSII.

Table S3. Brief summary of gene statistics.

Table S4. Functional annotation statistics of transcriptome assembly.

Table S5. Summary of orthologous gene clusters analyzed in 14 species.

Table S6. CAFE gene family analysis results.

Table S7. Example of top 30 CAFE significantly expanded gene families.

Table S8. Example of top 30 CAFE significantly shrinked gene families.

Table S9. Top 30 expanded Pfam domains.

Table S10. Top 30 expanded EggNOG domains.

Table S11. Statistics of repeat analysis of the O. minor genome.

Table S12. Classifications and frequencies of transposable elements and other repeats.

Table S13. Classifications and frequencies of simple repeats.

Supplementary text commands

Abbreviations

BLAST: Basic Local Alignment Search Tool; BUSCO: Benchmarking Universal Single-Copy Orthologs; LINEs: Long Interspersed Nuclear Elements; LTR: Long Terminal Repeat; Mya: Million years ago; PacBio: Pacific Biosciences; RC: Rolling Circle; SINEs: Short Interspersed Nuclear Elements; SMRT: Single-Molecular Real-Time; TEs: Transposable Elements.

Ethics Statement

No specific permits were required for the described field studies, no specific permissions were required for these locations/activities, and the field studies did not involve endangered or protected species.

Competing interests

The authors declare that they have no competing interests.

Funding

This work was supported by a grant from National Marine Biodiversity Institute of Korea (MABIK)(2018M00900).

Author contributions

H.S.A., H.P., and J.L. conceived the study. H.P., B.K., S.K., D.A., S.J., J.L., H.R., and S.L. performed genome sequencing, assembly, and annotation. S.J., Y.H., K.R., and S.C. performed experiments. J.S.Y., H.S.A., H.P., S.J., and J.L. advised and coordinated the study. B.K., S.K., D.A., and H.P. mainly wrote the paper. All authors contributed to writing and editing the manuscript and supplementary information and to producing the figures.

Supplementary Material

Reviewer_1_Report_(Original_Submission) -- Joseph F. Ryan

6/18/2018 Reviewed

Reviewer_2_Report_(Original_Submission) -- Yi-Jyun Luo

6/24/2018 Reviewed

Reviewer_1_Report_Revision_1 -- Joseph F. Ryan

8/23/2018 Reviewed

Reviewer_2_Report_Revision_1 -- Yi-Jyun Luo

8/24/2018 Reviewed

Response_To_Reviewer_Comments_Revision_1.pdf
Response_To_Reviewer_Comments_(Original_Submission).pdf
GIGA-D-18-00174_Original_Submission.pdf
GIGA-D-18-00174_Revision_1.pdf
GIGA-D-18-00174_Revision_2.pdf
Supplemental Files

ACKNOWLEDGEMENTS

We thank Jong Won Han and Ha Yeun Song of MABIK for the sampling of 18 tissues used for transcriptome assembly, as well as Keekwang Kim of Chungnam National University and Kun-Hee Kim of Chonnam National University for their devotion to estimating the genome size of O. minor by flow cytometry. We also thank Jeollanam-Do Oceans & Fisheries Science Institute for providing octopus embryos.

References

  • 1. Takeuchi T, Kawashima T, Koyanagi R,et al. . Draft genome of the pearl oyster Pinctada fucata: a platform for understanding bivalve biology. DNA Res. 2012, 19, 117–130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Zhang G, Fang X, Guo X, et al. . The oyster genome reveals stress adaptation and complexity of shell formation. Nature. 2012;490(7418):49. [DOI] [PubMed] [Google Scholar]
  • 3. Albertin CB, Simakov O, Mitros T et al. . The octopus genome and the evolution of cephalopod neural and morphological novelties. Nature. 2015;524(7564):220–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Luo Y-J, Takeuchi T, Koyanagi R, et al. . The Lingula genome provides insights into brachiopod evolution and the origin of phosphate biomineralization. Nature Communications. 2015;6:8301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Boyle P, Rodhouse P. Cephalopods: Ecology and Fisheries. Oxford: Blackwell Science Ltd; 2005. [Google Scholar]
  • 6. Hanlon RT, Messenger JB. Cephalopod Behaviour. Cambridge: Cambridge University Press; 1998. [Google Scholar]
  • 7. Guzik MT, Norman MD, Crozier RH. Molecular phylogeny of the benthic shallow-water octopuses (Cephalopoda: Octopodinae). Mol Phylogen Evol. 2005;37(1):235–48. [DOI] [PubMed] [Google Scholar]
  • 8. Hochner B, Shomrat T, Fiorito G. The octopus: a model for a comparative analysis of the evolution of learning and memory mechanisms. Biol Bull. 2006;210(3):308–17. [DOI] [PubMed] [Google Scholar]
  • 9. Mather JA. Cephalopod consciousness: behavioural evidence. Conscious Cogn. 2008;17(1):37–48. [DOI] [PubMed] [Google Scholar]
  • 10. MIFAFF. Food, Agriculture, Forestry and Fisheries Statistical Yearbook. Seoul: Forestry and Fisheries (MIFAFF) Press; 2012. [Google Scholar]
  • 11. Roper CFE, Sweeney MJ, Nauen CE. Cephalopods of The World. An Annotated and Illustrated Catalogue of Species of Interest to Fisheries. FAO species catalogue. Vol. 3. [Google Scholar]
  • 12. Marçais G, Kingsford C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics. 2011;27(6):764–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Chin C-S, Peluso P, Sedlazeck FJ, et al. . Phased diploid genome assembly with single molecule real-time sequencing. Nat Methods. 2016;13(12):1050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Simão FA, Waterhouse RM, Ioannidis P et al. . BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31(19):3210–2. [DOI] [PubMed] [Google Scholar]
  • 15. Gordon SP, Tseng E, Salamov A et al. . Widespread polycistronic transcripts in fungi revealed by single-molecule mRNA sequencing. PLoS One. 2015;10(7):e0132628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Holt C, Yandell M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinformatics. 2011;12(1):491. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Smit AFA HR, Hubley, Green P. RepeatMasker Open-3.0. 1996-2010(http://www.RepeatMakser.org) (RRID:SCR_012954). [Google Scholar]
  • 18. Bao Z, Eddy SR. Automated de novo identification of repeat sequence families in sequenced genomes. Genome Res. 2002;12(8):1269–76. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Price AL, Jones NC, Pevzner PA. De novo identification of repeat families in large genomes. Bioinformatics. 2005;21(suppl_1):i351–i8. [DOI] [PubMed] [Google Scholar]
  • 20. Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999;27(2):573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Korf I. Gene finding in novel genomes. BMC Bioinformatics. 2004;5(1):59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Nawrocki EP, Kolbe DL, Eddy SR. Infernal 1.0: inference of RNA alignments. Bioinformatics. 2009;25(10):1335–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Gardner PP, Daub J, Tate J, et al. . Rfam: Wikipedia, clans and the “decimal” release. Nucleic Acids Res. 2010;39(suppl_1):D141–D5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25(5):955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Li L, Stoeckert CJ Jr, Roos DS. OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 2003;13(9):2178–89. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Finn RD, Bateman A, Clements J et al. . Pfam: the protein families database. Nucleic Acids Res. 2013;42(D1):D222–D30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Löytynoja A, Goldman N. An algorithm for progressive multiple alignment of sequences with insertions. PNAS. 2005;102(30):10557–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Castresana J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 2000;17(4):540–52. [DOI] [PubMed] [Google Scholar]
  • 29. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Hedges SB, Dudley J, Kumar S. TimeTree: a public knowledge-base of divergence times among organisms. Bioinformatics. 2006;22(23):2971–2. [DOI] [PubMed] [Google Scholar]
  • 31. Han MV, Thomas GW, Lugo-Martinez J, et al. . Estimating gene gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. Mol Biol Evol. 2013;30(8):1987–97. [DOI] [PubMed] [Google Scholar]
  • 32. Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24(8):1586–91. [DOI] [PubMed] [Google Scholar]
  • 33. Sievers F, Wilm A, Dineen D et al. . Fast, scalable generation of high‐quality protein multiple sequence alignments using Clustal Omega. Molecular Systems Biology. 2011;7(1):539. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Price MN, Dehal PS, Arkin AP. FastTree 2–approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5(3):e9490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Wicker T, Sabot F, Hua-Van A et al. . A unified classification system for eukaryotic transposable elements. Nat Rev Genet. 2007;8(12):973–82. [DOI] [PubMed] [Google Scholar]
  • 37. Kim B, Kang S, Ahn D et al. . Supporting data for “The genome of common long-arm octopus Octopus minor.”. GigaScience Database. 2018. http://dx.doi.org/10.5524/100503. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Reviewer_1_Report_(Original_Submission) -- Joseph F. Ryan

6/18/2018 Reviewed

Reviewer_2_Report_(Original_Submission) -- Yi-Jyun Luo

6/24/2018 Reviewed

Reviewer_1_Report_Revision_1 -- Joseph F. Ryan

8/23/2018 Reviewed

Reviewer_2_Report_Revision_1 -- Yi-Jyun Luo

8/24/2018 Reviewed

Response_To_Reviewer_Comments_Revision_1.pdf
Response_To_Reviewer_Comments_(Original_Submission).pdf
GIGA-D-18-00174_Original_Submission.pdf
GIGA-D-18-00174_Revision_1.pdf
GIGA-D-18-00174_Revision_2.pdf
Supplemental Files

Articles from GigaScience are provided here courtesy of Oxford University Press

RESOURCES