Transcriptome of the Deep-Sea Black Scabbardfish, Aphanopus carbo (Perciformes: Trichiuridae): Tissue-Specific Expression Patterns and Candidate Genes Associated to Depth Adaptation

Sergio Stefanni; Raul Bettencourt; Miguel Pinheiro; Gianluca De Moro; Lucia Bongiorni; Alberto Pallavicini

doi:10.1155/2014/267482

2014 Sep 17;2014:267482. doi: 10.1155/2014/267482

PMCID: PMC4182897 PMID: 25309900

Abstract

Deep-sea fishes provide a unique opportunity to study the physiology of, and evolutionary adaptation to, extreme environments. We carried out high-throughput sequencing on a 454 GS-FLX Titanium plate using unnormalized cDNA libraries from six tissues of A. carbo. Assembly and annotation were performed with Newbler and InterPro/Pfam analyses, respectively. The assembly of 544,491 high-quality reads yielded 8,319 contigs, 55.6% of which retrieved BLAST hits against the NCBI nonredundant database or were annotated with ESTScan. Comparison of functional genes associated with adaptation to depth, at the levels of both protein sequence and protein stability, revealed similarities between A. carbo and other bathypelagic fishes. A selection of putative genes was used to evaluate the correlation between the number of contigs and their normalized expression, as determined by qPCR amplification. Screening of the libraries contributed to the identification of new EST simple-sequence repeats (SSRs) and to the design of primer pairs suitable for population genetic studies as well as for tagging and mapping of genes. This first characterization of the transcriptome of the deep-sea fish A. carbo is expected to provide abundant resources for genetic, evolutionary, and ecological studies of this species and a basis for further investigation of depth-related adaptation processes in fishes.

1. Introduction

The deep sea (>1000 m depth) covers about 70% of the Earth's surface and is one of the last large unexplored areas on the planet. Only within the last few decades has technology advanced sufficiently to reach the deep sea effectively, revealing unexpectedly high levels of biodiversity and extremely diverse habitats (canyons, cold seeps, hydrothermal vents, deep-water coral reefs, mud volcanoes, seamounts, and trenches) of significant conservation interest and potentially high economic value. Deep-sea environments are characterized by extremely high hydrostatic pressure (increasing by 1 MPa every 100 m), lack of light, and low temperatures (down to 1-2°C). Fish, like any other organism living in the deep sea, have therefore had to adapt to tolerate the conditions of this extreme habitat [1].

The first studies on adaptation to high pressure and low temperature date back to the 1970s and compared common proteins present in shallow- and deep-water fishes [2, 3]. Key enzymes in muscle tissue that exhibit adaptive differences among species living at different depths are lactate dehydrogenase (LDH) and malate dehydrogenase (MDH), which differ in structural stability (reviewed in [4–6]). More recent studies on the evolutionary adaptation of functional genes to high pressure report unique amino acid substitutions in α-skeletal actin and myosin heavy chain (MyHC) proteins in deep-sea fishes [7–10]. For deep-sea species inhabiting hydrothermal vents and cold seeps, environments characterized by high pressure, chronic hypoxia, and high concentrations of toxic compounds, the molecular and functional adaptations of hemoglobins (Hbs) are reviewed by Hourdez and Weber [11]. Despite these studies, our knowledge of large-scale gene expression patterns in deep-sea fish remains limited.

The black scabbardfish, Aphanopus carbo (Lowe 1839), is a bathypelagic species belonging to the family Trichiuridae and is distributed in temperate-cold Atlantic waters at depths between 200 and 1800 m [12, 13]. A. carbo is a commercially valuable species for several regions of the Iberian Peninsula, especially in Madeira, where catches have reached up to 1000 tons per year [14], amounting to ca. 55% of the total landings. Recently this species has become increasingly targeted by Portuguese, French, and Irish fishing fleets ([15] and literature therein), and fishery data have shown a constant population decline [16]. The information available on the biology, maturity, spawning, and growth of this species [17, 18] is scattered. Recent studies report a panmictic distribution of this species in the NE Atlantic, with multiple breeding sites at low latitudes [19]. It is also worth mentioning that, in southern locations, this species lives in sympatry with A. intermedius, a closely related species with very similar morphology [20], which makes it of interest for evolutionary studies.

High-throughput sequencing approaches applied to transcriptomics now provide a global perspective on the taxonomic and functional profiling of genes expected to be expressed under the environmental conditions in which these organisms live. Also known as next-generation sequencing, these techniques allow the massive characterization of expressed sequence tags (ESTs), providing an overview of the genes expressed in a given tissue at a given time [21]. In silico analyses of large gene libraries serve several purposes, for instance, the discovery and identification of new genes, the characterization of gene expression, and the development of novel genetic markers for quantitative trait locus (QTL) and population genomic analyses. The applications of next-generation sequencing extend over a variety of biological questions, including those regarding a species' ecology, life history, and evolution [22, 23].

Previous transcriptome sequencing and gene expression studies in deep-water species were mostly limited to hydrothermal vent invertebrates [24] and to microbial communities in hydrothermal plumes [25], deep sediments [26], and the water column [27], leaving vertebrate species largely under-represented. The present work is a pioneering study for deep-sea fishes, providing new insights into the role of differential gene expression in the environmental adaptation of the deep-sea black scabbardfish.

Here we describe the assembly and annotation of the transcriptome of A. carbo obtained by sequencing mRNA libraries of six tissues (spleen, brain, heart, gonads, liver, and muscle) and explore functional genes whose sequences might be associated with depth adaptation. Additionally, for selected candidate genes, we tested the correlation between the number of contigs and gene expression normalized to a relative value of 1.0, as determined by qPCR amplification. Furthermore, screening of the libraries allowed the identification of new EST simple-sequence repeats (SSRs) and the design of primer pairs suitable for population genetic studies as well as for tagging and mapping of genes.

2. Methods

2.1. Fish and Tissue Samples

Specimens of Aphanopus carbo (two males and two females) were collected in 2009 on board the RV “Arquipelago”. The fish were caught at depths of 1100–1250 m using deep-water longlines near the Condor Seamount, located approximately 15 nautical miles SW of the island of Fayal (Azores, Portugal). The four specimens used in this study were caught on the same longline set. Once on board, the freshly caught animals were dissected, and portions (or complete organs) of spleen, brain, heart, gonads, liver, and muscle tissues were preserved in formamide solution and kept at −20°C until RNA extraction.

To validate the species identification, a small portion of muscle tissue was also preserved in 95% ethanol for molecular screening following the protocols of Stefanni et al. [28].

2.2. RNA Extraction and Sequencing

Total RNA was extracted from 20 to 40 mg of each of the six preserved tissues of a pool of four A. carbo individuals using the RiboPure kit (Ambion, Applied Biosystems). The quantity and purity of the RNA were determined on 1.4% agarose-MOPS-formaldehyde denaturing gels and by assessing the A260/280 and A260/230 ratios using a NanoVue spectrophotometer (GE Healthcare). Poly-A RNA was extracted from 15 μg of each total RNA sample using the Poly(A)Purist mRNA Purification kit according to the manufacturer's instructions (Ambion, Applied Biosystems). mRNAs were reverse-transcribed into cDNA using the Mint-2 cDNA synthesis kit (Evrogen, http://www.evrogen.com/) according to the manufacturer's instructions for NGS platforms. Six cDNA libraries were constructed from the mRNA of the individual tissue pools and sequenced in a single 454 GS FLX Titanium run. Each cDNA library was labelled with a unique sequence tag (MID), which allowed the sequences generated from each tissue to be traced back after assembly.

cDNAs were sheared by nebulization to yield random fragments approximately 500–800 bp in length, by applying 30 psi (2.1 bar) of nitrogen for 1 minute to 4 μg of each library. The distribution of fragments was verified on a BioAnalyser DNA 7500 LabChip (Agilent Technologies). The fragmented cDNA sample was end-repaired with T4 DNA polymerase and T4 polynucleotide kinase, and adaptor sequences were ligated according to the manufacturer's instructions [29]. The fragments were immobilized onto streptavidin beads and nick-repaired with Bst polymerase. The cDNA fragments were denatured with alkali to yield a single-stranded cDNA (sscDNA) library. The quality of the library was assayed on a BioAnalyser RNA 6000 Pico LabChip (Agilent Technologies) and its quantity measured by spectrofluorimetry with the Quant-iT RiboGreen RNA Assay kit (Invitrogen). A titration was set up at 1, 2, 4, and 8 copies per bead (cpb) in the clonal amplification by emulsion PCR to optimize yield and sequence quality. The percent enrichment of beads carrying the sscDNA was determined and the amount of library input calculated to 18%. A large-scale emulsion PCR was set up based on this value, and the library was sequenced at Biocant (Cantanhede, Portugal) by 454 GS FLX Titanium pyrosequencing on a full 70 × 75 PicoTiterPlate, according to the manufacturer's instructions (Roche).

2.3. Bioinformatic Analyses

High-quality reads were assembled using Newbler ver. 2.6 (Roche 454) sequence analysis software. All reads were identified by their unique MIDs and grouped by tissue of origin. Trimming and masking of poly(A) tails were performed as a standard procedure of the assembly tool.
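As an illustration of the MID-based grouping described above, the following minimal sketch assigns reads to tissues by an exact match on a leading tag. The tag sequences, the tag-to-tissue mapping, and the function name are hypothetical placeholders and are not the MIDs used in this study; in practice this step is handled by the sequencing and assembly software.

```python
from collections import defaultdict

# Hypothetical tag-to-tissue mapping; the MID sequences actually used in the
# study are not given in the text.
MID_TO_TISSUE = {
    "ACGAGTGCGT": "spleen",
    "ACGCTCGACA": "brain",
    "AGACGCACTC": "heart",
}

def group_reads_by_mid(reads, mid_to_tissue=MID_TO_TISSUE):
    """Group reads by tissue using an exact match on the leading MID tag.

    Returns a dict mapping tissue -> list of reads with the tag removed.
    Reads without a recognised tag are collected under "unassigned".
    """
    groups = defaultdict(list)
    for read in reads:
        for mid, tissue in mid_to_tissue.items():
            if read.startswith(mid):
                groups[tissue].append(read[len(mid):])
                break
        else:
            # No tag matched this read.
            groups["unassigned"].append(read)
    return dict(groups)
```

Exact matching is the simplest possible choice; a production demultiplexer would normally tolerate sequencing errors within the tag.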

The assembly is based on read overlaps and multiple alignments made in nucleotide space. Consensus base-calling and quality value determination for contigs are performed in flow space, which improves the accuracy of the final base calls. The software was run with default parameters. Assembled contigs were annotated through sequence similarity searches against the National Center for Biotechnology Information (NCBI) nonredundant (nr) protein database using BLASTx [30] with an expect-value (e-value) cut-off of < 10⁻⁶. Contigs without a hit were further processed with ESTScan (http://www.ch.embnet.org/software/ESTScan2.html). The two sets of amino acid sequences, resulting from the high-stringency BLASTx searches and from ESTScan, were processed with InterProScan for functional annotation of transcripts, including the mapping of gene ontology (GO) terms. The GO method classifies genes within a hierarchy using a systematic nomenclature of attributes that can be assigned to all gene products independently of the organism of origin. To reduce redundancy among consensus sequences corresponding to the same gene, we used BLASTClust to detect similar assemblies with 95% identity and 90% coverage. All results from both annotation routes were loaded into an SQL database developed for this purpose.
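The two-step annotation logic (BLASTx first, ESTScan for contigs without a hit) can be sketched as follows. This is a minimal illustration, assuming the BLAST results are available as 12-column tab-separated text with the query ID in column 1 and the e-value in column 11; the output format actually used by the authors is not stated, and the function name is hypothetical.

```python
import csv
import io

E_VALUE_CUTOFF = 1e-6  # threshold stated in the text

def split_by_best_hit(tabular_text, all_contigs, cutoff=E_VALUE_CUTOFF):
    """Partition contigs into those with a hit below the e-value cutoff
    and those left over for a second pass (e.g. ESTScan).

    Assumes 12-column tab-separated BLAST output: column 1 is the query ID
    and column 11 is the e-value (an assumption, not taken from the paper).
    """
    best = {}
    for row in csv.reader(io.StringIO(tabular_text), delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank and comment lines
        query, evalue = row[0], float(row[10])
        # Keep the smallest e-value seen for each query.
        if query not in best or evalue < best[query]:
            best[query] = evalue
    annotated = sorted(q for q, e in best.items() if e < cutoff)
    unannotated = sorted(set(all_contigs) - set(annotated))
    return annotated, unannotated
```

The second list is what would be passed on to the coding-region predictor in a workflow of this kind.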

To validate the accuracy of the assembly, the resulting contigs were compared to previously sequenced transcriptomes of six teleosts (Danio rerio, Gasterosteus aculeatus, Oreochromis niloticus, Oryzias latipes, Takifugu rubripes, and Tetraodon nigroviridis) using tBLASTn [30] to find protein homologs at two levels of stringency (e-value < 10⁻³ and e-value < 10⁻¹⁰).

To identify conserved protein domains specific to each tissue analysed, a new annotation was performed with HMMER against the Pfam database (ver. 25.0). The representativeness of each protein domain in each tissue was assessed by comparing its abundance in that tissue with its abundance in all tissues combined, using a hypergeometric test.
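A hypergeometric enrichment test of this kind can be sketched with the Python standard library. The sketch assumes a one-sided (upper-tail) test, which the text does not specify, and the argument names are illustrative.

```python
from math import comb

def hypergeom_enrichment_p(k, n, K, N):
    """Upper-tail hypergeometric P-value, P(X >= k).

    N: total domain hits across all tissues combined
    K: hits of the domain of interest across all tissues
    n: total domain hits in the tissue being tested
    k: hits of the domain of interest in that tissue
    """
    upper = min(n, K)
    total = comb(N, n)
    # math.comb returns 0 when the lower index exceeds the upper one,
    # so impossible configurations contribute nothing to the sum.
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / total
```

A small P-value indicates that the domain occurs in the tissue more often than expected from its overall abundance; no multiple-testing correction is applied in this sketch.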

2.4. cDNA Synthesis and qPCR Validation Tests

Fresh cDNA was synthesised from the six mRNA samples that were used for pyrosequencing. cDNA synthesis was performed using oligo(dT) primers and the ThermoScript RT-PCR System (Invitrogen), following the manufacturer's instructions.

A set of 28 genes was selected, including tissue-specific candidates and genes found at similar as well as at different levels across all six libraries, with the aim of covering most of the possible expression scenarios within the dataset. Frequencies of contigs for all candidate genes in the mRNA libraries were obtained by detecting orthologous gene sequences using the BLAST tool included in the A. carbo database.
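One way to compare contig frequencies with qPCR results is to rescale the counts so that the largest value equals 1.0 and then compute a correlation coefficient. The sketch below uses Pearson's r, which is an assumption, since the statistic used is not named in this section. The example values are invented for illustration and are not data from this study.

```python
from math import sqrt

def scale_to_max(values):
    """Rescale values so that the largest becomes 1.0."""
    top = max(values)
    if top == 0:
        return [0.0 for _ in values]
    return [v / top for v in values]

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for constant input")
    return sxy / sqrt(sxx * syy)

# Invented example values (NOT data from the study): counts of one gene in
# four libraries and matching relative qPCR expression levels.
counts = [0, 10, 20, 40]
qpcr_relative = [0.0, 0.25, 0.5, 1.0]
r = pearson_r(scale_to_max(counts), qpcr_relative)
```

Because Pearson's r is unchanged by linear rescaling, the scaling step affects only how the values are presented, not the coefficient itself.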

For the design of all qPCR primers (Table 1) we used the web interface NCBI Primer-BLAST (http://blast.ncbi.nlm.nih.gov/). Alignments of the sequences provided by the output from the internal blast search were used to select all primer sets.

Table 1.

List of genes targeted by qPCR, primer sets specifically designed for this study, size of each product, and NCBI accession number for all the EST sequences.

Gene	Primer name	Primer Sequences (5′-3′)	Size (bp)	NCBI Accession #
Elongation factor 1-beta	EF-1B L	GCTTGGACATGTCGGTCTCGTC	229 bp	All_gs454_000396
Elongation factor 1-beta	EF-1B H	GTGGCTGACACCACATCTGGC	229 bp	All_gs454_000396
Ras-related GTP-binding protein A	Rab-1A L	AGTAGCCGTTCCACCTTGTCGG	247 bp	All_gs454_000598
Ras-related GTP-binding protein A	Rab-1A H	TGCCAAGAAACCGTACGTGGGA	247 bp	All_gs454_000598
Basic Transcription Factor 3-like 4	BTF3 L	CCCAAAGTTCAGGCCTCCCTGT	273 bp	All_gs454_000873
Basic Transcription Factor 3-like 4	BTF3 H	TCATGTGCGTCAGTTCGCTTCG	273 bp	All_gs454_000873
Cu/Zn Superoxide Dismutase	SOD-1 L	AAACGTGACTGCAGGAGGGGAT	240 bp	All_gs454_000925
Cu/Zn Superoxide Dismutase	SOD-1 H	CAGTGCTCCTGCTCCATGTTCG	240 bp	All_gs454_000925
2-Cys Peroxiredoxin	PRDX1 L	CCGATAACCTCGCAGCCGATAC	243 bp	All_gs454_000558
2-Cys Peroxiredoxin	PRDX1 H	ACAGTCATTTGCCACCAGCATCA	243 bp	All_gs454_000558
Heat Shock Protein 90	HSP90 L	TGACGATGTCCCCACAGATGAGG	221 bp	All_gs454_000008
Heat Shock Protein 90	HSP90 H	GCAACACTGGTCCACCACACAAC	221 bp	All_gs454_000008
Ferritin, heavy subunit	Ferr L	CCTGCAGCTTGAGAAGAGCGTC	203 bp	All_gs454_000681
Ferritin, heavy subunit	Ferr H	CAAACAGGTACTCGGCCATGCC	203 bp	All_gs454_000681
α ₂ Globin	Hb-A L	AAATTGTTGGCCATGCGGAGGA	208 bp	All_gs454_001919
α ₂ Globin	Hb-A H	CTGAGGTTCAGCAGACCTGCCT	208 bp	All_gs454_001919
β ₂ Globin	Hb-B L	TCGTCTACCCCTGGTGTCAGAG	245 bp	All_gs454_001018
β ₂ Globin	Hb-B H	AACCACAATGGTCAGGCAGTCC	245 bp	All_gs454_001018
Ependymin-1 precursor	EPD-1 L	CAGGTGTGAGGCAGTGCAGT	230 bp	All_gs454_000469
Ependymin-1 precursor	EPD-1 H	ACCCCGATCTCCTCCTGGTG	230 bp	All_gs454_000469
Fatty acid-binding protein, brain	BLBP L	CAACACTTCTTGGCCGGTTTGG	239 bp	All_gs454_001220
Fatty acid-binding protein, brain	BLBP H	GAGAGGAGTTCGACGAAGCCAC	239 bp	All_gs454_001220
CD63-like protein Sm-TSP-2	TSPAN-8 L	TCGCTGGCTGCTCTGAGAAAGA	200 bp	All_gs454_000381
CD63-like protein Sm-TSP-2	TSPAN-8 H	GGTCACGCCGAGCTGTATTCTG	200 bp	All_gs454_000381
Tropomyosin 4 isoform 1	TRPM-1 L	GTGGAGGAGGAGTTGGACCGAG	221 bp	All_gs454_000222
Tropomyosin 4 isoform 1	TRPM-1 H	TTGCGAGCCACCTCCTCGTATT	221 bp	All_gs454_000222
C-Myc-binding protein	MYCBP L	CGCCAGTTTACCTGCGTTCCAA	182 bp	All_gs454_001640
C-Myc-binding protein	MYCBP H	GGCCGTCAACAACACCACCTTT	182 bp	All_gs454_001640
Cathepsin S	CTSS L	AACAGCCTACCCCTACACAGCC	200 bp	All_gs454_000156
Cathepsin S	CTSS H	TGTACACACCGTGGCGGTAGAA	200 bp	All_gs454_000156
Transferrin	STF-1 L	AGCTGCACCAGCTTCACAGTTG	215 bp	All_gs454_000004
Transferrin	STF-1 H	AAGGATGGCACCAGACAACCCA	215 bp	All_gs454_000004
Warm Temperature Acclimation related-like 65 kDa protein	HPX L	TGATACCGGGTGGAACCTGGTG	207 bp	All_gs454_000060
Warm Temperature Acclimation related-like 65 kDa protein	HPX H	GCTGCTGTGGAGTGTCCCAAAG	207 bp	All_gs454_000060
Betaine Homocysteine S-methyltransferase	BHTM L	GGGGGTTCGCTGTTACCAAGTG	194 bp	All_gs454_000088
Betaine Homocysteine S-methyltransferase	BHTM H	TGTGAGACAGCAGCCTCAGGAG	194 bp	All_gs454_000088
FUCL1 Fucolectin-1	FUCL1 L	CGCAAACCCTTTGGCTGGTGTA	196 bp	All_gs454_000758
FUCL1 Fucolectin-1	FUCL1 H	GGCTTTTCCTTGGACTGCCAGG	196 bp	All_gs454_000758
Aldolase B	ALDB L	GCCATTGGTCTTGGCCCTGATC	220 bp	All_gs454_000115
Aldolase B	ALDB H	CGCTGTGCCTGGTATCTGCTTC	220 bp	All_gs454_000115
Type-4 ice-structuring protein LS-12 precursor	ISP LS12 L	AAGACCTGACAAACCAGGCCCA	198 bp	All_gs454_001277
Type-4 ice-structuring protein LS-12 precursor	ISP LS12 H	GGAGGATGGCCTCCATCTGCTT	198 bp	All_gs454_001277
Alcohol Dehydrogenase 8a	ADH L	GGCAAGAAGGTGCTGCAGTTCA	228 bp	All_gs454_000105-6
Alcohol Dehydrogenase 8a	ADH H	CATGACTGCAGCCAAACCCACA	228 bp	All_gs454_000105-6
Glyceraldehyde-3-phosphate Dehydrogenase	GAPDH L	GTCAACCACTGACACGTTGGGG	229 bp	All_gs454_000148
Glyceraldehyde-3-phosphate Dehydrogenase	GAPDH H	CGGCATCATTGAGGGCCTGATG	229 bp	All_gs454_000148
Lactate Dehydrogenase-A	LDH-A L	TCTTAACCTGGTGCAGCGCAAC	219 bp	All_gs454_000149
Lactate Dehydrogenase-A	LDH-A H	TGGAGCTTCTCGCCCATGATGT	219 bp	All_gs454_000149
Phosphoglycerate Mutase 2-1 (muscle)	PglyM L	ACACCTCTGTGCTGAAACGTGC	212 bp	All_gs454_000309
Phosphoglycerate Mutase 2-1 (muscle)	PglyM H	CATGGGTGGAGGTGGGATGTCA	212 bp	All_gs454_000309
Heat Shock Protein 70	HSP70 L	CGGTGTTGTGTGCTGGGTGAAA	207 bp	All_gs454_000005
Heat Shock Protein 70	HSP70 H	CCACATAGCTGGGTGTGGTCCT	207 bp	All_gs454_000005
Fructose-bisphosphate Aldolase A	FBPA L	GGAACCAACGGCGAGACAACAA	208 bp	All_gs454_002732
Fructose-bisphosphate Aldolase A	FBPA H	CAATGGGGACGATGCCATGCAT	208 bp	All_gs454_002732
Phosphoglucose Isomerase-2	PGI L	CCACACTGGGCCAATTGTCTGG	217 bp	All_gs454_000011
Phosphoglucose Isomerase-2	PGI H	GGCCTCCTCTGTGGTCTTACCC	217 bp	All_gs454_000011

Tissue	Spleen	Brain	Heart	Gonad	Liver	Muscle	Total
Total EST	15,034	33,337	73,263	157,275	134,523	92,788	544,491
Total bases	3,426,510	8,219,500	17,647,600	37,123,700	31,792,600	23,342,900	129,412,000
Contigs	567	651	1,260	3,875	1,274	626	8,319
Average contig length	470	619	567	465	612	689	555
Contigs e-value < 10⁻⁶	220	420	622	951	617	409	2,440
ESTscan	584	345	977	2,838	634	269	2,715
No similarity	36	70	109	406	211	74	1,128
GO annotation	202	338	473	623	509	307	1,728
InterPro annotation	223	417	610	908	649	395	2,395

Species	Sequences available	A. carbo e-value < 10⁻³	A. carbo e-value < 10⁻¹⁰
Danio rerio	42,787	2,457	2,199
Gasterosteus aculeatus	27,576	2,445	2,223
Oreochromis niloticus	26,763	2,436	2,202
Oryzias latipes	24,674	2,412	2,173
Takifugu rubripes	47,841	2,268	2,062
Tetraodon nigroviridis	23,118	2,300	2,073

Gonads				Heart				Liver
Pfam ID	Domain	Freq	P	Pfam ID	Domain	Freq	P	Pfam ID	Domain	Freq	P
PF00100	Zona pellucida	17	3.15 × 10⁻¹³	PF00011	HSP20	3	4.03 × 10⁻⁴	PF00084	Sushi	14	1.44 × 10⁻¹⁰
PF00125	Histone	8	6.06 × 10⁻⁵	PF13405	EF hand 4	5	6.77 × 10⁻⁴	PF00089	Trypsin	20	7.59 × 10⁻⁹
PF01400	Astacin	3	3.22 × 10⁻³	PF00412	LIM	3	1.52 × 10⁻³	PF00079	Serpin	12	5.63 × 10⁻⁶
PF00069	Pkinase	3	0.01	PF05556	Calsarcin	3	3.60 × 10⁻³	PF07678	A2M comp	4	1.35 × 10⁻⁴
PF00653	BIR	2	0.02	PF01576	Myosin tail 1	3	3.60 × 10⁻³	PF01042	Ribonuc L-PSP	3	1.26 × 10⁻³
PF13424	TPR 12	2	0.02	PF00056	Ldh 1 N	3	3.60 × 10⁻³	PF00045	Hemopexin	3	1.26 × 10⁻³
PF13695	zf-3CxxC	2	0.02	PF05300	DUF737	2	5.50 × 10⁻³	PF00059	Lectin C	6	1.69 × 10⁻³
PF09360	zf-CDGSH	2	0.02	PF00992	Troponin	4	6.40 × 10⁻³	PF00701	DHDPS	4	4.64 × 10⁻³
PF01712	dNK	2	0.02	PF00595	PDZ	3	6.82 × 10⁻³	PF00386	C1q	8	6.63 × 10⁻³
PF00538	Linker histone	2	0.02	PF00022	Actin	3	0.01	PF00021	UPAR LY6	5	0.01
PF10178	DUF2372	2	0.02	PF02874	ATP-synt ab N	2	0.02	PF00754	F5 F8 type C	9	0.01
PF04856	Securin	2	0.02	PF13895	Ig 2	3	0.02	PF08702	Fib alpha	2	0.01
PF00250	Fork head	2	0.02	PF00191	Annexin	2	0.03	PF03982	DAGAT	2	0.01
PF01498	HTH Tnp Tc3 2	3	0.03	PF00212	ANP	2	0.03	PF01048	PNP UDP 1	2	0.01
PF00268	Ribonuc red sm	2	0.06	PF05347	Complex1 LYR	2	0.07	PF01014	Uricase	2	0.01

Gene	Contig code	Spleen	Brain	Gonads	Heart	Liver	Muscle
EF-1B	isotig00991	9	24	33	19	38	54
Rab-1A	isotig02689	4	0	3	3	0	4
BTF3	isotig02213	3	5	7	10	8	2
SOD-1	isotig02222	4	16	32	16	25	14
PRDX1	isotig01838	4	50	59	17	23	27
HSP90	isotig01406	3	64	94	86	86	13
Ferr	isotig01665	33	121	10	210	264	47
Hb-A	isotig06973	406	17	2	24	108	1
Hb-B	isotig00163	2,303	81	16	313	707	22
EPD-1	isotig01567	0	1,190	0	1	0	0
BLBP	isotig02632	0	171	0	0	0	0
TSPAN-8	isotig01397	17	7	3	407	24	9
TRPM-1	isotig01595	1	4	0	637	2	2
MYCBP	isotig01988	0	2	135	1	1	1
CTSS	isotig01659	0	0	135	0	0	0
STF-1	isotig00767	0	1	30	0	3,218	0
HPX	isotig01479	0	0	0	0	1,234	0
BHTM	isotig01489	1	0	0	0	1,024	0
FUCL1	isotig00473	1	0	0	0	22	0
ALDB	isotig01524	0	1	30	0	369	0
ISP LS12	isotig03004	0	0	0	0	211	1
ADH	isotig01491-546	1	2	16	7	292	6
GAPDH	isotig00609	0	8	51	539	134	1,281
LDH-A	isotig01401	2	7	2	5	0	789
PglyM	isotig01642	0	0	0	36	0	241
HSP70	isotig01398	0	0	0	0	2	142
FBPA	isotig00492	2	2	0	233	0	4,471
PGI	isotig01410	0	0	0	26	0	151

Order	Family	Species	Common name	Environment	Climate	Depth range (m)	Gene	NCBI Acc. Nr
Perciformes	Trichiuridae	Aphanopus carbo	Black scabbardfish	M	Deep-water	200–1700	COI	EU854076
Beloniformes	Adrianichthyidae	Oryzias latipes	Japanese rice fish	FW + BR	Subtropical	shallow	ACTA1, MDHc, MyHC, COI	NM_001104806, NM_001163134, XM_004071618, AB498066
Scorpaeniformes	Hexagrammidae	Pleurogrammus azonus	Okhotsk atka mackerel	M	Temperate	0–240	ACTA1	AB073381
Perciformes	Scombridae	Scomber scombrus	Atlantic mackerel	M	Temperate	0–200 (0–1000)	ACTA1, COI	EF607093, KC015895
Perciformes	Percichthyidae	Siniperca chuatsi	Mandarin fish	FW	Temperate	10	ACTA1, MyHC, COI	AY395872, AY454304, NC_015822
Perciformes	Sparidae	Sparus aurata	Gilthead seabream	M	Temperate	1–30 (1–150)	ACTA1	AF190473
Perciformes	Sphyraenidae	Sphyraena idiastes	Pelican barracuda	M	Tropical	3–24	ACTA1, LDH-A, MDHc, mMDH	AF503593, SIU80001, AF390559, AF390561
Tetraodontiformes	Tetraodontidae	Takifugu rubripes	Japanese pufferfish	M + FW + BR	Temperate	0–200 (0–1000)	Hb-A, mMDH, COI	XM_003964767, XM_003965959, HM102315
Scorpaeniformes	Anoplopomatidae	Anoplopoma fimbria	Sablefish	M	Deep-water	0–2740	Hb-B, COI	BT082849, JQ353978
Perciformes	Serranidae	Epinephelus coioides	Orange-spotted grouper	M + BR	Subtropical	1–100	Hb-B	GU982530
Gasterosteiformes	Gasterosteidae	Gasterosteus aculeatus	Three-spined stickleback	M + FW + BR	Temperate	0–100	Hb-B	NM_001267638
Perciformes	Nototheniidae	Notothenia coriiceps	Black rockcod	M	Polar	0–550	LDH-A, MyHC, COI	AF079822, AJ243767, EU326390
Perciformes	Nototheniidae	Notothenia angustata	Maori chief	M	Temperate	0–100	Hb-A, Hb-B	P62363, P29628
Gadiformes	Macrouridae	Coryphaenoides armatus	Abyssal grenadier	M	Deep-water	282–5180	LDH-B, MyHC, COI	AJ609232, AB330140, FJ164497
Gadiformes	Gadidae	Gadus morhua	Atlantic cod	M + BR	Temperate	0–600	LDH-B, COI	AJ609233, KC015385
Gadiformes	Gadidae	Arctogadus glacialis	Arctic cod	M	Deep-water	0–1000	Hb-A, COI	Q1AGS4, KC015200
Perciformes	Latidae	Lates calcarifer	Barramundi	M + FW + BR	Tropical	10–40	LDH-B, COI	FJ439507, JQ431879
Gadiformes	Gadidae	Merlangius merlangus	Whiting	M	Temperate	30–100 (10–200)	LDH-B, COI	AJ609234, JQ623954
Cyprinodontiformes	Poeciliidae	Poecilia reticulata	Guppy	FW + BR	Tropical	Shallow	LDH-B, COI	EF408825, JX968696
Gadiformes	Macrouridae	Trachyrincus murrayi	Roughnose grenadier	M	Deep-water	0–1630	LDH-B, COI	AJ609235, AP008990
Perciformes	Channichthyidae	Chionodraco rastrospinosus	Ocellated icefish	M	Polar	0–1000	LDH-A, COI	AF079829, EU326337
Perciformes	Pomacentridae	Chromis caudalis	Blue-axil chromis	M	Tropical	15–55	LDH-A	AY289558
Perciformes	Gobiidae	Rhinogobiops nicholsii	Blackeye goby	M	Subtropical	?-106	LDH-A	AF079534
Cypriniformes	Cyprinidae	Cyprinus carpio	Common carp	FW + BR	Subtropical	shallow	LDH-A, MyHC, COI	AF076528, D89992, HQ960709
Cyprinodontiformes	Fundulidae	Fundulus heteroclitus	Mummichog	M + FW + BR	Temperate	shallow	LDH-A, LDH-B, COI	L43525, L23792, EU524629
Osmeriformes	Osmeridae	Osmerus mordax	Rainbow smelt	M + FW + BR	Temperate	0–425	MDHc, mMDH	BT075651, BT075600
Salmoniformes	Salmonidae	Salmo salar	Atlantic salmon	M + FW + BR	Temperate	10–23 (0–210)	MDHc, mMDH	BT060183, BT048216
Gadiformes	Macrouridae	Coryphaenoides acrolepis	Pacific grenadier	M	Deep-water	900–1300	MyHC, COI	AB330141, JQ354060
Gadiformes	Macrouridae	Coryphaenoides yaquinae	n.a.	M	Deep-water	3400–5800	MyHC, COI	AB330139, GU440291
Perciformes	Cirrhitidae	Paracirrhites forsteri	Blackside hawkfish	M	Tropical	5–35	MyHC, COI	AJ243770, HQ561521
Perciformes	Carangidae	Seriola dumerili	Greater amberjack	M	Subtropical	18–72 (1–360)	MyHC, COI	AB032020, KC015917
Gadiformes	Gadidae	Boreogadus saida	Polar cod	M + BR	Polar	0–400	Hb-A, Hb-B, COI	DQ125471, Q1AGS6, KC015250

All_gs454_00149 vs.	Diff.	Id. %	Gaps	Acc. nr.
Sphyraena idiastes	16	95.18	0	U80001
Rhinogobiops nicholsii	17	94.88	0	AF079534
Chromis caudalis	21	93.67	0	AY289558
Chionodraco rastrospinosus	26	92.17	1	AF079829
Notothenia coriiceps	27	91.87	1	AF079822
Fundulus heteroclitus	27	91.87	0	L43525
Cyprinus carpio	44	86.75	0	AF076528

All_gs454_00146 vs.	Diff.	Id. %	Acc. nr.
Oryzias latipes	15	95.50	NM_001163134
Sphyraena idiastes	20	93.99	AF390559
Osmerus mordax	30	90.99	BT075651
Salmo salar	35	89.49	BT060183

All_gs454_00104 vs.	Diff.	Id. %	Acc. nr.
Scomber scombrus 1	0	100	EF607093
Siniperca chuatsi 1	0	100	AY395872
Oryzias latipes 1	1	99.73	NM_001104806
Pleurogrammus azonus 1	1	99.73	AB073381
Coryphaenoides acrolepis 1	1	99.73	AB021649
Coryphaenoides cinereus 1	1	99.73	AB021651
Coryphaenoides armatus 2a	2	99.47	AB086240
Coryphaenoides yaquinae 2a	2	99.47	AB086242
Coryphaenoides acrolepis 2a	2	99.47	AB021650
Coryphaenoides cinereus 2a	2	99.47	AB021652
Cyprinus carpio 1	2	99.47	AY395870
Sphyraena idiastes 1	3	99.20	AF503593
Coryphaenoides armatus 2b	5	98.67	AB086241
Coryphaenoides yaquinae 2b	5	98.67	AB086243
Aphanopus carbo 2a	6	98.40	All_gs454_00102
Sparus aurata 1	9	97.60	AF190473

All_gs454_00143 vs.	Diff.	Id. %	Gaps	Acc. nr.
Lates calcarifer	14	95.81	0	FJ439507
Fundulus heteroclitus	21	93.71	0	L23792
Trachyrincus murrayi	47	85.93	0	AJ609235
Coryphaenoides armatus	50	85.03	0	AJ609232
Merlangius merlangus	52	84.43	1	AJ609234
Gadus morhua	55	83.53	1	AJ609233

All_gs454_00001 vs.	Diff.	Id. %	Gaps	Acc. nr.
Paracirrhites forsteri	53	94.95	0	AJ243770
Seriola dumerili	54	94.86	0	AB032020
Siniperca chuatsi	59	94.38	0	AY454304
Oryzias latipes	62	94.10	2	XM_004071618
Cyprinus carpio	69	93.43	2	D89992
Coryphaenoides acrolepis	86	91.82	1	AB330141
Notothenia coriiceps	86	91.82	1	AJ243767
Coryphaenoides yaquinae	89	91.53	1	AB330139
Coryphaenoides armatus	92	91.25	1	AB330140

Muscle				Brain				Spleen
Pfam ID	Domain	Freq	P	Pfam ID	Domain	Freq	P	Pfam ID	Domain	Freq	P
PF00992	Troponin	7	1.99 × 10⁻⁵	PF01669	Myelin MBP	4	4.03 × 10⁻⁵	PF00042	Globin	5	1.33 × 10⁻⁶
PF01410	COLFI	3	9.67 × 10⁻⁴	PF01453	B lectin	2	6.44 × 10⁻³	PF00078	RVT 1	5	1.33 × 10⁻⁶
PF02807	ATP-gua PtransN	3	3.59 × 10⁻³	PF00612	IQ	2	6.44 × 10⁻³	PF00993	MHC II alpha	4	5.02 × 10⁻⁶
PF00041	fn3	3	3.59 × 10⁻³	PF11414	Suppressor APC	2	6.44 × 10⁻³	PF07686	V set	5	2.49 × 10⁻⁶
PF02453	Reticulon	3	3.59 × 10⁻³	PF05768	DUF836	2	6.44 × 10⁻³	PF07654	C1 set	5	4.71 × 10⁻⁴
PF13895	Ig 2	4	7.97 × 10⁻³	PF05196	PTN MK N	2	6.44 × 10⁻³	PF00089	Trypsin	5	2.01 × 10⁻³
PF00365	PFK	2	9.84 × 10⁻³	PF04300	FBA	2	6.44 × 10⁻³	PF01391	Collagen	2	2.30 × 10⁻³
PF01216	Calsequestrin	2	9.84 × 10⁻³	PF00287	Na K-ATPase	2	6.44 × 10⁻³	PF00240	ubiquitin	3	3.28 × 10⁻³
PF01267	F-actin cap A	2	9.84 × 10⁻³	PF11032	ApoM	3	8.53 × 10⁻³	PF09307	MHC2-interact	2	6.67 × 10⁻³
PF00261	Tropomyosin	2	9.84 × 10⁻³	PF00061	Lipocalin	4	0.01	PF00643	Zf-B box	2	0.01
PF00856	SET	2	9.84 × 10⁻³	PF00007	Cys knot	2	0.02	PF13445	Zf-RING LisH	2	0.01
PF01661	Macro	2	0.03	PF01275	Myelin PLP	2	0.02	PF01498	HTH Tnp Tc3 2	2	0.02
PF01667	Ribosomal S27e	2	0.05	PF00300	His Phos 1	2	0.02	PF02301	HORMA	1	0.05
PF00036	efhand	2	0.05	PF00230	MIP	2	0.02	PF14259	RRM 6	1	0.05
PF01576	Myosin tail 1	2	0.08	PF00091	Tubulin	3	0.02	PF09004	DUF1891	1	0.05

Searching item	Numbers
Total number of sequences examined	7,920
Total size of examined sequences (bp)	4,235,839
Total number of identified SSRs	153
Number of SSR containing sequences	142
Number of sequences containing more than 1 SSR	9
Number of SSRs present in compound formation	8
Di-nucleotide	63
Tri-nucleotide	52
Tetra-nucleotide	35
Penta-nucleotide	3
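The SSR classes tallied above (di- to penta-nucleotide repeats) can be located with a simple regular-expression scan. This is a minimal sketch that detects perfect tandem repeats only; the minimum repeat counts and the function name are assumptions for illustration, as the search parameters used by the authors are not given in this section.

```python
import re

# Assumed minimum number of repeats per motif length (hypothetical values).
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4}

def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs.

    Motifs that are themselves repeats of a shorter unit (e.g. "AA" or
    "ACAC") are skipped, so homopolymer runs are ignored and each repeat is
    reported once under its shortest motif.
    """
    seq = seq.upper()
    hits = []
    for size, n_min in sorted(min_repeats.items()):
        # One copy of the motif followed by at least (n_min - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, n_min - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # A string is a repeat of a shorter unit exactly when it occurs
            # inside its own doubling with the first and last characters removed.
            if motif in (motif + motif)[1:-1]:
                continue
            hits.append((m.start(), motif, len(m.group(0)) // size))
    return sorted(hits)
```

Compound or imperfect repeats would need additional logic, such as merging hits that lie within a fixed distance of one another.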

All_gs454_01074 vs.	Diff.	Id. %	Gaps	Acc. nr.
Notothenia angustata	39	72.73	1	P62363²
Boreogadus saida	43	69.93	0	DQ125471
Arctogadus glacialis	44	69.93	0	DQ125475
Takifugu rubripes	46	67.83	0	XM_003964767
Gadus morhua	53	62.94	0	O42425²

All_gs454_01018 vs.	Diff.	Id. %	Acc. nr.
Epinephelus coioides	27	81.63	GU982530
Anoplopoma fimbria	31	78.91	BT082849
Gasterosteus aculeatus	31	78.91	NM_001267638
Boreogadus saida	40	72.79	Q1AGS6²
Notothenia angustata	42	71.43	P29628²
