Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2008 Jun;72(9):2071–2094. doi: 10.1111/j.1095-8649.2008.01876.x

A description of the origins, design and performance of the TRAITS–SGP Atlantic salmon Salmo salar L. cDNA microarray

J B Taggart *,, J E Bron *, S A M Martin , P J Seear §,, B Høyheim , R Talbot #, S N Carmichael #, L A N Villeneuve *, G E Sweeney §, D F Houlihan , C J Secombes , D R Tocher *, A J Teale *
PMCID: PMC2610384  PMID: 19125201

Abstract

The origins, design, fabrication and performance of an Atlantic salmon microarray are described. The microarray comprises 16 950 Atlantic salmon-derived cDNA features, printed in duplicate and mostly sourced from pre-existing expressed sequence tag (EST) collections [SALGENE and salmon genome project (SGP)] but also supplemented with cDNAs from suppression subtractive hybridization libraries and candidate genes involved in immune response, protein catabolism, lipid metabolism and the parr–smolt transformation. A preliminary analysis of a dietary lipid experiment identified a number of genes known to be involved in lipid metabolism. Significant fold change differences (as low as 1·2×) were apparent from the microarray analysis and were confirmed by quantitative real-time polymerase chain reaction analysis. The study also highlighted the potential for obtaining artefactual expression patterns as a result of cross-hybridization of similar transcripts. Examination of the robustness and sensitivity of the experimental design employed demonstrated the greater importance of biological replication over technical (dye flip) replication for identification of a limited number of key genes in the studied system. The TRAITS (TRanscriptome Analysis of Important Traits of Salmon)–salmon genome project microarray has been proven, in a number of studies, to be a powerful tool for the study of key traits of Atlantic salmon biology. It is now available for use by researchers in the wider scientific community.

Keywords: Atlantic salmon, DNA microarray, gene expression, immune response, lipid metabolism, smoltification

Introduction

The Atlantic salmon (Salmo salar L.) is an important farmed fish species throughout its native range (western Europe and east coast North America). Over the past 10 years, a substantial Atlantic salmon mariculture industry has also been established in Chile. In indigenous areas, the wild species also underpins valuable sectors of the rural economy founded on sport and ecotourism. Its biology is unusual in that, as an anadromous species, it adapts to very different environments in terms of temperature and salinity at different stages in its life cycle. Atlantic salmon is a high-value food source providing quality protein and oils, and together with other oily fish, it is the most important source of essential omega-3 fatty acids in the human diet. Data compiled from FAO UN database (FishStat Plus) indicate that farmed Atlantic salmon production in the world exceeds 1·2 million tonnes per annum.

Currently, the four most important constraints on commercial production of this species are (a) supply of dioxin-free highly unsaturated oils for the salmon diet, (b) protein growth efficiency, (c) infectious disease and (d) a long and complex life cycle. In 2002, research groups from three U.K. universities (Aberdeen, Cardiff and Stirling), specializing in different aspects of Atlantic salmon biology, formed a partnership to develop and exploit transcriptomic resources to explore the molecular basis of the biology underlying these constraints. The goal of TRAITS (TRanscriptome Analysis of Important Traits of Salmon, http://www.abdn.sfirc/salmon) was to bolster the sustainability of Atlantic salmon farming through identification of genes and metabolic pathways influencing traits that are important in terms of (a) efficiency and sustainability of farm production, (b) welfare of farmed stocks and (c) quality and nutritional value of salmon products for the consumer. This goal was to be achieved through selection of a set of key indicator genes associated with the traits of interest, in order to form the basis of a prototype DNA chip for monitoring salmon health and performance. The underlying strategy (Fig. 1) was to design a primary cDNA microarray based on extant expressed sequence tag (EST) collections together with novel ESTs drawn from subtracted libraries associated with targeted laboratory and, or field ‘challenges’. RNA samples derived from these and other challenges would be interrogated by the cDNA array to identify candidate responder genes and gene pathways. A second more focused oligonucleotide array, comprising mainly responder genes, would then be fabricated and initially validated by interrogation of the same samples that were hybridized to the cDNA array.

Fig. 1.

Fig. 1

Overview of TRanscriptome Analysis of Important Traits of Salmon strategy to generate both a general cDNA and a focused oligonucleotide microarray. qRT-PCR, quantitative real-time polymerase chain reaction analysis. EST, expressed sequence tag; qRT-PCR quantitative real-time polymerase chain reaction.

The development of the TRAITS cDNA microarray was initially reliant upon a c. 11 k EST collection from the European Union (EU)-funded SALGENE project (‘Construction of a genetic body map for Atlantic salmon’; FAIR CT98 4314), in which Stirling had been a partner. However, prior to project start up, a formal collaboration with the Norwegian Salmon Genome Project (SGP, http://www.salmongenome.no) was developed that allowed access to a more extensive physical EST resource (Hagen-Larsen et al., 2005; Adzhubei et al., 2007). The TRAITS–SGP cDNA microarray described in this paper is the product of this collaboration.

Several cDNA microarray platforms have been established for salmonid fish with varying numbers of features: 1380 features (Koskinen et al., 2004), 3700 features (Rise et al., 2004), 4104 features (Ewart et al., 2005), 79 features (Jordal et al., 2005) and 16 008 features (von Schalburg et al., 2005). A number of different biological processes have been examined by microarray in salmonid fish. These include immune responses to bacterial infections (Rise et al., 2004; Ewart et al., 2005; von Schalburg et al., 2005; MacKenzie et al., 2006; Martin et al., 2006; Wynne et al., 2008), viral infections (Purcell et al., 2006) and fungal infections (Roberge et al., 2007). Aspects of physiology have also been examined including nutritional states (Jordal et al., 2005), mobilization of energy reserves (Salem et al., 2006) and stress responses (Cairns et al., 2008). In the above studies, varying complexities of experimental design were employed, using different numbers of replicates and often only genes showing two-fold or higher differences in expression were selected for further scrutiny and, or characterization.

It is widely accepted that the supply of fish meal and oils for the diets of farmed fish is not sustainable (Sargent & Tacon, 1999). One consequence of this has been an increased emphasis on the development of diets based on vegetable oil (VO), which are suitable for aquaculture. A key aspect of salmonid lipid metabolism that is being extensively investigated in this regard is the conversion of C18 polyunsaturated fatty acids (PUFAs), found in VO, to the C20 and C22 highly unsaturated fatty acids (HUFAs), eicosapentaenoate and docosahexaenoate, which are the specific omega-3 fatty acids responsible for the health-promoting properties of fish and fish oil (FO). These components are critical to the maintenance of nutritional quality of farmed fish. Two key enzymes involved in this pathway, Δ5 and Δ6 fatty acyl desaturase (FAD), have been characterized in depth (Hastings et al., 2004; Zheng et al., 2005a). Both these genes have been shown, by quantitative real-time polymerase chain reaction analysis (qRT-PCR), to exhibit diet-dependent differential gene expression (Zheng et al., 2004, 2005a, b), although detected fold change differences are minimal (1·3–2·0).

In this paper, design and fabrication details relating to the TRAITS–SGP cDNA microarray are reported. Results of a preliminary transcriptomic analysis from a dietary lipid feeding trial are used to explore both the robustness and the sensitivity of analysis that may be achieved by the use of this microarray.

Materials and methods

cDNA resources

Archived ESTs

Two main EST collections (SALGENE c. 11 k clones and SGP c. 30 k clones) were available to the TRAITS consortium in 2004. In all cases, fish were sourced from farm stocks of European origin, and library construction began with mRNAs from tissues being used as template for oligo(dT)-primed reverse transcription. The SALGENE resource comprised ESTs from seven tissue-specific non-normalized libraries and two tissue-specific normalized libraries, with all cDNAs being directionally cloned into vectors. Details of tissues used, life-history stage (juvenile, i.e. freshwater phase, or adult, i.e. marine phase) and cloning systems employed are given in Table I. Non-normalized library construction has been detailed elsewhere (Martin et al., 2002; Hagen-Larsen et al., 2005; Adzhubei et al., 2007). Insert size varied among libraries but ranged from 300 base pairs (bp) to 4 kbp. Single-pass sequence data (5′-end) were available for all clones. Normalized libraries were made in M. B. Soares’ laboratory, University of Columbia, U.S.A., following their standard methodology (Bonaldo et al., 1996). Single-pass sequence data available for these clones were a mixture of both 5′- and 3′-end reads. SGP clones were derived from 14 tissue-specific non-normalized libraries: brain, eye, gill, head kidney, heart, intestine, kidney, liver, white muscle, ovary, skin, spleen, swimbladder and testis. All tissues were sampled from parr (freshwater phase). The cDNAs were directionally cloned into pBlueScript II SK(+) XR phagemid vector and transformed into XL10-Gold host cells (Hagen-Larsen et al., 2005; Adzhubei et al., 2007). EST data comprised single-pass 5′-end sequences.

Table I.

Details of the expressed sequence tag (EST) libraries used to construct the TRAITS–SGP cDNA microarray

Source Tissue Environment Host cells Vectors
SALGENE Liver, testis and ovary Sea water XLOLR pBK-CMV
SALGENE Spleen and kidney Sea water SOLR pBlueScript II SK(−)
SALGENE Gill and intestine Fresh water XL10-Gold pBlueScript II SK(+) XR
SALGENE White muscle and brain Sea water SURE pT7T3-Pac
TRAITS Liver, kidney, gill and white muscle Fresh water ElectroTen; Blue pGEM T-easy
TRAITS Liver, kidney, brain, pituitary and gill Sea water ElectroTen; Blue pGEM T-easy
SGP Brain, eye, gill, head kidney, heart, intestine, kidney, liver, white muscle, ovary, skin, spleen, swimbladder and testis Fresh water XL10-Gold pBlueScript II SK(+) XR

SGP, salmon genome project; TRAITS, TRanscriptome Analysis of Important Traits of Salmon.

Trait-specific enriched libraries

All enrichments used a standard approach – suppression subtractive hybridization (SSH, PCR-Select cDNA Subtraction Kit; Clontech, Mountain View, CA, U.S.A.). Non-directional cloning was subsequently used to insert subtracted cDNA fragments into the pGEM T-Easy vector (Promega, Madison, WI, U.S.A.). Resultant cDNA fragments generally ranged in size between 150 and 700 bp.

Pathogen-induced libraries

A bacterial challenge was undertaken using Aeromonas salmonicida, the bacterial pathogen responsible for furunculosis of salmon. Three tissue-specific enriched libraries (head kidney, gill and liver) were constructed (Martin et al., 2006). Fish were anaesthetized with benzocaine (20 mg l−1; Sigma-Aldrich, St Louis, MO, U.S.A.) and injected intraperitoneally with 100 μl (109 CFU ml−1) of a genetically attenuated strain (aroA) of A. salmonicida(Brivax II; Sigma-Aldrich) (Marsden et al., 1996) in phosphate-buffered saline (PBS) or 100 μl of PBS as control. Brivax II is a non-virulent strain but acts in a similar manner to the intact virulent pathogen; however, after several rounds of replication, the fish clears the bacteria. Intraperitoneal injection of Brivax II induces a protective immune response, with fish resistant to a later challenge with virulent strains. The two groups of fish were kept separately and RNA was pooled from 10 challenged fish and 10 control fish at 24 and 48 h after injection. In each case, the challenged RNA was ‘tester’ and the control ‘driver’. Approximately 500 clones from each library were sequenced, a mean redundancy of c. 33% being observed. Following basic local alignment search tool (BLAST)X sequence homology matching, 20, 23 and 50% of genes had sequences homologous to immune-associated genes for head kidney, gill and liver, respectively (Martin et al., 2006).

Starvation-induced libraries

Two tissue-specific enriched libraries (white muscle and liver) were constructed following a starvation trial. RNA was pooled from 10 fish starved for 14 days and from 10 fish fed ad libitum. RNA from the starved pool was used as ‘tester’ and RNA from fed fish as ‘driver’. For genes enriched following short-term starvation, a highly heterogeneous group of genes was found, as many different biological processes were altered by this treatment including those related to protein turnover. For the library generated from liver, 92% of the sequences were found to have homologies following BLASTX searches. Key groups of genes represented in the library-encoded metabolic enzymes, serum proteins and immune response genes, with other minor groupings being iron-binding proteins, globins and factors involved in transcription and translation. For the genes enriched in muscle following starvation, 77% had BLASTX homologies, with key groups of sequences encoding metabolic enzymes, structural proteins and transcription and translation factors (10%) and minor groups including heat shock proteins.

Diet response libraries

Atlantic salmon were fed from first feeding on diets containing either FO (capelin oil) or a 25% FO:75% blended VO diet (see Feed Trial). Four subtracted liver cDNA libraries [two time-points – 52 weeks (pre-smolt–fresh water) and 55 weeks (post-smolt–sea water) and two directions, FO driver and VO driver] were made. Pooled RNA from 12 (pre-smolt) and four (post-smolt) female fish on each diet was used. A total of 768 clones were sequenced. All four libraries were found to be highly redundant; 10 fragments comprised c. 40% of all sequences. BLASTX analyses gave significant hits (e value < e−20) for 79 (54%) of the 145 different sequences, although only one of these (catfish fatty acid-binding protein) appeared to be directly related to lipid metabolism.

Smoltification response libraries

Four tissue-specific SSH libraries enriched in genes upregulated in brain, pituitary, kidney and gill of smolts (seawater phase) in comparison to parr (freshwater phase) were made. Tissues were dissected from parr and smolt in November–December 2002 and April–May 2003. A total of 380 clones from each of the four libraries were sequenced. Between 32 and 50% of the sequenced clones were identified by BLASTX sequence homology searches. All libraries had considerable redundancy (28% brain, 56% gill, 56% kidney and 86% pituitary). Significantly, the single most abundant sequence in the gill subtractive library corresponded to Na+ and K+-ATPase whose levels of activity are a key indicator of smoltification status.

Candidate and other genes

A third, minor source of cDNAs for the array was a small collection of full-length genes or gene fragments in plasmid constructs that were already possessed by the partners. These included both candidate genes and also other genes with no known relevance to the specific traits of interest (Table II). Being better characterized than the EST clones, these constructs were potentially useful as reference genes on the microarray. Of note, with regard to the research reported in this paper, is the inclusion of three cDNA fragments [the open reading frame (ORF) and two 3′-untranslated region (UTR) fragments] from both Atlantic salmon FAD genes (Δ5 FAD and Δ6 FAD).

Table II.

Pre-identified candidates and other reference genes

Gene name cDNA lengths (bp) Notes
Apolipoprotein B 1402 Partial with 3′-UTR
Carnitine palmitoyltransferase 1 823 Partial
Carotene dioxygenase 872 Partial
Oestrogen receptor-alpha 2900 Partial with 3′-UTR
Growth hormone receptor 340 Partial, PCR fragment
Glyceraldehyde phosphate dehydrogenase 1086 Full length
Heat shock protein P70 830 Partial with 3′-UTR
Homogentisate dioxygenase 952 Partial with 3′-UTR
Insulin-like growth factor-1 230 Partial
Interferon-gamma 1132 Full length
Interleukin-1 beta 790 ORF
NGF1-B 224 Partial (RACE fragment)
Pituitary-specific transcription factor 1 250 Partial
Peroxisome proliferator-activated receptor-alpha 1644 Full length
Peroxisome proliferator-activated receptor-beta 1 1462 Full length
Peroxisome proliferator-activated receptor-beta 2 779 Partial
Peroxisome proliferator-activated receptor-gamma 1665 Full length
Polyunsaturated fatty acid elongase 950 ORF
Retinoic acid receptor-alpha 840 ORF (RACE fragment)
Retinoic acid receptor-gamma 440 Partial inc 5′-UTR
Retinaldehyde dehydrogenase type 2 922 Partial
TNF-α induced adipose related protein 483 Partial
Thyroid hormone receptor-alpha c. 2000 Partial inc 5′-UTR
Thyroid hormone receptor-beta c. 900 Partial inc 5′-UTR
Vitamin D3 receptor 360 Partial inc 5′-UTR
Δ5 FAD 408 3′-UTR fragment
Δ5 FAD 881 3′-UTR fragment
Δ5 FAD 1365 ORF
Δ6 FAD 384 3′-UTR fragment
Δ6 FAD 401 3′-UTR fragment
Δ6 FAD 1365 ORF

FAD, fatty acid desaturase; inc, including; NGF, neuronal growth factor; ORF, open reading frame; PCR, polymerase chain reaction; RACE, rapid amplification of cDNA ends; TNF, tumour necrosis factor; UTR, untranslated region.

Sequence clustering and probe selection

All sequence data derived from the above resources, together with 57 k Atlantic salmon sequences available in silico from GenBank (National Center for Biotechnology Information–NCBI) in July 2004, were clustered using the The Institute of Genomic Research (TIGR) gene indices (TGI) clustering tools (Pertea et al., 2003). The process of clone selection for the cDNA microarray is summarized in Fig. 2. Approximately equal numbers (c. 9 k) of contigs and singletons were identified as having accessible clones. Contig sequence lengths varied from 135 to 3804 bp (mean 956 bp) and comprised from 2 to 651 clone sequences (mean 9·1 sequences per contig). Because the overall number (c. 18 k potential features) was comfortably within the printing capacity of the microarray spotter, no further clone selection or refinement was undertaken. A single representative clone from each contig was selected for inclusion on the microarray. The only selection criteria applied were (a) where possible, a SALGENE clone was selected in preference to an SGP clone because the entire SALGENE resource was archived at the printing site (ARK Genomics, Roslin Institute, U.K.) and (b) clones were selected from non-normalized or normalized libraries in preference to SSH-derived cDNAs in order to take advantage of longer transcripts. Within this defined sub-set, a clone was selected at random for inclusion on the array. There was no intentional selection of a 5′- or 3′-biased clone from within each contig.

Fig. 2.

Fig. 2

Summary of clustering procedure and probe selection for the cDNA microarray. SGP, salmon genome project; TRAITS, TRanscriptome Analysis of Important Traits of Salmon.

Microarray fabrication

Bacterial cultures were grown from archived samples and cDNA inserts amplified directly from these using vector-specific primers. Initially, various primer sets were used, dependent on the specific vector employed. However, a generic set (BSKS-F: CGATTAAGTTGGGTAACGC and BSKS-R: CAATTTCACACAGGAAACAG) was found to work with all but one vector. For pBK-CMV constructs, T3 (AATTAACCCTCACTAAAGGGA) and T7 (TAATACGACTCACTATAGGG) primers were employed. PCR reactions (50 μl) were purified by passing them through a 384-well multiscreen filter plate (Millipore, Watford, U.K.). Amplicons were electrophoresed through a 1% agarose gel and stained with ethidium bromide. Only PCR products producing a clear singlet band were considered for spotting. Amplicons were quantified using PicoGreen assay (Invitrogen, Madison, WI, U.S.A.). Features were printed onto aminosilane-coated glass slides (Corning GAPS II; Corning Inc., Corning, NY, U.S.A.) using a MicroGrid II printer (Genomic Solutions, Holiston, MA, U.S.A.). DNA was resuspended in printing buffer (150 mM sodium phosphate buffer, 0·1% sodium dodecyl sulphate (SDS), pH 8·0) to a concentration of c. 150 ng ml−1and spotted with a 48 pin tool (Biorobotics 10 k pins; Biorobotics Ltd, Cambridge, U.K.). Mean spot diameter was 110 μm. The slide format comprised 48 sub-arrays, each consisting of 27 columns and 28 rows. Each cDNA was printed in duplicate, with duplicate features being printed non-adjacently within the same sub-array. In addition to these 16 950 Atlantic salmon cDNAs, a number of control features were printed across the microarray to maximize its flexibility in terms of possible experimental applications. These control features comprised sonicated Atlantic salmon genomic DNA (96 features), sheared salmon sperm DNA (Oncorhynchus sp. derived – Sigma; 96 features), four different SpotReport® (Stratagene, La Jolla, CA, U.S.A.) controls – namely PCR products 1–3 (Cab, RCA and rbcL genes from Arabidopsis thaliana) and human β-actin PCR fragment (20 features each) and spotting buffer (192 features). In addition, each sub-array had two Cy3 spots (landing lights) located at the upper left corner to aid orientation of the slide during grid placement and spot finding procedures. The remaining 1826 locations on the 36 288 spot grid were left blank. Following printing, DNA spots were fixed by baking at 80° C for 2 h. Prior to hybridization, microarray slides were treated using succinic anhydride and 1-methyl-2-pyrrolidinone (Sigma) to block unbound amino groups (slide manufacturer's recommended protocol) and denatured by incubation at 95° C in MilliQ water for 2 min. Slides were then rinsed twice in isopropanol, centrifuged to dry and stored in a desiccated environment until required. Details of the TRAITS cDNA microarray have been submitted to the ArrayExpress platform (http://www.ebi.ac.uk/arrayexpress) under accession number A-MEXP-664. The GAL file is also available for inspection from the TRAITS–SGP microarray website (http://www.traitsdb.stir.ac.uk/). All clones used on the microarray are archived in 384-well plates as glycerol stocks in two locations (ARK Genomics Facility, Roslin Institute, U.K., and SGP Genetics Laboratory, University of Oslo).

EST annotation

Gene identification was carried out by BLAST searches (Altschul et al., 1990) of appropriate databases, (NCBI) non-redundant (nr) nucleotide and protein databases, and interrogation of the TIGR Atlantic salmon Gene Index, release 3.0 (http://biocomp.dfci.harvard.edu/tgi/cgi-bin/tgi/gimain.pl?gudb=salmon). Gene ontology (GO) identifiers were obtained through Blast2GO (Conesa et al., 2005).

Sensitivity of microarray protocol

Feed trial

The effect of replacing FO with VO at a replacement level of 75% was investigated in Atlantic salmon in a trial conducted over an entire 2 year production cycle (Torstensen et al., 2005). Briefly, the two diets were fed to triplicate tanks and cages at Marine Harvest Ltd., facilities at Invergarry (fresh water) and Loch Duich, Lochalsh (sea water), U.K. Atlantic salmon fry were distributed randomly into six tanks (3 m × 3 m, depth 0·5 m) at a stocking level of 3000 per tank and weaned onto extruded feeds containing 20% added oil, which was either FO (capelin oil) or a VO blend, containing rapeseed, palm and linseed oils in a 3·7:2:1 ratio, replacing 75% of the FO. This VO blend was formulated to mimic FO in saturated and monounsaturated fatty acid content but with C18 PUFA replacing n-3 HUFA. Fish were fed the diets described above for 1 year until seawater transfer, at which point fish (mean mass c. 50 g) were transferred into 5 m × 5 m net pens at 700 fish pen−1. The fish were fed the same diet in sea water as in fresh water although the dietary oil levels were increased to 25% (3 mm pellet), rising to 32% (9 mm pellets) through the year-long seawater phase. The diets aimed to be practical and were formulated and manufactured by Skretting ARC (Stavanger, Norway) according to current practices in the salmon-feed industry. All diets were formulated to satisfy the nutritional requirements of salmonid fish (NRC, 1993).

Sample preparation

Fish fed on each diet were sampled at two time-points during freshwater rearing (at 36 and 52 weeks post-hatch, the latter just 1 week before transfer to sea) and at a further two time-points in sea water (at 55 and 86 weeks post-hatch). Twenty-four liver samples per dietary treatment and time-point were collected. Total RNA was isolated by organic solvent extraction (TriReagent; Sigma) following the manufacturer's protocol. Spectrophotometry (Nanodrop, Wilmington, DE, U.S.A.) and electrophoresis (Bioanalyser 2100; Agilent Technologies, Santa Clara, CA, U.S.A.) were used to quantify and assess the quality of the RNAs, respectively. For the transcriptomic analysis, equal amounts of RNA from four individuals (two males + two females) were pooled to produce six biological replicates per diet per time-point. Each pooled RNA sample was further cleaned by mini spin-column purification (RNeasy; Qiagen, Valencia, CA, U.S.A.) and was re-quantified and quality assessed as above.

Experimental design

Each biological replicates was co-hybridized in a two-dye experiment with a single pooled reference sample. This design permits valid statistical comparisons across both diets and time-points to be made. The pooled reference sample comprised equal amounts of RNA from each of the 48 biological replicate samples. A dye-swap procedure was incorporated to mitigate selective binding and scanning artefacts. Thus, the entire experiment comprised 96 separate hybridizations (two diets × four time-points × six biological replicates × two dye swaps).

Labelling and hybridization protocols

Because of the large number of hybridizations, not all hybridizations could be completed at the same time. Samples were therefore randomized and processed in two batches 1 week apart. RNA was reverse transcribed and labelled with either Cy3 or Cy5 fluors using the FAIRPLAY II cDNA indirect labelling kit (Stratagene) according to the manufacturer's instructions. Briefly, 20 μg total RNA was reverse transcribed after being primed with oligo dT, which incorporated aminoallyl-dUTP into the synthesized cDNA strand. The RNA template was then hydrolysed using 1 M NaOH for 15 min and neutralized with 1 M HCl. The cDNA was NaAce–ethanol precipitated overnight. cDNA pellets were washed in 80% ethanol and air-dried before being resuspended in 5 μl of 2 × coupling buffer. Once the cDNA had fully dissolved (after at least 30 min), 5 μl of either Cy3 or Cy5 dye was added and the samples were incubated in the dark for 30 min. The Cy3 and Cy5 dyes (Amersham Pharmacia, Little Chalfont, U.K.) were dissolved in 45 μl dimethyl sulphoxide (DMSO) prior to being added to the coupling buffer. To remove unincorporated dye, the labelled cDNA (total volume 10 μl) was passed through a SpinEX column (Qiagen). Dye incorporation was assessed by separating 1 μl of the sample on a mini agarose gel and visualizing fluorescent products on a microarray scanner (Perkin Elmer ScanArray 5000XL; Perkin-Elmer, Wellesley, MA, U.S.A.). No pre-hybridization step was required. For hybridization, the remainder of each labelled cDNA (7–9 μl, 16–30 pmol each dye) was added to 85 μl hybridization buffer (UltraHyb; Ambion; Austin, TX, U.S.A.), 10 μl poly(A)80 (10 mg ml−1; Sigma) and 5 μl ultrapure BSA (10 mg ml−1; Ambion). The hybridization mixture was heated to 95° C for 3 min, then cooled to 60° C before being applied to the microarray. Hybridizations (48 slides per day) were performed on a Gene TAC Hyb Station (Genomic Solutions) for 16 h at 45° C. Slides were then automatically washed with 2 × SSC and 0·5% SDS for 10 min at 60° C, 0·2 × SSC and 0·5% SDS for 10 min at 42° C and finally 0·2 × SSC for 10 min at 42° C. Slides were then manually rinsed in isopropanol and dried by centrifugation before being scanned.

Data acquisition and analysis

Hybridized slides were scanned at 10 μm resolution using a Perkin Elmer ScanArray Express HT scanner. BlueFuse software (BlueGnome, Cambridge, U.K.) was then used to identify features and extract fluorescence intensity values from the resultant TIF images. Following a manual spot removal procedure and fusion of duplicate spot data (BlueFuse proprietary algorithm), the resulting fluorescence intensity data and quality annotations for the 16 950 gene features were exported into the GeneSpring GX version 7.3.1 analysis platform (Agilent Technologies). All control features (positive, negative, landing lights, etc.) were excluded from subsequent analyses. Data transformation, normalization and quality filtering were as follows: (a) all intensity values <0·01 were set to 0·01, (b) a ‘per spot per chip’ intensity-dependent (Lowess) normalization was undertaken using software defaults (20% smoothing/cutoff 10) and (c) data were filtered using a BlueFuse spot confidence value >0.1 in ≥24 slides and BlueFuse spot quality of ≥0·5 in ≥24 slides. This gave a final list of 11 800 genes that were eligible for statistical analysis. Experimental annotations complied fully with minimum information about a microarray experiment (MIAME) guidelines (Brazma et al., 2001). The experimental hybridizations are archived on the EBI ArrayExpress database (http://www.ebi.ac.uk/arrayexpress/) under accession number E-TABM-449.

Hybridization data were analysed by two-way ANOVA, which examined the explanatory power of the variables ‘time-point’ and ‘diet’ and the interaction between the two and incorporated a Benjamini & Hochberg (1995) multiple test correction (P ≤ 0·05).

qRT-PCR validation

Quantitative real-time-PCR was performed as described by Villeneuve et al. (2005). Primer details are given in Table III. Relative expression ratios were statistically compared between diet samples following normalization against three housekeeping genes, using REST software (Pfaffl, 2001; Pfaffl et al., 2002). Two of the three housekeeping genes (β-actin and elongation factor-1α) are widely used as reference genes for salmonid quantitative PCR (qPCR) studies (Olsvik et al., 2005). From the microarray analysis, a third reference was selected, an anonymous cDNA feature on the TRAITS–SGP microarray that was identified as a flatliner both between diets and over all time-points. Differences in gene expression between diet samples were evaluated in group means using randomization tests (REST software), which employed 5000 random allocations and considered differences to be significant at P < 0·05.

Table III.

Primer details for both lipid metabolism and control genes used in quantitative real-time polymerase chain reaction assay. Annealing temperature of 60° C was used in all cases

Gene identity Primer 1 (5′ → 3′) Primer 2 (5′ → 3′)
Lipid metabolism genes
Δ5 FAD (3′-UTR) GTGAATGGGGATCCATAGCA AAACGAACGGACAACCAGA
Δ6 FAD (3′-UTR) CCCCAGACGTTTGTGTCAG CCTGGATTGTTGCTTTGGAT
Lathosterol oxidase CACTAACCTTATTCCATCGGCTACTT TTTCCCTTCCTTTTACAGACATCAAT
Squalene monooxygenase TGATCTCGGCTACTTTTTGTTTTTG GCCGCCAGGATTATCTCTTTGT
Isopentenyl-diphosphate Δ-isomerase* ACAGCCCTATGGTTATGTGTCATCTC CAAGGTGAGGCGAATGTTTGAAC
‘Housekeeping’ reference genes
β-actin ACATCAAGGAGAAGCTGTGC GACAACGGAACCTCTCGTTA
Elongation factor-1α CTGCCCCTCCAGGACGTTTACAA CACCGGGCATAGCCGATTCC
Unidentified liver EST* AGCCTATGACCAACCCACTG TGTTCACAGCTCGTTTACCG

FAD, fatty acyl desaturase; EST, expressed sequence tag; UTR, untranslated region.

*

Identified as a flatliner from microarray analysis of same sample set.

Results

Microarray feature overview

cDNAs derived from 15 different tissue sources are represented among the 16 950 Atlantic salmon gene features printed on the TRAITS–SGP cDNA microarray (Table I). Of these, c. 9% are from SSH libraries. Brain cDNAs predominate (15% of total). This reflects the large number of ESTs generated for this tissue because of (a) brain libraries being included in all three of the EST resources available, (b) increased sequencing effort being focused on these libraries due to perceived diversity of gene expression within brain tissue and (c) availability of a normalized brain EST library. BLASTX homology searches of contig and singleton sequences (Table IV) revealed close to 40% of features having a weak hit (e value ≥ e−10) or no hit at all to the NCBI nr protein database. Among the 10 399 features with a significant BLASTX hit (e value < e−10), 6762 (65%) nominally different genes were identified. GO annotations were obtained for 7749 features. The most prevalent GO annotations on the microarray are summarized in Table V. Individual feature annotations can be accessed from the TRAITS–SGP microarray website (http://www.traitsdb.stir.ac.uk/).

Table IV.

Summary of expressed sequence tag (EST) tissue representation and BLASTX analyses for the cDNA microarray

BLASTX hits (NCBI nr protein database January 2007)

Tissue Number of ESTs SSH derived (%) e values < e−40 (%) e values < e−10 (%) e value ≥ e−10/no hit (%)
Brain 2524 7 44 62 38
Eye 1609 28 44 56
Gill 1440 24 37 60 40
Head kidney 801 40 58 42
Heart 1111 40 58 42
Intestine 1140 47 71 29
Kidney 1400 28 41 60 40
Liver 720 42 56 78 22
White muscle 1621 14 48 68 32
Ovary 1432 45 66 34
Pituitary 79 100 20 34 66
Skin 134 40 59 41
Spleen 1275 35 57 43
Swimbladder 287 34 51 49
Testis 1299 44 65 35
Unknown 47 39 58 37
Totals 16919 9 41 61 39

BLAST, basic local alignment search tool; EST, expressed sequence tag; NCBI, National Center for Biotechnology; nr, non-redundant; SSH, suppression subtractive hybridization.

Table V.

The top 60 GO terms by prevalence from both biological and molecular ontologies (level 3) represented on the TRAITS–SGP cDNA microarray (significant BLASTX hit defined as e value < e−10)

GO ontology: biological process-level 3

GO term GO description Features*
GO:0044237 Cellular metabolic process 3693
GO:0044238 Primary metabolic process 3489
GO:0043170 Macromolecule metabolic process 2904
GO:0050789 Regulation of biological process 1526
GO:0050794 Regulation of cellular process 1354
GO:0006810 Transport 1215
GO:0016043 Cell organization and biogenesis 1159
GO:0007154 Cell communication 1148
GO:0009058 Biosynthetic process 1030
GO:0019222 Regulation of metabolic process 903
GO:0006091 Generation of precursor metabolites and energy 571
GO:0009056 Catabolic process 468
GO:0006807 Nitrogen compound metabolic process 446
GO:0048869 Cellular developmental process 431
GO:0051641 Cellular localization 426
GO:0051649 Establishment of cellular localization 423
GO:0007275 Multicellular organismal development 404
GO:0048856 Anatomical structure development 404
GO:0045184 Establishment of protein localization 385
GO:0048468 Cell development 357
GO:0006950 Response to stress 340
GO:0016265 Death 312
GO:0007049 Cell cycle 310
GO:0022402 Cell cycle process 246
GO:0008283 Cell proliferation 219
GO:0042221 Response to chemical stimulus 216
GO:0007155 Cell adhesion 210
GO:0009653 Anatomical structure morphogenesis 194
GO:0050877 Neurological process 182
GO:0009605 Response to external stimulus 182
GO:0046903 Secretion 169
GO:0006955 Immune response 157
GO:0065009 Regulation of a molecular function 152
GO:0006952 Defence response 131
GO:0006928 Cell motility 130
GO:0051674 Localization of cell 130
GO:0009607 Response to biotic stimulus 102
GO:0009719 Response to endogenous stimulus 95
GO:0051239 Regulation of multicellular organismal process 92
GO:0019725 Cell homeostasis 79
GO:0006936 Muscle contraction 78
GO:0065008 Regulation of biological quality 72
GO:0015976 Carbon utilization 66
GO:0016049 Cell growth 61
GO:0040008 Regulation of growth 61
GO:0051301 Cell division 60
GO:0002252 Immune effector process 56
GO:0019748 Secondary metabolic process 54
GO:0007610 Behaviour 48
GO:0050878 Regulation of body fluids 48
GO:0019953 Sexual reproduction 47
GO:0000051 Urea cycle intermediate metabolic process 45
GO:0002682 Regulation of immune system process 41
GO:0051707 Response to other organism 40
GO:0002253 Activation of immune response 40
GO:0040012 Regulation of locomotion 31
GO:0050793 Regulation of developmental process 33
GO:0009628 Response to abiotic stimulus 36
GO:0048646 Anatomical structure formation 36
GO:0050817 Coagulation 38
GO ontology: molecular function-level 3
GO:0005515 Protein binding 2223
GO:0003676 Nucleic acid binding 1293
GO:0043167 Ion binding 1150
GO:0016787 Hydrolase activity 1122
GO:0000166 Nucleotide binding 1051
GO:0016740 Transferase activity 674
GO:0016491 Oxidoreductase activity 584
GO:0004871 Signal transducer activity 462
GO:0003735 Structural constituent of ribosome 261
GO:0003700 Transcription factor activity 240
GO:0004857 Enzyme inhibitor activity 175
GO:0016874 Ligase activity 169
GO:0008289 Lipid binding 145
GO:0048037 Cofactor binding 114
GO:0016853 Isomerase activity 113
GO:0008135 Translation factor activity, nucleic acid binding 109
GO:0030246 Carbohydrate binding 106
GO:0003712 Transcription cofactor activity 99
GO:0016829 Lyase activity 98
GO:0008047 Enzyme activator activity 85
GO:0030695 GTPase regulator activity 68
GO:0016563 Transcriptional activator activity 67
GO:0016564 Transcriptional repressor activity 65
GO:0004386 Helicase activity 65
GO:0004803 Transposase activity 63
GO:0046906 Tetrapyrrole binding 62
GO:0003702 RNA polymerase II transcription factor activity 57
GO:0005200 Structural constituent of cytoskeleton 51
GO:0042277 Peptide binding 40
GO:0019207 Kinase regulator activity 39
GO:0019842 Vitamin binding 36
GO:0005201 Extracellular matrix structural constituent 36
GO:0003682 Chromatin binding 36
GO:0001871 Pattern binding 36
GO:0008307 Structural constituent of muscle 30
GO:0004601 Peroxidase activity 28
GO:0005212 Structural constituent of eye lens 26
GO:0019208 Phosphatase regulator activity 25
GO:0019825 Oxygen binding 24
GO:0008430 Selenium binding 23
GO:0003777 Microtubule motor activity 21
GO:0005496 Steroid binding 19
GO:0019239 Deaminase activity 18
GO:0051540 Metal cluster binding 18
GO:0043021 Ribonucleoprotein binding 13
GO:0008144 Drug binding 12
GO:0019840 Isoprenoid binding 10
GO:0042165 Neurotransmitter binding 9
GO:0005199 Structural constituent of cell wall 9
GO:0043028 Caspase regulator activity 7
GO:0042979 Ornithine decarboxylase regulator activity 6
GO:0015238 Drug transporter activity 6
GO:0030249 Guanylate cyclase regulator activity 6
GO:0030235 Nitric-oxide synthase regulator activity 5
GO:0008639 Small protein conjugating enzyme activity 5
GO:0003711 Transcriptional elongation regulator activity 5
GO:0030371 Translation repressor activity 4
GO:0009975 Cyclase activity 4
GO:0031406 Carboxylic acid binding 4
GO:0000146 Microfilament motor activity 4

GO, gene ontology; SGP, salmon genome project; TRAITS, TRanscriptome Analysis of Important Traits of Salmon.

*

Number of features on TRAITS–SGP cDNA microarray with identified GO term.

Dietary lipid experiment

The main findings of this microarray-based investigation will be reported in detail elsewhere. Here, data from preliminary analyses are presented to illustrate the degree of robustness and sensitivity that the microarray experimental design achieved.

The statistical analysis (two-way ANOVA – time and diet) identified 4142 features showing significant differential expression over the year-long experimental time course, demonstrating considerable temporal changes that may be related to a large number of biological factors. In contrast, only 15 significant diet-responsive features were detected (and an additional 10 features with significant time-point × diet interactions) Of the 15 significant diet-associated genes, 10 with BLASTX or other known sequence homologies had functions associated with either HUFA or cholesterol biosynthetic pathways (Table VI). The differential expression of these genes in these samples was confirmed by qRT-PCR. The identity of the remaining five of the 15 probes remains to be established. Calculation of the Pearson's correlation between microarray and qPCR fold-change data over four time-points, for the 11 probes analysed by both methods to date (including those detailed in this paper), gave a value of 0·81 (P < 0·001). This is indicative of a significant agreement between the two estimates of gene expression fold change.

Table VI.

Two-way ANOVA P values (diet) and fold change comparisons for the probes and, or genes with known identity calculated from both microarray and quantitative real-time polymerase chain reaction (qRT-PCR) data. Values are the minimum and maximum recorded over the four time-points

Fold change: VO cf. FO diet

GAL file identifier Diet P values Identity/BLASTX hit Microarray qRT-PCR
hrt_opk_06O21 4·2 × 10−06 Lathosterol oxidase 0·4–1·4 1·2–2·9
bra_snb_04D02 2·3 × 10−04 Δ5 FAD (full-length EST) 1·5–2·7
bra_bfo_03F11 8·5 × 10−04 Δ6 FAD (80% length, EST) 1·3–2·7
can_D6O_S1B06 8·5 × 10−04 Δ6 FAD ORF 1·9–2·8
can_D5O_S1B05 1·0 × 10−03 Δ5 FAD ORF 1·5–2·5
can_D6D_S1B03 1·2 × 10−03 Δ6 FAD 3′-UTR (384 bp) 1·5–2·4 1·4–3·0
bra_snb_09B09 1·2 × 10−03 Squalene monooxygenase 0·2–1·3 1·7–6·3
bra_cbr_B4H01 9·6 × 10−03 Δ5 FAD (partial 3′-UTR, EST) 1·6–2·5
can_D5D_S1B02 1·1 × 10−02 Δ5 FAD 3′-UTR (881 bp) 1·5–2·8 1·1–2·6
bra_opk_07F18 2·0 × 10−02 Isopentenyl-diphosphate Δ-isomerase 1 0·6–1·7 1·3–2·3

BLAST, basic local alignment search tool; EST, expressed sequence tag; FAD, fatty acyl desaturase; FO, fish oil; ORF, open reading frame; UTR, untranslated region; VO, vegetable oil.

To explore the consequences of analysing smaller numbers of microarrays, the significance of expression of the nine Δ5–Δ6 FAD probes present on the microarray was used as a proxy indicator for the performance of a given experimental design. Two strategies were employed to reduce microarray number: (a) omission of dye swap and (b) reduction of biological replicates. Two-way ANOVAs were undertaken (without multiple test correction) and gene lists ordered by ascending (diet) P value. The position of the nine probes in each of the lists is shown in Table VII. In a full analysis incorporating all six biological replicates and a dye swap (12 replicate microarrays per condition), the nine desaturase probes were all present in the top 25 of this list. Reducing microarray numbers, while retaining a dye swap, decreased apparent sensitivity but only markedly when the number of biological replicates was reduced to 3. At this replication level, the results of the analyses appeared to be sensitive to the particular biological replicates selected, as demonstrated by the duplicate analyses (biological replicates 1–3 v.4–6). Omission of a dye swap gave mixed results depending upon the dye selected for the pooled reference sample. With a Cy5 pooled reference, the results (all nine desaturase probes in top 27 of the list) closely matched those of the full 12 microarray design. However, with a Cy3 pooled reference, only three probes were in the top 25 of the list. One probe was not in the list at all and others had dropped as low as position 284.

Table VII.

Sensitivity of microarray analysis to reduction in slide number. Figures in body of table refer to position of probe in two-way ANOVA gene lists ordered by ascending (diet) P value

Dye swap Yes Yes Yes Yes Yes No (Cy5 reference) No (Cy3 reference)

Biological replicates 1–6 1–5 1–4 1–3 4–6 1–6 1–6

Number of slides per condition 12 10 8 6 6 6 6

Gal file identifier Feature identity
bra_snb_04D02 Δ5 FAD (full-length EST) 4 5 11 26 1 4 9
can_D6O_S1B06 Δ6 FAD ORF (1400 bp) 5 6 5 8 42 8 8
bra_bfo_03F11 Δ6 FAD (80% length, EST) 6 10 14 40 3 5 87
can_D5O_S1B05 Δ5 FAD ORF (1400 bp) 7 8 10 21 4 23 7
can_D6D_S1B03 Δ6 FAD 3′-UTR (384 bp) 8 4 4 213 19
bra_cbr_B4H01 Δ5 FAD (partial 3′-UTR, EST) 10 12 13 18 31 9 26
can_D5D_S1B02 Δ5 FAD 3′-UTR (881 bp) 11 14 16 92 19 12 62
can_D6D_S1B04 Δ6 FAD 3′-UTR (401 bp) 16 34 6 7 451 14 284
can_D5D_S1B01 Δ5 FAD 3′-UTR (408 bp) 25 51 55 94 332 27 60

EST, expressed sequence tag; FAD, fatty acyl desaturase; ORF, open reading frame; UTR, untranslated region.

Expression profiles derived from the nine FAD probes over the four time-points and for both diets are depicted in Fig. 3. The four gene-specific 3′-UTR probes (i.e. two Δ5 and two Δ6 FAD fragments) show two distinct patterns of expression, which are also shared by the other desaturase probes on the array. However, while all Δ6 FAD probes show the same pattern of expression, two of the Δ5 FAD probes (a control ORF feature and a near full-length EST) show expression profiles that mimic those of the Δ6 FAD probes.

Fig. 3.

Fig. 3

Fatty acyl desaturase (FAD) expression profiles (two diets over four time-points) derived from microarray analysis of nine different Δ5 FAD and Δ6 FAD probes. Blue lines denote Δ6 FAD probes, and red lines denote Δ5 FAD probes. Profiles for the four 3′-UTR probes are marked with an asterisk. Dashed red lines represent two Δ5 FAD probes [an open reading frame (ORF) PCR fragment and a full-length expressed sequence tag (EST) sequence], which show expression profiles that mimic those of the Δ6 FAD probes.

Discussion

TRAITS–SGP microarray fabrication

The TRAITS–SGP cDNA microarray was conceived as a preliminary tool, contributing towards the goal of developing a more focused DNA chip for routine health and performance monitoring in Atlantic salmon. A number of existing EST collections were used as the basis for its design and construction, and while this approach had obvious advantages in reducing the time frame and costs associated with resource development, it also made microarray fabrication all the more challenging. Not only did probe preparation involve a range of different host–vector combinations but also the need to track and annotate clones from different library resources compounded the difficulties. Most of the ESTs available comprised 5′-end reads. Although this improved the likelihood of successful probe annotation, use of mainly 5′-end sequence data will inevitably have compromised the cluster analysis as not all the clones will have been full length. The extent of gene redundancy on the microarray can only be confidently established from analysis of 3′-end sequence data. Presently, there are no plans to re-sequence the microarray resource.

Enriching the microarray for potential trait-specific genes has proven to be a worthwhile strategy. In a number of studies (Martin et al., 2007; in prep.), SSH-derived probes have been identified as significant responders in immune, protein catabolism and smoltification studies. However, in lipid metabolism studies to date, no probes from diet SSH clones (or contigs containing SSH clones) have been identified as responding differentially between diets. One possible explanation for the apparent failure of the SSH procedure in this case follows from the subsequent microarray analysis of the dietary lipid experiment reported here. Overall fold changes in diet-responsive genes were found to be quite low, with the greatest changes being only three-fold, and there were no data to support differential expression of these genes at the two time points (50 and 53 weeks, i.e. pre- and post-smolt samples) used for diet SSH-library construction. HUFA biosynthesis in Atlantic salmon is known to vary during the growth cycle, with peak activity occurring around seawater transfer (Zheng et al., 2005b).

There are no plans to radically improve upon the probe set in the TRAITS–SGP microarray. In its existing format, there is still spare capacity for feature printing, and additional candidate gene probes have been, and will be added on an ad hoc basis. For example, probes for 12 genes associated with lipid metabolism and 10 immune-related genes, not known to be on the microarray, have recently been included to facilitate two specific experimental studies. Over the past few years, printing technology has advanced significantly. Currently, the TRAITS–SGP microarray is being printed using non-contact inkjet-based technology (ArrayJet Ltd., Dalkeith, U.K.), which delivers more consistent spot and slide uniformity than contact pin printing and which should improve the reliability of generated microarray data and increase the sensitivity of detection.

Dietary lipid experiment

By identifying candidate genes, the results of this experiment provide reassurance of the clone-tracking accuracy of the microarray. They also confirm that the selected microarray design and fabrication technology, together with the experimental methodology employed, provide the capacity for sensitive detection of differential expression. The fold change differences in expression of Δ5 FAD and Δ6 FAD between fish fed VO-based diets v. fish fed FO-based diets closely match those obtained by qRT-PCR analysis in a previous study (Zheng et al., 2005b).

Microarray analyses are expensive and time consuming to perform, and there is often financial pressure to minimize the number of microarrays used in an experiment. Here, reducing the slide number from 12 to 8 (while retaining the dye swap) had little apparent effect on the ability to detect significant differential expression of FAD genes known to show a variable response based on diet. Using just six slides per condition (one per biological replicate and no dye swap) gave different outcomes according to the dye–target combination used. When the experimental sample was labelled with Cy3 and the pooled reference with Cy5, the results were comparable with the full 12 slide (including dye swap) analysis. However, there was much less agreement when the experimental sample was Cy5 labelled and the pooled reference was Cy3 labelled. The reason for this marked dye-dependent disparity remains to be established. However, it has been noted in this and other related studies conducted by TRAITS partners (and others) that (a) pooled reference samples produce higher background intensities and (b) the Cy3 channel consistently displays higher background values compared with the Cy5 channel. The combination of these factors may, at least in part, account for this phenomenon. The relevance of these observations to other studies is difficult to assess. The results are largely dependent on the homogeneity of the system under study and the absolute and relative expression levels of any differentially responding genes. For similar reasons, it is also difficult to critically assess published studies more generally. It is possible that the use of sex-balanced pools for experimental biological replication in this study, rather than individual samples, contributed to the discriminatory capabilities of this microarray experiment. Where cost considerations are an important factor, interrogation of reduced numbers of microarrays may be the only viable option available. If background fluorescence can confidently be controlled, omitting a dye swap, as opposed to reducing biological replication, may be a preferable route to follow in such cases.

The FAD expression profiles demonstrated the potential for obtaining artefactual results because of cross-hybridization of similar transcripts. Expression profiles derived from two Δ5 FAD probes (the ORF PCR fragment and full-length EST) closely matched those obtained from all Δ6 FAD probes. While the 3′-UTRs are very distinct (Δ5 FAD 3′-UTR = 1072 bp; Δ6 FAD 3′-UTR = 457 bp; sequence similarity c. 30%), the two ORFs are very similar to their Δ6 counterparts (both 1365 bp; c. 95% sequence similarity). It has also been reported from qRT-PCR analysis that Δ6 FAD gene expression is approximately four-fold higher than Δ5 FAD expression, in liver tissue of farmed salmon fed on FO (Zheng et al., 2005b). Thus, it would appear that cross-hybridization of the more abundant Δ6 FAD transcript targets is masking the true expression profiles derived from these two Δ5 FAD probes. The failure of the microarray analysis to correctly distinguish between Δ5 FAD and Δ6 FAD expression profiles in all cases clearly illustrates one of the inherent weaknesses of cDNA microarray-based studies, i.e. the inability to distinguish between highly similar message transcripts. This is of particular relevance in interpreting transcriptomic data from salmonid species because fish of the family Salmonidae have undergone a relatively recent whole-genome duplication c. 25–100 million years ago (Allendorf & Thorgaard, 1984), such that simultaneous expression of duplicate genes is a commonly observed phenomenon.

Other studies

As part of the original funded project, the TRAITS partners have already used the cDNA array to explore transcriptomic responses in experiments targeting the four key traits identified as being important for sustained salmon aquaculture, i.e. (a) supply of dioxin-free highly unsaturated oils for the salmon diet, (b) protein growth efficiency, (c) infectious disease and (d) a long and complex life cycle. In addition to the diet work outlined in this paper, other research at Stirling University, funded by the European Union, is comparing gene expression both within and between families of fish fed FO- or VO-based diets. Other studies, investigating the transcriptomic response of Atlantic salmon to infection by infectious pancreatic necrosis virus in both seawater and freshwater environments, are similarly in progress. Researchers at Aberdeen University are using the cDNA microarray to study the immune response in Atlantic salmon in vivo following infection by A. salmonicida and in vitro to examine the response to recombinant fish cytokines. Additionally, short-term starvation trials have been used to explore protein catabolism pathways. Finally, the Cardiff partner has identified genes and gene pathways from three tissues (brain, gill and kidney) that are involved in the parr–smolt transformation. The TRAITS–SGP microarray has also been supplied to an Australian research group to examine gene expression responses to amoebic gill disease (Wynne et al., 2008). To date, its performance has not been critically assessed for use with other closely related species. Cross-species hybridization between salmonids has been shown to be extremely high (Rise et al., 2004; von Schalburg et al., 2005), suggesting that microarrays could be used with similar confidence for both Atlantic salmon and rainbow trout Oncorhynchus mykiss(Walbaum) because of the high level of sequence homology between the two species.

Future directions

The final stage of the TRAITS project, i.e. development and validation of a focused oligonucleotide array, is currently in progress. Approximately 1000 differentially responding genes have been identified from cDNA microarray interrogations by the TRAITS partners and unique 70mer oligonucleotides designed and synthesized for these and appropriate control genes. These form the basis of the TRAITS–SGP secondary oligochip, the performance of which is currently being evaluated. Irrespective of the outcome, the TRAITS–SGP cDNA microarray will continue to be a valuable tool and be available for use to the wider scientific community. Enquiries regarding purchase and use of this microarray should be directed to ARK Genomics (http://www.ark-genomics.org) in the first instance.

Acknowledgments

The TRAITS–SGP microarray was constructed by ARK Genomics (Roslin Institute, U.K.) and funded by Biotechnology and Biological Sciences Research Council (BBSRC) grants 98/EGA17674 (University of Stirling, U.K.), 98/EGA17675 (University of Aberdeen, U.K.), 72/EGA17676 (Cardiff University, U.K.) and with grant 139617/140 ‘Salmon Genome Project’ by the Research Council of Norway. We thank C. Bendixen & L.-E. Holm (DIAS, Foulum, Denmark) for generously supplying additional normalized cDNA libraries. Evaluation work has been further supported by the EU FP5 Project Q5RS-2000-30058 (RAFOA) and EU FP6 Project 016249-2 (AQUAMAX). L.V. was funded by an EU Marie Curie EIF 024589 (MAST). A. Brass and Y. Fang (Manchester University) are thanked for key discussions relating to experimental design and analysis.

References

  1. Adzhubei A A, Vlasova A V, Hagen-Larsen H, Ruden T A, Laerdahl J K, Høyheim B. Annotated expressed sequence tags (ESTs) from pre-smolt Atlantic salmon (Salmo salar) in a searchable data resource. BMC Genomics. 2007;8 doi: 10.1186/1471-2164-8-209. doi: 10.1186/1471-2164-8-209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Allendorf F W, Thorgaard G W. Tetraploidy and the evolution of salmonid fishes. In: Turner B J, editor. Evolutionary Genetics of Fishes. New York: Plenum Press; 1984. pp. 1–46. [Google Scholar]
  3. Altschul S F, Gish W, Miller W, Myers E W, Lipman D J. Basic local alignment search tool. Journal of Molecular Biology. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  4. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society. 1995;B57:289–300. [Google Scholar]
  5. Bonaldo M D F, Lennon G, Soares M B. Normalisation and subtraction: two approaches to facilitate gene discovery. Genome Research. 1996;6:791–806. doi: 10.1101/gr.6.9.791. [DOI] [PubMed] [Google Scholar]
  6. Brazma A, Hingamp P, Quackenbush J, Sherlock G, Spellman P, Stoeckert C, Aach J, Ansorge W, Ball C A, Causton H C, Gaasterland T, Glenisson P, Holstege F C P, Kim I F, Markowitz V, Matese J C, Parkinson H, Robinson A, Sarkans U, Schulze-Kremer S, Stewart J, Taylor R, Vilo J, Vingron M. Minimum information about a microarray experiment (MIAME) – toward standards for microarray data. Nature Genetics. 2001;29:365–371. doi: 10.1038/ng1201-365. [DOI] [PubMed] [Google Scholar]
  7. Cairns M T, Johnson M C, Talbot A T, Pemmasani J K, McNeill R E, Houeix B, Sangrador-Vegas A, Pottinger T G. A cDNA microarray assessment of gene expression in the liver of rainbow trout (Oncorhynchus mykiss) in response to a handling and confinement stressor. Comparative Biochemistry and Physiology D. 2008;3:51–66. doi: 10.1016/j.cbd.2007.04.009. doi: 10.1016/j.cbd.2007.04.009. [DOI] [PubMed] [Google Scholar]
  8. Conesa A, Gotz S, Garcia-Gomez J M, Terol J, Talon M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21:3674–3676. doi: 10.1093/bioinformatics/bti610. [DOI] [PubMed] [Google Scholar]
  9. Ewart K V, Belanger J C, Williams J, Karakach T, Penny S, Tsoi S C M, Richards R C, Douglas S E. Identification of genes differentially expressed in Atlantic salmon (Salmo salar) in response to infection by Aeromonas salmonicida using cDNA microarray technology. Developmental and Comparative Immunology. 2005;29:333–347. doi: 10.1016/j.dci.2004.08.004. [DOI] [PubMed] [Google Scholar]
  10. Hagen-Larsen H, Laerdahl J K, Panitz F, Adzhubei A, Høyheim B. An EST-based approach for identifying genes expressed in the intestine and gills of pre-smolt Atlantic salmon (Salmo salar) BMC Genomics. 2005;6 doi: 10.1186/1471-2164-6-171. doi: 10.1186/1471-2164-6-171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Hastings N, Agaba M K, Tocher D R, Zheng X, Dickson C A, Dick J R, Teale A J. Molecular cloning and functional characterization of fatty acyl desaturase and elongase cDNAs involved in the production of eicosapentaenoic and docosahexanoic acids from α-linolenic acid in Atlantic salmon (Salmo salar) Marine Biotechnology. 2004;6:463–474. doi: 10.1007/s10126-004-3002-8. [DOI] [PubMed] [Google Scholar]
  12. Jordal A-E O, Torstensen B E, Tsoi S, Tocher D R, Lall S P, Douglas S E. Dietary rapeseed oil affects the expression of genes involved in hepatic lipid metabolism in Atlantic salmon (Salmo salar L.) Journal of Nutrition. 2005;135:2355–2361. doi: 10.1093/jn/135.10.2355. [DOI] [PubMed] [Google Scholar]
  13. Koskinen H, Pehkonen P, Vehniainen E, Krasnov A, Rexroad C, Afanasyev S, Molsa H, Oikari A. Response of rainbow trout transcriptome to model chemical contaminants. Biochemical and Biophysical Research Communications. 2004;320:745–753. doi: 10.1016/j.bbrc.2004.06.024. [DOI] [PubMed] [Google Scholar]
  14. MacKenzie S, Iliev D, Liarte C, Koskinen H, Planas J V, Goetz F W, Molsa H, Krasnov A, Tort L. Transcriptional analysis of LPS-stimulated activation of trout (Oncorhynchus mykiss) monocyte/macrophage cells in primary culture treated with cortisol. Molecular Immunology. 2006;43:1340–1348. doi: 10.1016/j.molimm.2005.09.005. [DOI] [PubMed] [Google Scholar]
  15. Marsden M J, DeVoy A, Vaughan L, Foster T J, Secombes C J. Use of a genetically attenuated strain of Aeromonas salmonicida to vaccinate salmonid fish. Aquaculture International. 1996;4:55–66. [Google Scholar]
  16. Martin S A, Caplice N C, Davey G C, Powell R. EST-based identification of genes expressed in the liver of adult Atlantic salmon (Salmo salar) Biochemical and Biophysical Research Communications. 2002;293:578–585. doi: 10.1016/S0006-291X(02)00263-2. [DOI] [PubMed] [Google Scholar]
  17. Martin S A M, Blaney S C, Houlihan D F, Secombes C J. Transcriptome response following administration of a live bacterial vaccine in Atlantic salmon (Salmo salar) Molecular Immunology. 2006;43:1900–1911. doi: 10.1016/j.molimm.2005.10.007. [DOI] [PubMed] [Google Scholar]
  18. Martin S A M, Taggart J B, Seear P J, Bron J E, Talbot R, Teale A J, Sweeney G E, Hoyheim B, Houlihan D F, Tocher D R, Zou J, Secombes C J. Interferon type I and type II responses in an Atlantic salmon (Salmo salar) SHK-1 cell line using the salmon TRAITS/SGP microarray. Physiological Genomics. 2007;32:33–44. doi: 10.1152/physiolgenomics.00064.2007. doi: 10.1152/physiolgenomics.00064.2007. [DOI] [PubMed] [Google Scholar]
  19. National Research Council (NRC) Nutrient Requirements of Fish. Washington, DC: National Academy Press; 1993. [Google Scholar]
  20. Olsvik P A, Lie K K, Jordal A-E, Nilsen T O, Hordvik I. Evaluation of potential reference genes in real-time RT-PCR studies of Atlantic salmon. BMC Molecular Biology. 2005;6 doi: 10.1186/1471-2199-6-21. doi: 10.1186/1471-2199-6-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Pertea G, Huang X Q, Liang F, Antonescu V, Sultana R, Karamycheva S, Lee Y, White J, Cheung F, Parvizi B, Tsai J, Quackenbush J. TIGR gene indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics. 2003;19:651–652. doi: 10.1093/bioinformatics/btg034. [DOI] [PubMed] [Google Scholar]
  22. Pfaffl M W. A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Research. 2001;29:2002–2007. doi: 10.1093/nar/29.9.e45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Pfaffl M W, Horgan G W, Dempfle L. Relative expression software tool (REST) for group-wise comparison and statistical analysis of relative expression results in real-time PCR. Nucleic Acids Research. 2002;30:9–36. doi: 10.1093/nar/30.9.e36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Purcell M K, Nichols K M, Winton J R, Kurath G, Thorgaard G H, Wheeler P, Hansen J D, Herwig R P, Park L K. Comprehensive gene expression profiling following DNA vaccination of rainbow trout against infectious hematopoietic necrosis virus. Molecular Immunology. 2006;43:2089–2106. doi: 10.1016/j.molimm.2005.12.005. [DOI] [PubMed] [Google Scholar]
  25. Rise M L, von Schalburg K R, Brown G D, Mawer M A, Devlin R H, Kuipers N, Busby M, Beetz-Sargent M, Alberto R, Gibbs A R, Hunt P, Shukin R, Zeznik J A, Nelson C, Jones S R, Smailus D E, Jones S J, Schein J E, Marra M A, Butterfield Y S, Stott J M, Ng S H, Davidson W S, Koop B F. Development and application of a salmonid EST database and cDNA microarray: data mining and interspecific hybridization characteristics. Genome Research. 2004;14:478–490. doi: 10.1101/gr.1687304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Roberge C, Paez D J, Rossignol O, Guderley H, Dodson J, Bernatchez L. Genome-wide survey of the gene expression response to saprolegniasis in Atlantic salmon. Molecular Immunology. 2007;44:1374–1383. doi: 10.1016/j.molimm.2006.05.005. [DOI] [PubMed] [Google Scholar]
  27. Salem M, Kenney P B, Rexroad C E, Yao J B. Microarray gene expression analysis in atrophying rainbow trout muscle: a unique nonmammalian muscle degradation model. Physiological Genomics. 2006;28:33–45. doi: 10.1152/physiolgenomics.00114.2006. [DOI] [PubMed] [Google Scholar]
  28. Sargent J R, Tacon A G R. Development of farmed fish: a nutritionally necessary alternative to meat. Proceedings of the Nutrition Society. 1999;58:377–383. doi: 10.1017/s0029665199001366. [DOI] [PubMed] [Google Scholar]
  29. von Schalburg K R, Rise M L, Cooper G A, Brown G D, Gibbs A R, Nelson C C, Davidson W S, Koop B F. Fish and chips: various methodologies demonstrate utility of a 16,006-gene salmonid microarray. BMC Genomics. 2005;6 doi: 10.1186/1471-2164-6-126. doi: 10.1186/1471-2164-6-126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Torstensen B E, Bell J G, Rosenlund G, Henderson R J, Graff I E, Tocher D R, Lie Ø, Sargent J R. Tailoring of Atlantic salmon (Salmo salar L.) flesh lipid composition and sensory quality by replacing fish oil with a vegetable oil blend. Journal of Agricultural and Food Chemistry. 2005;53:10166–10178. doi: 10.1021/jf051308i. [DOI] [PubMed] [Google Scholar]
  31. Villeneuve L, Gisbert E, Zambonino-Infante J L, Quazuguel P, Cahu C L. Effect of nature of dietary lipids on European sea bass morphogenesis: implication of retinoid receptors. British Journal of Nutrition. 2005;94:877–884. doi: 10.1079/bjn20051560. [DOI] [PubMed] [Google Scholar]
  32. Wynne J W, O'sullivan M G, Cook M T, Stone G, Nowak B F, Lovell D R, Elliott N G. Transcriptome analyses of Amoebic gill disease-affected Atlantic Salmon (Salmo salar) tissues reveal localized host gene suppression. Marine Biotechnology. 2008 doi: 10.1007/s10126-007-9075-4. (in press). doi: 10.1007/s10126-007-9075-4. [DOI] [PubMed] [Google Scholar]
  33. Zheng X, Tocher D R, Dickson C A, Bell J G, Teale A J. Effects of diets containing vegetable oil on expression of genes involved in polyunsaturated fatty acid biosynthesis in liver of Atlantic salmon (Salmo salar) Aquaculture. 2004;236:467–483. [Google Scholar]
  34. Zheng X, Tocher D R, Dickson C A, Dick J R, Bell J G, Teale A J. Highly unsaturated fatty acid synthesis in vertebrates: new insights with the cloning and characterisation of a Δ6 desaturase of Atlantic salmon. Lipids. 2005a;40:13–24. doi: 10.1007/s11745-005-1355-7. [DOI] [PubMed] [Google Scholar]
  35. Zheng X, Torstensen B E, Tocher D R, Dick J R, Henderson R J, Bell J G. Environmental and dietary influences on highly unsaturated fatty acid biosynthesis and expression of fatty acyl desaturase and elongase genes in liver of Atlantic salmon (Salmo salar) Biochimica et Biophysica Acta. 2005b;1734:13–24. doi: 10.1016/j.bbalip.2005.01.006. [DOI] [PubMed] [Google Scholar]

Articles from Journal of Fish Biology are provided here courtesy of Wiley

RESOURCES