Abstract
In North America, a high mortality of soft-shell clams Mya arenaria was found to be related to the disease known as disseminated neoplasia (DN). Disseminated neoplasia is commonly recognized as a tetraploid disorder related to a disruption of the cell cycle. However, the molecular mechanisms by which hemocytes of clams are transformed in the course of DN remain by far unknown. This study aims at identifying the transcripts related to DN in soft shell clams’ hemocytes using next generation of sequencing (Illumina HiSeq2000). This study mainly focuses on transcripts and molecular mechanisms involved in cell cycle. Using Illumina next generation of sequencing, more than 95,399,159 reads count with an average length of 45 bp was generated from three groups of hemocytes: (1) a healthy group with less than 10% of tetraploid cells; (2) an intermediate group with tetraploid hemocytes ranging between 10% and 50% and (3) a diseased group with more than 50% of tetraploid cells. After the reads were cleaned by removing the adapters, de novo assembly was performed on the sequences and more than 73,696 contigs were generated with a mean contig length estimated at 585 bp ranging from 189 bp to 14,773 bp. Once a Blastx search against NCBI Non Redundant database was performed and the duplicates removed, 18,378 annotated sequences matched known sequences, 3078 were hypothetical and 9002 were uncharacterized sequences. Fifty percent and 41% of known sequences match sequences from Mollusca and Gastropoda respectively. Among the bivalvia, 33%, 17%, 17% and 15% of the contigs match sequences from Ostreoida, Veneroida, Pectinoida and Mytiloida respectively. Gene ontology analysis showed that metabolic, cellular, transport, cell communication and cell cycle represent 33%, 15%, 9%, 8.5% and 7% respectively of the total biological process. Approximately 70% of the component process is related to intracellular process and 15% is linked to protein and ribonucleoprotein complex. Catalytic activities and binding molecular processes represent 39% and 33% of the total molecular functions. Interestingly, nucleic acid binding represents more than 18% of the total protein class. Transcripts involved in the molecular mechanisms of cell cycle are discussed providing new avenues for future investigations.
Keywords: Transcriptomic, Neoplasia, Tetraploidy, Mya arenaria, Cell cycle, Molecular mechanisms
1. Introduction
Disseminated neoplasia (DN) is defined as a leukemia-like disease affecting hemocytes of soft-shell clams. This disease is mainly characterized by a high number of circulating hemocytes containing pleomorphic nuclei with a high nucleus–cytoplasm ratio [1,2]. These abnormal circulating cells lose their function of phagocytosis [3] and exhibit higher DNA contents and mitosis rates than normal cells resulting in the formation of tetraploid cells [4,5]. During normal cell cycle, chromosomes segregate only when an adequate kinetochore-microtubule attachment exists enabling cells to pass the spindle assembly checkpoint [6]. However, after prolonged arrest at the spindle assembly checkpoint that promotes cytokinesis failure, cells become tetraploid [7].
The etiology of DN in Mya arenaria remains unknown although several factors such as retrovirus infection or contamination seem to contribute to the development of the disease. Retroviral infection was shown as the main factor [8] of DN development; however, other studies did not succeed in detecting retroviral components in diseased clams [9]. In addition, studies stipulate that environmental contamination by pesticides could be the main cause of DN in soft shell clams collected from Prince Edward Island [10,11].
Many studies have investigated the molecular mechanisms involved in the development of the disease [12–18]. For instance, some studies have shown that the proliferation of hemocytes in clams is due to the sequestration of p53 by mortalin [13,14], whereas other studies demonstrated that p53's pathways were disrupted by the activation of the Mouse Double Minute 2 (MDM2) proto-oncogene [15]. Our recent investigation using subtractive suppressive hybridization techniques suggested the involvement of proto-oncogene such as RAS like protein members, c-myc and c-jun in the development of DN [17]. It was hypothesized that the proto-oncogenes were regulated by the presence of transposase whose gene expression was increased in tetraploid hemocytes [16]. Briefly, although several studies have been initiated, the molecular mechanisms by which hemocytes become neoplastic are still to be unravelled. Unfortunately, little is known on the transcriptome of soft shell clams. Only few sequences are available in GenBank represented mainly by Map53 (Acc# AF253323.1), Map73 (Acc# AF253324.1), and mortalin (Acc # EF576660).
The development of the new generation of sequencing technologies represents a great opportunity to increase the number of transcript sequences and would provide a basis platform for molecular mechanisms studies involved in DN in soft shell clams.
Thus, this study aims at increasing the numbers of transcript sequences in hemocytes of soft shell clams affected by DN. Our approach focused specifically on sequencing mRNA isolated from different groups of hemocytes characterized by their tetraploid status. The sequences generated in this study will constitute a baseline in unravelling the molecular pathway(s) involved in DN as well as a source of potential target transcripts which could be involved in hemocytes' immune system of soft shell clams, M. arenaria.
2. Materials and methods
2.1. Sampling
Approximately 5-cm-long specimens of M. arenaria were collected at low tide at a depth of 15–20 cm using a hand rake from May to November 2011 in North River (46°15′10″N, 63°10′42″W) (Charlottetown, Prince Edward Island, Canada). Clams were washed with seawater and transported to the Atlantic Veterinary College at the University of Prince Edward Island for further analysis.
2.2. Flow cytometry
Flow cytometric (FCM) analysis was used to assess the ploidy status of M. arenaria's haemocytes according to the methods described by Delaporte et al. [5]. This technique enables the screening of cell populations for DNA content. The protocol is based on the binding properties of propidium iodide (PI), which stains by intercalating into DNA strands. The PI fluorescence intensity is proportional to the DNA content in the cell. Briefly, hemolymph (500 μL) was withdrawn from individual clams using a 3 mL syringe fitted with a 25-gauge needle. Haemocytes were fixed in 2.5 mL of cold absolute ethanol and stored at −20 °C for at least 30 min. Fixed cells were centrifuged (400g for 10 min at room temperature), and the supernatant was discarded. Haemocyte pellets were re-suspended in 0.01 M phosphate-buffered saline (PBS) and the cells were allowed to re-hydrate for 30 min at room temperature. After two washes in PBS (400g for 10 min at room temperature), cells were re-suspended in 380 μL of PBS solution and transferred to flow cytometer tubes by filtering through an 80 μm nylon mesh. Propidium iodide (50 μg mL−1) and DNAse-Free RNase A (50 μg mL−1) were added to each tube before the mixtures were incubated in the dark until optimal PI staining for 30 min. PI fluorescence, which is related to the DNA content of each cell, was detected on the orange photomultiplicator of a FACSCalibur flow cytometer (BD Biosciences) at a wavelength ranging between 550 and 600 nm. For each sample, 10,000 particles were counted at low flow rate (15 μL min−1). For each cell event, a single pulse of PI fluorescence was represented according to its area and width. The pulse width needs to be compared with the pulse area in order to discriminate the cells in the phase G2/M from doublets of G0/G1 cells represented by the same DNA quantity. To gate single haemocytes, PI fluorescence intensities were plotted as a FL2-area vs. FL2- width dot-plot. The region R1 was drawn in order to discriminate single cells from the doublets. The single cells gated in R1 were plotted on a FL2-area histogram and used to estimate the percentage of normal and tetraploid haemocytes in the analyzed cell population. Results are presented as the percentage of tetraploid haemocytes per clam. Three different classes of clams were distinguished according to the percentage of tetraploid hemocytes.
2.3. RNA extraction
Based on their tetraploid status, 3 groups of clams were selected: (1) Group C (healthy clams considered as control) with a low percentage of tetraploid hemocytes (<10%); Group D (disease in development): individuals presenting a percentage of tetraploid cells ranging between 10% and 50%; and Group E (established disease): clams with a high percentage of tetraploid hemocytes (>50%). Hemolymph (2 mL) was withdrawn from individual clams using a 3 mL syringe fitted with a 25-gauge needle. Hemolymph was centrifuged at 400g for 15 min at 4 °C in order to isolate hemocytes from the serum. Total RNAs were extracted from hemocytes using RNeasy purification kit (Qiagen, USA), treated with DNaseI and the quality of RNA was analyzed using the bioanalyser Experion (Biorad, Canada). RNA concentrations were analyzed using a Nanodrop (ND 1000, USA) spectrophotometer.
Samples with high quality (RNA quality index higher or equal at 8) and concentration higher than 100 ng mL−1 was selected. Based on these criteria, 6 samples were pooled for the group C (0–10%), 7 samples for the group D (10–50%) and 3 samples for the group E (>50%).
2.4. Library preparation and sequencing
Library preparation and next generation of sequencing were carried out by Eurofin (Alabama, US). Three individual libraries (one from each group) were prepared with NuGEN Ovation RNA-Seq system V2 according to the manufacturer's recommendations. NuGEN Ovation RNA-Seq system V2 corresponds to three main steps: first strand cDNA synthesis, second strand cDNA synthesis and purification and SPIA amplification and purification. Briefly, First Strand Primer Mix was mixed to RNA (100 ng). The mixture was warmed at 65° C for 5 min. First Strand Master Mix (3 mL) was added to each tube and placed in a thermal cycler at 4 ° C for 1 min, 25 °C for 10 min, 42 °C for 10 min, and 70 °C for 15 min. Second Strand Master Mix (10 mL) was added to each First Strand reaction tube. The mixture was added and placed in a thermal cycler at 4 °C for 1 min, 25 °C for 10 min, 50 °C for 30 min, and 80 °C for 20 min. Second strand cDNA was purified using Agencourt RNAClean XP purification beads provided with a NuGEN Ovation RNA-Seq system V2 kit. Finally, SPIA Master Mix (40 μL) was added to each tube containing the double-stranded cDNA bound to the dried beads and placed in a thermo cycler at 4 °C for 1 min, 47 °C for 60 min, and 80 °C for 20 min. SPIA cDNA was purified using QIAGEN MinElute Reaction Cleanup Kit (Qiagen, USA). SPIA cDNA was sequenced on Illumina HiSeq2000 with 2 × 50 bp according to the manufacturer's recommendations.
2.5. Sequence analysis and bioinformatics
Reads generated from Illumina HiSeq2000 sequencing were assembled using the CLC Genomics Workbench software v5.5. Adaptors and primer sequences were trimmed as well as low quality and duplicates removed. De novo sequence assembly was performed using de Bruijn graphs as described in [19–21]. The generated contigs and isotigs were blasted against NCBI Non-Redundant database including Swissprot, Metazoan Refseq and UniprotKB/Trembl protein databases using the GenomeQuest bioinformatic platform (US) with a cut-off of 10−6. Gene ontology searches were performed using the Panther Pathway [22]. Sequences were submitted to BLAST comparison against the KEGG GENES database to obtain KO (KEGG Orthology) assignments and to generate KEGG pathways. The threshold >60 was applied to BLAST bit scores. Reads per kilobase per million mapped reads (RPKM) were quantified from the three groups. Ratio to the healthy group C were calculated for group D and group E. Data are reported in a histogram representing Log2 RPKM ratio.
3. Results and discussion
3.1. De novo assembly
The tetraploid status of hemocytes has been screened using flow cytometry. Three groups of hemocytes were determined: Group C, a healthy group with less than 10% of tetraploid cells; Group D, an intermediate group with tetraploid hemocytes ranging between 10% and 50% and group E, a diseased group with more than 50% of tetraploid cells. More than 95,399,159 reads count were generated from the three groups with an average length of 45 bp.
The bioinformatic analysis of the sequences was performed using the CLC Genomics Workbench software v5.5. The adaptors and primers were trimmed; in addition, duplicates and low quality sequences were removed from the data set. Upon cleaning, de novo assembly was performed on the sequences using de Bruijn graphs. More than 73,696 contigs were generated with a mean contig length estimated at 585 bp ranging from 189 bp to 14,773 bp. More than 90.3% of the contigs (66,612 sequences) had a length ranging from 189 to 1268 bp and less than 9.7% were longer than 1268 bp (Fig. 1).
Fig. 1.

Contigs length distribution after de novo assembly.
The emergence of the next generation of sequencing has dramatically increased the knowledge and information on bacteria, invertebrate and vertebrate organisms' transcriptome. Although the number of Expressed Sequence Tags (ESTs) has increased during the last decade, the knowledge on mollusk bivalvia in particular M. arenaria is still limited. GenBank database contains only 248,489 nucleotides and 373,300 ESTs related to Bivalvia (4/15/2013). Ostreoida and Mytiloida represent the most representative groups with 223,017 (59.7%) and 71,328 (19.1%) ESTs respectively. Among the Ostreoida, Crassostrea is the main representative group with 222,785 (99.9%) ESTs and Mytilus have been mainly sequenced in the Mytiloida family with 67,990 (95.3%). Unfortunately, the number of ESTs related to M. arenaria is very poor with 100 entries represented mainly by cytochrome oxidase, microsatellite and ribosomal RNA. Among the 100 entries, only 10 transcripts have been recorded in GenBank (4/15/2013) represented by M. arenaria p53 tumor suppressor homolog (Myap53) mRNA, partial cds (U45237.1); M. arenaria voltage-dependent sodium channel mRNA, partial cds (AY847740.1); M. arenaria p73-like protein mRNA, complete cds (AF253324.1); M. arenaria E3 ubiquitin-protein ligase mRNA, complete cds (AF154109.2); M. arenaria aryl hydrocarbon receptor-like protein mRNA, complete cds (AF261769.1); M. arenaria p53 tumor suppressor-like protein mRNA, complete cds (AF253323.1); M. arenaria ribosomal protein S19 (S19) mRNA, complete cds (U63092.1); M. arenaria mitochondrial mortalin-2 precursor (Mot-2) mRNA, complete cds, nuclear gene for mitochondrial product (AY326398.2); M. arenaria mitochondrial mortalin splice variant mRNA, complete cds, alternatively spliced, nuclear gene for mitochondrial product (EF576660.1); and M. arenaria Rack1 gene for Receptor of Activated Kinase C 1 (AM404081.1). Most of the messengers cited above are related to the p53-like molecular mechanisms involved in DN in soft shell clams M. arenaria [12,23,24]. However, the number of sequences remains by far unrepresentative of M. arenaria's transcriptome, which therefore makes it difficult to understand the molecular mechanisms by which hemocytes become neoplastic. This study attempts to populate M. arenaria's sequence database in particular in the field of clams’ pathology such as disseminated neoplasia. These sequences could be used in studies interested in molecular mechanisms involved in disseminated neoplasia but also in ecotoxicogenomics where M. arenaria is used as a sentinel species for biosurveillance programmes.
3.2. Annotation
A Blastx search was performed against NCBI Non Redundant database including Swissprot, Metazoan Refseq and UniprotKB/Trembl protein databases. The GenomeQuest bioinformatic platform was used for the Blastx search and a cut-off of 10−6 was applied for the analysis. Once the duplicates were removed, 18,378 annotated sequences matched known sequences, 3078 were hypothetical and 9002 were uncharacterized sequences. Eighty-five percent of the annotated sequences are among eukaryote identities (Fig. 2A) with 18% and 5% linked to arthropoda and mollusca phylla (Fig. 2B).
Fig. 2.

Phylogenetic distribution of the best hit in animal kingdom (A) and eukaryota (B) based on BlastX search in the NCBI nr database.
Data showed that 50% and 41% of known sequences match with sequences from Mollusca and Gastropoda respectively (Fig. 3A). Among the bivalvia, 33%, 17%, 17% and 15% of the contigs match sequences from Ostreoida, Veneroida, Pectinoida and Mytiloida respectively (Fig. 3B).
Fig. 3.

Species distribution based on BlastX alignment in Bivalvia (A) and Mollusca (B) NCBI nr database.
3.3. Gene ontology
Gene ontology analysis showed that metabolic, cellular, transport, cell communication and cell cycle represent 33%, 15%, 9%, 8.5% and 7% of the total biological process respectively (Fig. 4A). The majority (around 70%) of the component process is related to intracellular processes and approximately 15% linked to protein and ribonucleoprotein complex respectively (Fig. 4B). Catalytic activities and binding molecular processes represent 39% and 33% of the total molecular functions respectively (Fig. 4C). Interestingly, nucleic acid binding represents more than 18% of the total protein class (Fig. 5).
Fig. 4.

Percentage representation of gene ontology mapping (A) biological processes, (B) cellular component and (C) molecular functions.
Fig. 5.

Distribution of best hits annotated protein class based on the BlastX alignment and gene ontology mapping.
3.4. Gene expression and cell cycle
Tetraploid cells with double DNA quantity can lead to tumorgenesis under a non-functional p53 gene [25]. Several studies have shown the link between the increase of tetraploid tumor cells and the failure of p53 or downstream gene such as p21 in mammalians [26]. The same observations were described in soft shell clams M. arenaria where p53 function was affected by high level of mortalin in the cytoplasm or high expression of MDM2 [13,15]. The formation of tetraploid cells is a consequence of cell fusion, endoreduplication, cytokinesis failure or mitotic slippage [26]. During the cell cycle, the chromosome segregation is under control of mitotic checkpoint genes .
3.4.1. Transcripts involved in G1– S phase of cell cycle
At the G1 phase also known as the growth phase which corresponds to a high rate of biosynthesis, the CDK4,6 like proteins were identified in our database. CDK4,6 linked to cyclin D is the first serine-threonine kinases activated during the cell cycle. CDK4,6-cyclin D complex under proliferative signal activation allow the cell to enter the cell cycle at its first G1 phase [27]. Once activated, CDK4,6-Cyclin D initiates a sequential activation of downstream of cyclin E-CDK2, which regulates the G1–S transition and centrosome duplication. In our study, cyclin D2 and cyclin E transcripts were 4.6 and 3.8 times respectively higher than the control during the development phase of the disease (Fig. 6). Once the disease was established, the expression level of cyclin D2 and E was 2.3 and 2.5 higher than the control respectively. The regulation of centrosome duplication is key in the regulation of chromosome segregation. Several types of cancer were found to be related to centrosome amplification triggering aneuploidy [28]. It was shown that cyclin E-CDK2 complex is a key regulator in centrosome duplication and DNA replication [29]. It was demonstrated that cyclin E can bind the centrosome using the centrosomal localization signal (CLS) and then initiate the S-phase of the cell cycle [30]. In addition, CLS domain allows cyclin E to bind the minimichrosome maintenance 5 (MCM 5) recognised as a DNA pre-replication complex factor [31]. It was shown that the expression of MCM5 inhibits centrosome amplification in CHO cells arrested in S-phase corroborating the essential role of MCM5 on the centrosome regulation [31]. However, in our study, MCM5 transcript level was similar to the control. Furthermore, our data showed an overexpression of c-myc in groups D and E hemocytes (Fig. 6) corroborating our previous findings [17].
Fig. 6.

Expression levels of gene identified in our sequence database during the phase G1–S. Histogram represents Log2 RPKM ratio of group D (disease in development) and E (established disease) to group C (control). GSK3b: glycogen synthase kinase 3 beta; RBL1 (p107): retinoblastoma-like protein 1; E2F4/5: transcription factor E2F4/5; c-Myc: Myc proto-oncogene protein; Miz1: zinc finger and BTB domain-containing protein 17; SMAD 2/3: SMAD 2/3: mothers against decapentaplegic homolog 2/3; SMAD4: mothers against decapentaplegic homolog 4; CycD2: cyclin D2; CDK4: cyclin-dependent kinase 4; CycE: cyclin E; CDK2: cyclin-dependent kinase 2; SCF: S-phase kinase-associated protein 1; RBL2 (p107-p130): retinoblastoma-like protein 2; E2F4/5: transcription factor E2F4/5; and E2F3: transcription factor E2F3.
3.4.2. Transcripts involved in S–G2 phase of cell cycle
In late G1 phase, chromosome duplication is triggered by a specific cdk protein known as S-phase promoting cdks. DNA replication is initiated by the formation of the pre-replicative complex (pre-RC). This complex activity is dependent on the association of Cdc6 with the Origin Recognition Complex (ORC) [32]. Protein kinase such as Dbf4-Cdc7 and Clb5/6(B-cyclin)-Cdc28 phosphorylate and activate proteins within the pre-RC initiating then the DNA replication in phase S [33]. Cdc7 is a serine-threonine kinase which phosphorylates MCM2. MCM2 is a key component of the DNA replicative helicase, which plays a central role in genome duplication. The transcript level of Cdc7 in hemocytes from group D (disease in development) is 19.1 times higher than the control, whereas this level is 5 times higher in diseased hemocytes with a tetraploidy status higher than 50% (Fig. 7). Interestingly, Cdc7 gene and protein are highly expressed in tumor cells including leukemia [34,35]. Several investigations targeted Cdc7 as a cancer therapy by inhibiting Cdc7 expression in cancer cells [36]. In G1–S transition phase, Cdc6 is a key component in DNA replication. Cdc 6 is an ATPase belonging to the AAA family with chaperone like activities as well as a role in transcriptional regulation [37]. As shown for Cdc7, the level of Cdc6 transcript is 8.8 and 3.5 higher in groups D and E respectively (Fig. 7). In cancer cells, overexpression of Cdc6 inhibits expression of p16INKa/p15INKb bot activator of retinoblastoma pathway and a p53 stimulator, ARF [38]. In addition, it was shown that the overexpression of Cdc6 has a synergistic effect with mutant p53 in tumor progression and chromosome instability [39].
Fig. 7.

Expression levels of gene identified in our sequence database during the phase S–G2. Histogram represents Log2 RPKM ratio of group D (disease in development) and E (established disease) to group C (control). Cyc H: cyclin H; CycA cyclin A; CDC7 cell division control protein 7; CDC6: cell division control protein 6; ORC5: origin recognition complex subunit 5; ORC3: origin recognition complex subunit 3; ORC2: origin recognition complex subunit 2; ORC1: origin recognition complex subunit 1; MCM4: DNA replication licensing factor; MCM5: DNA replication licensing factor; MCM2: DNA replication licensing factor; MCM6: DNA replication licensing factor; MCM7: DNA replication licensing factor MCM7; MCM3: DNA replication licensing factor; Cdk7: cyclin-dependent kinase 7; CDK2: cyclin-dependent kinase 2; GADD45: growth arrest and DNA-damage-inducible protein; RBL2: retinoblastoma-like protein 2; RBL1: retinoblastoma-like protein 1; Rb: retinoblastoma-associated protein; MDM2: MDM2 Binding Protein; p300: E1A/CREB-binding protein; DNA-PK: DNA-dependent protein kinase catalytic subunit; ATM/ATR: ataxia telangiectasia mutated family protein; Chk1: serine/threonine-protein kinase; CDK1: cyclin-dependent kinase 1; and Wee: wee1-like protein kinase.
The growth arrest and DNA damage inducible gene (GADD45) is a group of genes that are activated by genomic damage compounds such as UV radiations, chemotherapeutic agents and growth factors [40]. Under activation by p53, GADD45 interacts with Cdc2 which is cleaved from Cyclin B1 and the complex GADD45/Cdc2 acts on the cell cycle by arresting the division at the checkpoint G2/M [41]. The high expression of GADD45 recorded in group D (hemocytes with tetraploidy status ranging between 10% and 50%).
3.4.3. Transcripts involved in G2–M phase of cell cycle
G2 phase is mainly controlled by CycA/CDK1 and CycB/CDK1. Cyclin B1 is expressed at very low levels and its expression increases at the G2–M phase [42]. However, in tumor cells, cyclin B1 is mainly localized in the cytoplasm and overexpressed throughout the cell cycle due to the inactive state of p53 [43]. It was shown that cyclin A is key in the progression of male germ cells through meiosis [44]. Over-expression of cyclin A was found in leukemic cells in more than 50% of acute myeloid leukemia patients [44]. In our study, both cyclin A (Fig. 7) and B (Fig. 8) as well as CDK1 (Fig. 8) are highly expressed in hemocytes with a tetraploid status ranging between 10% and 15%. The overexpression of cyclin B seems to coincide with a high expression of c-myc (Fig. 6). It was shown that cyclin B1 is a c-myc target whose overexpression induces chromosomal activity and tetraploidy [45].
Fig. 8.

Expression levels of gene identified in our sequence database during the phase G2–M. Histogram represents Log2 RPKM ratio of group D (disease in development) and E (established disease) to group C (control). MPS1: serine/threonine-protein kinase; MAD1: mitotic spindle assembly checkpoint protein; MAD2: mitotic spindle assembly checkpoint protein; BUB3: cell cycle arrest protein; BUB1: checkpoint serine/threonine-protein kinase; BUB1β/BUBR1: mitotic checkpoint serine/threonine-protein kinase; 14–3–3 β−τ−ζ: 14–3–3 protein beta/theta/zeta; CycB: cyclin B; CDK1: cyclin-dependent kinase 1; Myt1: membrane-associated tyrosine- and threonine-specific cdc2-inhibitory kinase; PLK1: polo-like kinase 1; SMC1: structural maintenance of chromosome 1; SMC3: structural maintenance of chromosome 3 (chondroitin sulfate proteoglycan 6); STAG1/2: cohesin complex subunit; Rad21: cohesin complex subunit SCC1; ESP1: separase; APC/C4: anaphase-promoting complex subunit 4; APC/C2: anaphase-promoting complex subunit 2; APC/C6: anaphase-promoting complex subunit 6; APC/C8: anaphase-promoting complex subunit 8; APC/C3: anaphase-promoting complex subunit 3; APC/C10: anaphase-promoting complex subunit 10; APC/C1: anaphase-promoting complex subunit 1; APC/C7: anaphase-promoting complex subunit 7; Cdh1/CDC20: cell division cycle 20, cofactor of APC complex; and CDC 14: cell division cycle 14.
In the M phase, the anaphase promoting complex/cyclosome (APC/C) is an E3 ubiquitin ligase complex that targets the degradation of protein involved in the cell cycle during the M phase. APC/C's activation depends on two WD-40 domain proteins, CDC20 and CDH1. In our study, APC/C family members identified in our database and in particular APC/C sub-unit 7 showed a high expression level in hemocyte with tetraploid status ranging between 10% and 50% as well as in diseased organisms with more than 50% tetraploid hemocytes (Fig. 8). Similarly, Cdh1/Cdc20 is also higher in both groups D and E (Fig. 8). In yeast mitosis, CDC20 and CDH1 bind to the APC/C, which is activated during the mitosis process [46]. Cdc20 activates APC/C during early mitosis, while Cdh1 plays a key role and controls several phases from late mitosis to G1/S transition [47]. It was shown that the deregulation of Cdh1 impaired centrosome formation, cytokinesis and DNA re-replication [48]. Overall, the disruption of APC/Cdh1 activity may lead to cell cycle instability and thus the development of neoplastic cells [49].
On the other hand, the expression level of separin (Esp1) is high (Log2 ratio Med/Neg = 3.58) in hemocytes with a tetraploidy ranging from 10% to 50% (Fig. 8). Separin is among the protein complexes known as cohesin. This complex maintains the cohesion between the sister chromatid which is important for equal segregation of sister chromatids in anaphase [50]. Separin related proteins have been identified in yeast [51], Aspergillus nidulans [52], Xenopus, and human [53] and now in M. arenaria (this study) suggesting that the mechanism of chromatid segregation is evolutionarily conserved among organisms.
3.5. Cell cycle molecular pathway in hemocytes of M. arenaria
Cell cycle progression is mainly regulated by the complex cyclin-CDK (cyclin-dependent kinase). These kinases are under control of mitogenic signals and play a central role in cell proliferation [27].
Pathways were generated once the sequences were blasted against KEGG Genes database. Focusing on pathways related to cell cycle, 77 transcripts were identified from our sequence database. Based on the KEGG pathways, these sequences are involved at different phases of the cell cycle: G1–S–G2–M (Fig. 9).
Fig. 9.

KEGG mapping for cell cycle pathway. The green boxes represent the annotated protein found in our library. The white boxes are the unidentified proteins in our database sequences.
In this study, individual variability was not taken into account because individuals were pooled in three different pools based on their tetraploidy status. However, sequences and data generated from this study provided a snapshot of molecular mechanisms involved in cell cycle in hemocytes of soft-shell clams M. arenaria. Overall, apart from few transcripts, the majority of the identified genes were highly expressed in group D where the population of hemocytes is ranging between 10% and 50% of tetraploid cells. This corroborates our previous study showing an up-regulation of p53, p73 and mortalin in clams with a moderate percentage (15–50%) of tetraploid cells [18].
4. Conclusion
During this study, 18,378 annotated sequences matched known sequences were generated from hemocytes of soft-shell clams, M. arenaria including sequences related to the development of DN. Focusing on sequences related to cell cycle, the majority of the cell cycle pathways described in vertebrates have been identified in M. arenaria. Overall, gene expression levels of the majority of the transcripts identified in cell cycle are higher during the development of the disease (tetraploidy ranging between 10% and 50%). Further studies are needed to investigate specific molecular mechanisms involved in cell cycle in relation with the development of hemic neoplasia in soft shell clams, M. arenaria. This study will increase M. arenaria's sequences for studies involved in molecular pathology, ecotoxicogenomics as well as in identifying biomarkers for biosurveillance programmes.
Footnotes
This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-No Derivative Works License, which permits non-commercial use, distribution, and reproduction in any medium, provided the original author and source are credited.
References
- 1.Elston RA, Kent ML, Drum AS. Transmission of hemic neoplasia in the bay mussel, Mytilus edulis, using whole cells and cell homogenate. Dev Comp Immunol. 1988;12:719–727. doi: 10.1016/0145-305x(88)90047-x. [DOI] [PubMed] [Google Scholar]
- 2.Barber B. Neoplastic diseases of commercially important marine bivalves. Aquat Living Resour. 2004;17: 449-466 [Google Scholar]
- 3.Walker C, Böttger SA, Mulkern J, Jerszyk E, Litvaitis M, Lesser M. Mass culture and characterization of tumor cells from a naturally occurring invertebrate cancer model: applications for human and animal disease and environmental health. Biol Bull. 2009;216:23–39. doi: 10.1086/BBLv216n1p23. [DOI] [PubMed] [Google Scholar]
- 4.Reno PW, House M, Illingworth A. Flow cytometric and chromosome analysis of softshell clams, Mya arenaria, with disseminated neoplasia. J Invertebr Pathol. 1994;64:163–172. [Google Scholar]
- 5.Delaporte M, Synard S, Pariseau J, McKenna P, Tremblay R, Davidson J. Assessment of haemic neoplasia in different soft shell clam Mya arenaria populations from eastern Canada by flow cytometry. J Invertebr Pathol. 2008;98:190–197. doi: 10.1016/j.jip.2007.12.005. [DOI] [PubMed] [Google Scholar]
- 6.Rieder CLC, Maiato HH. Stuck in division or passing through: what happens when cells cannot satisfy the spindle assembly checkpoint. Dev Cell. 2004;7:637–651. doi: 10.1016/j.devcel.2004.09.002. [DOI] [PubMed] [Google Scholar]
- 7.Shi Q, King RW. Chromosome nondisjunction yields tetraploid rather than aneuploid cells in human cell lines. Nature. 2005;437:1038–1042. doi: 10.1038/nature03958. [DOI] [PubMed] [Google Scholar]
- 8.Romalde JL, Luz Vilariño M, Beaz R, Rodríguez JM, Díaz S, Villalba A. Evidence of retroviral etiology for disseminated neoplasia in cockles (Cerastoderma edule) J Invertebr Pathol. 2007;94:95–101. doi: 10.1016/j.jip.2006.09.007. [DOI] [PubMed] [Google Scholar]
- 9.AboElkhair M, Iwamoto T, Clark KF, McKenna P, Siah A, Greenwood SJ. Lack of detection of a putative retrovirus associated with haemic neoplasia in the soft shell clam Mya arenaria. J Invertebr Pathol. 2012;109:97–104. doi: 10.1016/j.jip.2011.10.008. [DOI] [PubMed] [Google Scholar]
- 10.Pariseau J, McKenna P, Aboelkhair M, Saint-Louis R, Pelletier E, Davidson TJ. Effects of pesticide compounds (chlorothalonil and mancozeb) and benzo[a]pyrene mixture on aryl hydrocarbon receptor, p53 and ubiquitin gene expression levels in haemocytes of soft-shell clams (Mya arenaria) Ecotoxicology. 2011;20:1765–1772. doi: 10.1007/s10646-011-0709-y. [DOI] [PubMed] [Google Scholar]
- 11.Muttray A, Reinisch C, Miller J, Ernst W, Gillis P, Losier M. Haemocytic leukemia in Prince Edward Island (PEI) soft shell clam (Mya arenaria): spatial distribution in agriculturally impacted estuaries. Sci Total Environ. 2012;424:130–142. doi: 10.1016/j.scitotenv.2012.02.029. [DOI] [PubMed] [Google Scholar]
- 12.Kelley ML, Winge P, Heaney JD, Stephens RE, Farell JH, Van Beneden RJ. Expression of homologues for p53 and p73 in the softshell clam (Mya arenaria), a naturally-occurring model for human cancer. Oncogene. 2001;20:748–758. doi: 10.1038/sj.onc.1204144. [DOI] [PubMed] [Google Scholar]
- 13.Walker CW, Böttger SA. A naturally occurring cancer with molecular connectivity to human diseases. Cell Cycle. 2008;7:2286–2289. doi: 10.4161/cc.6366. [DOI] [PubMed] [Google Scholar]
- 14.Böttger S, Jerszyk E, Low B, Walker C. Genotoxic stress-induced expression of p53 and apoptosis in leukemic clam hemocytes with cytoplasmically sequestered p53. Cancer Res. 2008;68:777–782. doi: 10.1158/0008-5472.CAN-06-0968. [DOI] [PubMed] [Google Scholar]
- 15.Holbrook LAC, Butler RA, Cashon RE, Van Beneden RJ. Soft-shell clam (Mya arenaria) p53: a structural and functional comparison to human p53. Gene. 2009;433:81–87. doi: 10.1016/j.gene.2008.11.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Siah A, McKenna P, Danger J-M, Johnson GR, Berthe FCJ. Induction of transposase and polyprotein RNA levels in disseminated neoplastic hemocytes of soft-shell clams: Mya arenaria. Dev Comp Immunol. 2011;35:151–154. doi: 10.1016/j.dci.2010.09.012. [DOI] [PubMed] [Google Scholar]
- 17.Siah A, McKenna P, Danger JM, Johnson G, Berthe FCJ. Expression of RAS-like family members, c-jun and c-myc mRNA levels in neoplastic hemocytes of soft-shell clams Mya arenaria using microsphere-based 8-plex branched DNA assay. Results Immunol. 2012;2:83–87. doi: 10.1016/j.rinim.2012.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Siah A, Delaporte M, Pariseau J, McKenna P, Berthe FCJ. Patterns of p53, p73 and mortalin gene expression associated with haemocyte polyploidy in the soft-shell clam, Mya arenaria. J Invertebr Pathol. 2008;98:148–152. doi: 10.1016/j.jip.2008.03.002. [DOI] [PubMed] [Google Scholar]
- 19.Zerbino DR, Birney E. Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18:821–829. doi: 10.1101/gr.074492.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Li Y, Hu Y, Bolund L, Wang J. State of the art de novo assembly of human genomes from massively parallel sequencing data. Hum Genomics. 2010;4:271–277. doi: 10.1186/1479-7364-4-4-271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Gnerre S, MacCallum I, Przybylski D, Ribeiro FJ, Burton JN, Walker BJ. High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci USA. 2011;108:1513–1518. doi: 10.1073/pnas.1017351108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Mi H, Thomas P. PANTHER pathway: an ontology-based pathway database coupled with data analysis tools. Methods Mol Biol. 2009;563:123–140. doi: 10.1007/978-1-60761-175-2_7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Kelley ML, Van Beneden RJ. Identification of an E3 ubiquitin-protein ligase in the softshell clam (Mya arenaria) Mar Environ Res. 2000;50:289–293. doi: 10.1016/s0141-1136(00)00087-8. [DOI] [PubMed] [Google Scholar]
- 24.Walker CW, Van Beneden RJ, Muttray AF, Böttger SA, Kelley ML, Tucker AE. p53 Superfamily proteins in marine bivalve cancer and stress biology. Adv Mar Biol. 2011;59:1–36. doi: 10.1016/B978-0-12-385536-7.00001-7. [DOI] [PubMed] [Google Scholar]
- 25.Fujiwara T, Bandi M, Nitta M, Ivanova EV, Bronson RT, Pellman D. Cytokinesis failure generating tetraploids promotes tumorigenesis in p53-null cells. Nature. 2005;437:1043–1047. doi: 10.1038/nature04217. [DOI] [PubMed] [Google Scholar]
- 26.Aylon Y, Oren M. p53: Guardian of ploidy. Mol Oncol. 2011;5:315–323. doi: 10.1016/j.molonc.2011.07.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Blain SW. Switching cyclin D-Cdk4 kinase activity on and off. Cell Cycle. 2008;7:892–898. doi: 10.4161/cc.7.7.5637. [DOI] [PubMed] [Google Scholar]
- 28.Fukasawa K. Centrosome amplification, chromosome instability and cancer development. Cancer Lett. 2005;230:6–19. doi: 10.1016/j.canlet.2004.12.028. [DOI] [PubMed] [Google Scholar]
- 29.Coverley D, Laman H, Laskey RA. Distinct roles for cyclins E and A during DNA replication complex assembly and activation. Nat Cell Biol. 2002;4:523–528. doi: 10.1038/ncb813. [DOI] [PubMed] [Google Scholar]
- 30.Matsumoto Y, Maller JL. A centrosomal localization signal in cyclin E required for Cdk2-independent S phase entry. Science. 2004;306:885–888. doi: 10.1126/science.1103544. [DOI] [PubMed] [Google Scholar]
- 31.Ferguson RL, Maller JL. Cyclin E-dependent localization of MCM5 regulates centrosome duplication. J Cell Sci. 2008;121:3224–3232. doi: 10.1242/jcs.034702. [DOI] [PubMed] [Google Scholar]
- 32.Toone WM, Aerne BL, Morgan BA, Johnston LH. Getting started: regulating the initiation of DNA replication in yeast. Annu Rev Microbiol. 1997;51:125–149. doi: 10.1146/annurev.micro.51.1.125. [DOI] [PubMed] [Google Scholar]
- 33.Tanaka S, Tak Y-S, Araki H. The role of CDK in the initiation step of DNA replication in eukaryotes. Cell Div. 2007;2:16. doi: 10.1186/1747-1028-2-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bonte DD, Lindvall CC, Liu HH, Dykema KK, Furge KK, Weinreich MM. Cdc7-Dbf4 kinase overexpression in multiple cancers and tumor cell lines is correlated with p53 inactivation. Neoplasia. 2008;10:920–931. doi: 10.1593/neo.08216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Hess GF, Drong RF, Weiland KL, Slightom JL, Sclafani RA, Hollingsworth RE. A human homolog of the yeast CDC7 gene is overexpressed in some tumors and transformed cell lines. Gene. 1998;211:133–140. doi: 10.1016/s0378-1119(98)00094-8. [DOI] [PubMed] [Google Scholar]
- 36.Montagnoli A, Moll J, Colotta F. Targeting Cell Division Cycle 7 Kinase:a new approach for cancer therapy. Clin Cancer Res. 2010;16:4503–4508. doi: 10.1158/1078-0432.CCR-10-0185. [DOI] [PubMed] [Google Scholar]
- 37.Petrakis TG, Vougas K, Gorgoulis VG. Cdc6:a multi-functional molecular switch with critical role in carcinogenesis. Transcription. 2012;3:124–129. doi: 10.4161/trns.20301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kim WY, Sharpless NE. The regulation of INK4/ARF in cancer and aging. Cell. 2006;127:265–275. doi: 10.1016/j.cell.2006.10.003. [DOI] [PubMed] [Google Scholar]
- 39.Karakaidos P, Taraviras S, Vassiliou LV, Zacharatos P, Kastrinakis NG, Kougiou D. Overexpression of the replication licensing regulators hCdt1 and hCdc6 characterizes a subset of non-small-cell lung carcinomas. Am J Pathol. 2010;165:1351–1365. doi: 10.1016/S0002-9440(10)63393-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Fornace AJ, Jackman J, Hollander MC, Hoffman Liebermann B, Liebermann DA. Genotoxic- stress- response genes and growth- arrest genes. Ann N Y Acad Sci. 1992;663:139–153. doi: 10.1111/j.1749-6632.1992.tb38657.x. [DOI] [PubMed] [Google Scholar]
- 41.Zhan Q. Gadd45a, a p53- and BRCA1-regulated stress protein, in cellular response to DNA damage. Mutat Res/Fundam Mol Mech Mutagenesis. 2005;569:133–143. doi: 10.1016/j.mrfmmm.2004.06.055. [DOI] [PubMed] [Google Scholar]
- 42.Egloff AM. Cyclin B1 and other cyclins as tumor antigens in immunosurveillance and immunotherapy of cancer. Cancer Res. 2006;66:6–9. doi: 10.1158/0008-5472.CAN-05-3389. [DOI] [PubMed] [Google Scholar]
- 43.Yu M, Zhan Q, Finn OJ. Immune recognition of cyclin B1 as a tumor antigen is a result of its overexpression in human tumors that is caused by non-functional p53. Mol Immunol. 2002;38:981–987. doi: 10.1016/s0161-5890(02)00026-3. [DOI] [PubMed] [Google Scholar]
- 44.Ochsenreither S, Majeti R, Schmitt T, Stirewalt D, Keilholz U, Loeb KR. Cyclin-A1 represents a new immunogenic targetable antigen expressed in acute myeloid leukemia stem cells with characteristics of a cancer-testis antigen. Blood. 2012;119:5492–5501. doi: 10.1182/blood-2011-07-365890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Yin XY, Grove L, Datta NS, Katula K, Long MW, Prochownik EV. Inverse regulation of cyclin B1 by c-Myc and p53 and induction of tetraploidy by cyclin B1 overexpression. Cancer Res. 2001;61:6487–6493. [PubMed] [Google Scholar]
- 46.Acquaviva C. The anaphase-promoting complex/cyclosome: APC/C. J Cell Sci. 2006;119:2401–2404. doi: 10.1242/jcs.02937. [DOI] [PubMed] [Google Scholar]
- 47.Visintin R, Prinz S, Amon A. CDC20 and CDH1: a family of substrate-specific activators of APC-dependent proteolysis. Science. 1997;278:460–463. doi: 10.1126/science.278.5337.460. [DOI] [PubMed] [Google Scholar]
- 48.Taylor S, Peters J-M. Polo and aurora kinases—lessons derived from chemical biology. Curr Opin Cell Biol. 2008;20:77–84. doi: 10.1016/j.ceb.2007.11.008. [DOI] [PubMed] [Google Scholar]
- 49.Qiao X, Zhang L, Gamper AM, Fujita T, Wang Y. APC/C-Cdh1: from cell cycle to cellular differentiation and genomic integrity. Cell Cycle. 2010;9:3904–3912. doi: 10.4161/cc.9.19.13585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Waizenegger IC, Giménez-Abián JF, Wernic D, Peters J-M. Regulation of human separase by securin binding and autocleavage. Curr Biol. 2002;12:1368–1378. doi: 10.1016/s0960-9822(02)01073-4. [DOI] [PubMed] [Google Scholar]
- 51.Ciosk R, Zachariae W, Michaelis C, Shevchenko A, Mann M, Nasmyth K. An ESP1/PDS1 complex regulates loss of sister chromatid cohesion at the metaphase to anaphase transition in yeast. Cell. 1998;93:1067–1076. doi: 10.1016/s0092-8674(00)81211-8. [DOI] [PubMed] [Google Scholar]
- 52.May GS, McGoldrick CA, Holt CL, Denison SH. The bimB3 mutation of Aspergillus nidulans uncouples DNA replication from the completion of mitosis. J Biol Chem. 1992;267:15737–15743. [PubMed] [Google Scholar]
- 53.Zou H, McGarry TJ, Bernal T, Kirschner MW. Identification of a vertebrate sister-chromatid separation inhibitor involved in transformation and tumorigenesis. Science. 1999;285:418–422. doi: 10.1126/science.285.5426.418. [DOI] [PubMed] [Google Scholar]
