(
A) Schematic view of the GLWamide genomic locus, located in scaffolds 229 of the
N. vectensis genome (v.1.0;
http://genome.jgi.doe.gov/Nemve1/Nemve1.home.html). Gene structure based on gene model prediction is shown at the top, and the revised structure based on the comparison of the
N. vectensis genome and cDNA sequences is shown at the bottom. The number above the schematic indicate genomic coordinates within the scaffold. Grey boxes are untranslated regions; aqua-colored boxes are translated regions. Blue bars show predicted translation start sites; red bars show predicted translation termination sites; yellow bars show predicted GLWamide-encoding regions described below. The region boxed in purple is missing in the currently available genome of
N. vectensis (v.1.0). Note that the revised gene structure contains two exons. (
B) cDNA and deduced amino acid sequence of the GLWamide gene. Short peptides that are either repeated or similar, and are likely to be released by proteolytic cleavages are circled in yellow. It is assumed that single or multiple acidic and basic residues become cleaved as suggested in another sea anemone
Anthopleura elegantissima (
Darmer et al., 1991;
Leviev and Grimmelikhuijzen, 1995). A purple arrowhead indicates the intron position. (
C) Amino acid sequence alignment for GLWamides I-IX. We assume that upon release of the peptide, an N-terminal glutamine is spontaneously converted into a pyroglutamic acid residue (<Glu), and a C-terminal glycine becomes amidated by an α-amidating enzyme (cf. [
Darmer et al., 1991]). The numbers on the right of each peptide sequence indicate copy numbers of each peptide that a single precursor protein is predicted to generate.