Abstract
Cephalochordates, urochordates, and vertebrates evolved from a common ancestor over 520 million years ago. To improve our understanding of chordate evolution and the origin of vertebrates, we intensively searched for particular genes, gene families, and conserved noncoding elements in the sequenced genome of the cephalochordate Branchiostoma floridae, commonly called amphioxus or lancelets. Special attention was given to homeobox genes, opsin genes, genes involved in neural crest development, nuclear receptor genes, genes encoding components of the endocrine and immune systems, and conserved cis-regulatory enhancers. The amphioxus genome contains a basic set of chordate genes involved in development and cell signaling, including a fifteenth Hox gene. This set includes many genes that were co-opted in vertebrates for new roles in neural crest development and adaptive immunity. However, where amphioxus has a single gene, vertebrates often have two, three, or four paralogs derived from two whole-genome duplication events. In addition, several transcriptional enhancers are conserved between amphioxus and vertebrates—a very wide phylogenetic distance. In contrast, urochordate genomes have lost many genes, including a diversity of homeobox families and genes involved in steroid hormone function. The amphioxus genome also exhibits derived features, including duplications of opsins and genes proposed to function in innate immunity and endocrine systems. Our results indicate that the amphioxus genome is elemental to an understanding of the biology and evolution of nonchordate deuterostomes, invertebrate chordates, and vertebrates.
The phylum Chordata comprises cephalochordates (amphioxus or lancelets), urochordates (tunicates), and vertebrates. These three groups share several characteristic features, including a notochord, dorsal hollow neural tube, gill slits, and endostyle (Fig. 1), and are thought to have diverged from a common ancestor during or prior to the Cambrian explosion. A century ago there were two major views concerning the relationship between amphioxus and vertebrates. In Gaskell’s view (Gaskell et al. 1910), amphioxus was a highly degenerate vertebrate that had lost its skeleton and complex cranial region. Others contended that amphioxus diverged before the origin of vertebrates and could shed light on the prevertebrate condition. For example, the comparative anatomist E.S. Goodrich wrote, “Amphioxus is doubtless in some respects a very specialized animal . . . yet it preserves many primitive characters” (Gaskell et al. 1910). Gradually, this view became widely accepted, and has since been supported by modern studies. Recent molecular phylogenetic studies now position cephalochordates as the basal group within the phylum Chordata, with vertebrates and urochordates diverging later (Bourlat et al. 2006; Delsuc et al. 2006).
Figure 1.
Juvenile amphioxus (Branchiostoma floridae). This specimen has not yet developed gonads. Anterior to left. (C) Buccal cirri surrounding mouth; (G) pharyngeal gill slits; (A) atrial cavity; (D) digestive diverticulum. Scale bar, 0.5 mm.
The genomes of several invertebrates have now been sequenced, including those of protostomes, such as nematodes (C. elegans Sequencing Consortium 1998; Stein et al. 2003) and insects (Adams et al. 2000; Mongin et al. 2004; The Honeybee Genome Sequencing Consortium 2006), and deuterostomes, such as the urochordate Ciona intestinalis (Dehal et al. 2002) and the sea urchin Strongylocentrotus purpuratus (Sodergren et al. 2006). Many vertebrate genome sequences are also now available, including those of humans (International Human Genome Sequencing Consortium 2001; Venter et al. 2001), mice (Waterston et al. 2002), chimpanzee (The Chimpanzee Sequencing and Analysis Consortium 2005), and several species of fish (Aparicio et al. 2002; Jaillon et al. 2004). Against this background of genomic data, the sequencing of the 520 megabase genome of the cephalochordate Branchiostoma floridae provides crucial insights into the basal chordate condition (Putnam et al. 2008). The amphioxus genome exhibits considerable synteny with the human genome, but lacks the whole-genome duplications characteristic of vertebrates (Dehal and Boore 2005). Moreover, amphioxus, although very vertebrate-like, lacks several key vertebrate innovations including neural crest, bone, and an endoskeleton (Holland et al. 1996; Shimeld and Holland 2000; Meulemans and Bronner-Fraser 2004). Here, we report an analysis of the amphioxus genome that addresses the evolution of homeobox genes, the neural crest gene network, opsins, endocrine genes, immune systems, and transcriptional enhancers. We show that, in many respects, amphioxus reflects the primitive prevertebrate condition, yet it also exhibits uniquely specialized features that arose in the half a billion years after its divergence from the rest of the chordate lineage.
Results and Discussion
Homeobox genes
Hox genes encode a family of transcription factors that are typically arranged in chromosomal clusters and are involved in rostrocaudal patterning of most bilaterally symmetrical animals. The initial description of the Hox gene cluster of amphioxus, made over a decade ago, revitalized debate concerning the prevertebrate nature of cephalochordates (Garcia-Fernàndez and Holland 1994). In contrast to the multiple Hox gene clusters of all vertebrate species examined, amphioxus possesses a single cluster. Initial studies revealed an organization remarkably like the inferred preduplication condition that must have existed in the long-extinct ancestor of all vertebrates. The amphioxus Hox gene cluster is the most prototypical deuterostome Hox cluster known, being intact, ordered, and containing the 14 Hox genes described to date (Garcia-Fernàndez and Holland 1994; Ferrier et al. 2000). Sequencing of the amphioxus genome allowed us to examine the region between the 14th Hox gene, AmphiHox14, and the neighboring pair of Evx homeobox genes. This revealed a 15th gene, AmphiHox15, which does not group strongly with any particular paralogy group in phylogenetic analysis (Fig. 2A; Supplemental Figs. S1, S2). Authenticity of this gene was confirmed by PCR amplification from genomic DNA of B. floridae with gene-specific primers. This DNA fragment was then used to isolate larger genomic clones from a cosmid library (MPMGc117J0736 and MPMGc117I2469). End sequences from these cosmids are in the expected places in the genome assembly. In addition, two-color chromosomal Fluorescence In Situ Hybridization (FISH) confirmed the tight linkage between AmphiHox15 and the rest of the Hox gene cluster (Fig. 2B–E). Thus, the complete amphioxus Hox gene cluster comprises 15 genes spanning 470 kb.
Figure 2.
The amphioxus genome contains 15 Hox genes. (A) Genomic organization of the 470-kb amphioxus Hox gene cluster and linked Evx gene pair; beneath the horizontal line are shown the two Hox15 cosmid clones analyzed. Black vertical lines represent small sequence gaps; asterisks indicate genes with an intron within the homeobox; arrows denote transcriptional orientation. FISH to amphioxus interphase nucleus (B) and metaphase chromosomes (C–E). (Red signal) Cosmid MPMGc117I2469 (AmphiHox15); (green signal) cosmid MPMGc117L0856 (AmphiHox12 and AmphiHox13). Methods as described in Castro and Holland (2002).
Following discovery of the 14th Hox gene in amphioxus, Hox14 genes were also described in coelacanth and horn shark (Powers and Amemiya 2004). Hox14, however, has been secondarily lost from all tetrapod and teleost fish species examined to date. It now becomes pertinent to ask whether any extant vertebrates, or any deuterostomes other than amphioxus, possess a Hox15 gene, and whether the ancestral condition for deuterostomes was a Hox cluster with 15 genes adjacent to an Evx family gene.
There are many other homeobox genes beside Hox genes, and most have roles in development and cell differentiation (Bürglin 2005). Determining their patterns of gene duplication and divergence can contribute to understanding the evolution of numerous developmental processes. We identified a total of 133 homeobox genes in the draft genome sequence of B. floridae (Supplemental Table S1). This figure represents a minimal estimate, because genome sequence coverage is not 100% complete and the genome possesses a high incidence of polymorphism, leading to potential confusion between loci and alleles. In our calculation of gene number, we include Pax2/5/8, which has a partial homeobox sequence, but exclude Pax1/9 and Pon, which lack homeobox sequences, but are likely derived from homeobox genes. The same rule was followed in a recent survey of the human genome, which revealed 235 human homeobox genes (and over 65 pseudogenes) (Holland et al. 2007).
Comparison to human and tunicate genomes suggests that the last common ancestor of the chordates possessed 103 homeobox genes (Supplemental Table S1). At the level of the gene family, 87 conserved homeobox gene families are shared between amphioxus and human; these have undergone very different patterns of gene duplication in the two lineages. Of these 87 families, 68 are represented by duplicate versions in the human genome, compatible with expansion by genome duplication, followed by gene loss. In contrast, just 11 are uniquely duplicated in amphioxus. These, which are usually represented as tandem duplicates, include Mnx, Hmbox, Uncx, Pou3, Lhx2/9, Hox9-15, and Iro, in addition to the previously recognized duplications of Evx, Emx, Nkx1, and Vent genes (Ferrier et al. 2001; Minguillón et al. 2002; Luke et al. 2003). There is also a duplication of the amphioxus Nedx gene, which is lost in vertebrates (this name was proposed for CG13424 to reflect its genomic position: Next to Dll homeobox). It is striking that amphioxus retains many ancient bilaterian homeobox genes that were secondarily lost either on the Olfactores (tunicates + vertebrates) lineage, the vertebrate lineage, or the tunicate lineage (Fig. 3); we found no examples of homeobox gene loss in the cephalochordate lineage.
Figure 3.
Homeobox gene loss in chordates. Homeobox gene loss has been extensive during the evolution of the Olfactores (vertebrates plus tunicates), yet in amphioxus, no homeobox gene loss has occurred since the last common ancestor of the chordates (LCAC). (Blue) ANTP class; (red) PRD class; (green) other classes. Gene losses along the tunicate branch occurred before the last common ancestor of Ciona and Oikopleura. Photo of Goodrich reproduced by permission of Department of Zoology, University of Oxford.
Expansion of opsin genes in amphioxus
Opsins are seven-pass transmembrane proteins widespread across animal phyla, typically mediating the first step in phototransduction cascades. Four opsin types are thought to have been present at the base of bilaterians: rhabdomeric (r-opsins), ciliary (c-opsin), Go-opsins, and peropsin/RGR (Arendt et al. 2004; Terakita 2005, Raible et al. 2006). We have identified 20 different opsin genes in amphioxus representing all four groups (Supplemental Fig. S3; Supplemental Table S2). The clearest r-opsin is a single melanopsin gene orthologous to Branchiostoma belcheri AmphiMop, which is expressed in rhabdomeric photoreceptor cells (Koyanagi et al. 2005). However, six of the opsin genes, which are related to AmphiOp6 and have weak similarity to r-opsins, form a divergent family duplicated specifically in amphioxus. Since these genes possess only the central proline of the r-opsin HPK fingerprint, it is unclear whether they couple to downstream phototransduction cascades. Five genes, all related to the encephalopsin/TMT-opsin subfamily, represent the c-opsin family; their genomic organization suggests expansion by tandem duplication in the cephalochordate lineage (Supplemental Fig. S4). Surprisingly we have not identified homologs of the vertebrate visual/nonvisual opsin subfamily (including rhodopsin), which are present in both vertebrates (Yokoyama 1996) and Ciona (Kusakabe et al. 2001). This gene may have been lost from amphioxus or may have arisen on the Olfactores lineage. Four opsin genes from the Go-opsin family were identified, two in tandem orientation in the genome. There is one amphioxus peropsin/RGR gene, plus three putative neuropsins related to vertebrate Opn5 that push back the date of origin of this group (Tarttelin et al. 2003). The expansion of three opsin subfamilies in the cephalochordate lineage (AmphiOp6, encephalopsin/TMT-opsins, and Go-opsins) may relate to the diversity of photoreceptor organs in amphioxus, including the frontal eye and the lamellar body, both of which are ciliary photoreceptors, and the organs of Hesse and Joseph cells, which are both rhabdomeric photoreceptors (Lacalli 2004).
Genes of the vertebrate neural crest gene network
Neural crest cells are a migratory cell population that forms at the neural plate border, migrates extensively, and differentiates into numerous differentiated cell types, including connective tissue of the craniofacial skeleton, much of the peripheral nervous system, and pigment cells (Meulemans and Bronner-Fraser 2004). These cells are responsible for many complex structures in the vertebrate head that are absent in amphioxus and tunicates. When we examined the amphioxus genome sequence to determine whether any key components of the vertebrate neural crest regulatory network are missing from amphioxus (Supplemental Table S3), we found that amphioxus has all the major transcription factors deployed in the vertebrate neural crest regulatory network. These include the epidermal ectoderm markers Dlx and AP2, the neural ectoderm markers SoxB, Zic, and islet, genes encoding neural plate border specifiers (Pax3/7 and Msx), and neural crest specifiers (Snail, SoxE, AP2, Twist, Id, FoxD, Myc), as well as downstream mediators of neural crest migration and differentiation, notably, Rho, cRet, Erbb3, Mitf, tyrosinase, and tyrosinase-related genes. Hence, the evolution of neural crest in vertebrates cannot be explained by the gain of a new regulatory gene, but must relate to the acquisition of new functions by pre-existing genes.
In contrast, two genes involved at later stages of neural development seem to be absent from amphioxus; these are homologs of c-Kit, a receptor tyrosine kinase essential for migration, survival, and differentiation of neural crest-derived melanocytes and of the gene for myelin protein P0, consistent with the notion that the glial myelin sheath is a vertebrate innovation. Interestingly, many of these neural crest specifier genes have multiple paralogs in vertebrates, presumably due to large-scale genome duplication events (Putnam et al. 2008), whereas amphioxus usually contains a single copy. These observations raise the possibility that neofunctionalization of some paralogs may have facilitated co-option of genes into a neural crest gene regulatory network.
We propose that migrating neural crest cells emerged after the split of amphioxus from tunicates and vertebrates. Therefore, it will be important to further examine neural crest-like cells in tunicates (Jeffery et al. 2004) and determine to what extent the genetic mechanisms for their specification, migration, and differentiation are evolutionarily conserved. Such studies in amphioxus and tunicates should help determine what constitutes the primitive genetic subnetwork for neural crest formation during chordate evolution.
Roots of the vertebrate endocrine system
The primary vertebrate endocrine organs (pineal gland, ovary, testis, endostyle/thyroid, brain, gut, and pituitary) have structurally simpler homologs in amphioxus. To determine which components of the endocrine system were present in the chordate ancestor, we searched the amphioxus genome for homologs of genes involved in the synthesis, secretion, and reception of endocrine signals. The relationships of these amphioxus genes to related genes in other organisms were determined by phylogenetic analyses. Support values for orthology relationships are listed in Supplemental Table S4.
Pituitary control of reproduction—A vertebrate innovation
Analysis of the amphioxus genome indicates considerable potential for neuroendocrine control of reproduction (Fig. 4A). For example, there are more than 10 homologs of the kisspeptin receptor (GPR54, also known as KISS1R), which in vertebrates is expressed in the hypothalamus and regulates release of gonadotropin-releasing hormone (GnRH). Both amphioxus and Ciona (Tello et al. 2005) have three GnRH receptors (Supplemental Table S4), which are functionally active in vitro. The presence of these receptors suggests that amphioxus also possesses the kisspeptin and GnRH peptides, even though we could not find them in the amphioxus genome; such short sequences are not readily identified by sequence analysis. Amphioxus also has a gene for thyrostimulin, which may be ancestral to three of the pituitary hormones (follicle stimulating hormone, FSH, luteinizing hormone, LH, and thyroid stimulating hormone, TSH) (Sudo et al. 2005). However, amphioxus does not appear to have homologs of several vertebrate pituitary hormones [i.e., growth hormone (GH), adrenocorticotropic hormone (ACTH), and prolactin]. Not only were they not detected in the amphioxus genome, but neither were their receptors, suggesting that the reported presence of prolactin immunoreactivity in Hatschek’s pit (amphioxus pituitary homolog) should be reexamined (Weng et al. 2006). Since genes for ACTH, GH, and prolactin are also missing in the Ciona and sea urchin genomes, these hormones may be vertebrate novelties.
Figure 4.
Predicted amphioxus endocrine pathways in which a red box identifies homologs found in the amphioxus genome. (A) Reproductive axis in which missing components include classical pituitary hormones (LH, luteinizing hormone; FSH, follicle stimulating hormone; Prl, prolactin) and their receptors. GPR54: glycoprotein receptor 54 (kisspeptin receptor); GnRH: gonadotropin-releasing hormone; E: estrogen; ER: estrogen receptor; SR: steroid receptor. (B) Thyroid metabolism showing that the thyroglobulin gene could not be identified in the amphioxus genome (hence, crossed out). CTHBP: cytosolic thyroid hormone binding protein; SIS: sodium iodide symporter; Tg: thyroglobulin; T2: diiodothyronine; T3: triiodothyronine; T4: thyroxine; TR: thyroid hormone receptor; RXR: retinoid X receptor; TRE: thyroid hormone response element. (C) Retinoid metabolism in which CRABP and AKR genes could not be identified in the amphioxus genome (hence, crossed out). ADH: alcohol dehydrogenase; AKR: aldo-keto reductase; ARAT: acylCoA:retinol acyltransferase; BCOX: beta,beta-carotene-15,15'-oxygenase; CRABP: cellular retinoic acid binding protein; CRBP: cellular retinol binding protein; CYP26: cytochrome P450–26; LRAT: lecithin:retinol acyltransferase; RALDH: retinaldehyde dehydrogenase; RDH: retinol dehydrogenase; REH: retinyl ester hydrolase; RPE65: retinal pigment epithelium-specific protein 65 kDa; RAR: retinoic acid receptor; RXR: retinoid X receptor; RARE: retinoic acid response element.
Gonad-expressed sex steroids comprise the final step in neuroendocrine control of vertebrate reproduction. Amphioxus has two genes for steroid receptors, one more closely related to vertebrate estrogen receptors (ER) and one closer to steroid receptors (SR), plus a complete set of genes for sex steroid-synthesizing enzymes: StAR, CYP11, CYP17, CYP19, 17βHSD, 3βHSD, and 5α-reductase (Supplemental Table S4). In contrast, Ciona lacks genes for steroid hormone receptors and has only a few for enzymes that synthesize steroid hormones (5α-reductase, 3βHSD, 17βHSD), suggesting tunicate-specific losses (Campbell et al. 2004). Thus, except for the pituitary hormones, amphioxus has a relatively complete endocrine pathway for control of reproduction. It may be that neuroendocrine GnRH activates the gonads without pituitary amplification. Indirect support for this hypothesis is that in vertebrates, GnRH, when synthesized locally, can act directly on the gonads (Leung et al. 2003).
Components of the stress response
The stress response in vertebrates depends on the adrenal medulla for epinephrine and on the adrenal cortex for cortisol. Adrenal cortex function in vertebrates is controlled by the brain (corticotropin-releasing hormone, CRH) and pituitary ACTH. Although amphioxus lacks a discrete adrenal gland, it possesses genes for catecholamine synthesis and for adrenergic receptors, suggesting that some cells may synthesize epinephrine. However, most of the stress-response genes (CRH, ACTH, ACTH receptor, cortisol and its receptor) are missing in amphioxus and have not been found in either Ciona or sea urchins. In addition, amphioxus also lacks a gene for the enzyme 21-hydroxylase (CYP21), necessary to synthesize cortisol precursors (Supplemental Fig. S5). The only component present in all three invertebrates is similar to the CRH receptor. However, since this receptor is also structurally related to the diuretic hormone receptor in Drosophila (Hauser et al. 2006), functional studies will be required to determine whether or not it functions in the stress pathway in amphioxus. In summary, the cortisol pathway most likely did not exist in the ancestor of chordates, but an epinephrine stress response did.
An ancient route for thyroid hormone production in amphioxus
The vertebrate thyroid gland sequesters iodine and secretes T3 and T4 hormones, which regulate metabolism, post-embryonic development, and metamorphosis in vertebrates. The amphioxus thyroid homolog, the endostyle, is primarily involved in mucous production for feeding, but it does incorporate iodine (Fredriksson et al. 1984), suggesting that it also may synthesize thyroid hormones. This suggestion has gained support from chromatography experiments with radiolabeled iodine (Covelli et al. 1960; Tong et al. 1962) from immunocytochemistry analyses with antibodies to vertebrate hormones (Fang and Weng 2002), and from the induction of amphibian metamorphosis by implantation of dried amphioxus endostyles (Sembrat 1953). The amphioxus genome sequence further strengthens this claim, since we identified a thyroid hormone receptor (TR) gene and several genes for components of the thyroid hormone production system: a sodium iodide symporter (SIS), a thyroid peroxidase (TPO), deiodinases that convert T4 to active T3, and a binding protein (CTHBP) for cytosolic transport of TH (Fig. 4B). In contrast, genes for thyroglobulin and a thyroid-releasing hormone receptor (TRH) are missing. In vertebrates, hormones secreted by the brain and pituitary control production of T3 and T4. Although a gene for the pituitary thyroid-stimulating hormone is not found in the amphioxus genome, as noted above, there are genes for thyrostimulin (GPA2/GPB5) and its receptor, suggesting that a pathway for metabolic control was present in the ancestral chordate.
Insulin family members
The insulin superfamily includes insulin, insulin-like growth factors (IGFs), and relaxins. In vertebrates, insulin is produced by the pancreas, a gut derivative, and is essential for regulating glucose metabolism. IGF is critical for growth, and relaxins are associated with parturition among other roles. Amphioxus lacks a morphologically defined pancreas, but particular cells in the gut express an insulin-like peptide (Chan et al. 1990; Holland et al. 1997). In addition to this gene, we identified five other insulin/relaxin-related genes and their putative receptors in the amphioxus genome. Three of the insulin/relaxin-related genes are linked in the genome. Phylogenetic analysis of neighboring genes revealed synteny of four of the six insulin-like peptides with human paralogons hosting the insulin/IGF and relaxin-type genes (Supplemental Fig. S6; Supplemental Table S5). The organization of the corresponding genomic region in Ciona is similar (Olinski et al. 2006), indicating that insulin and relaxin-type precursor genes were positioned in one chromosomal segment in the ancestral chordate. Genes for other protein hormones present in amphioxus are shown in Supplemental Table S4.
Nuclear receptors
Nuclear receptors (NRs) bind a wide range of hormones and transduce their signals intracellularly. Amphioxus possesses genes for a total of 33 nuclear receptors. Eighteen have clear orthologs among the 48 human NRs (Supplemental Table S6). These genes include two related to the estrogen and steroid receptors (ER, SR) and one each for thyroid hormone receptor (TR) and retinoic acid receptor (RAR). Moreover, the ligand-binding and the DNA-binding domains (CEGCK) of amphioxus SR and ER are more similar to the ER type than to the classical steroid type. A comparison of amphioxus, sea urchin, and Ciona nuclear receptors suggests a loss of several receptors in tunicates, including genes related to the ER and SR receptors and to the NR2E genes (TLL, PNR, and a novel duplicated amphioxus gene). TLL and PNR are involved in the development of the vertebrate brain and retina, and all three NRs may be important in amphioxus with its frontal eye, pineal eye, and multiple photoreceptors.
The nuclear receptors most studied in amphioxus are RAR and RXR, which mediate the roles of retinoic acid during amphioxus development through an RAR:RXR heterodimer (Escriva et al. 2002; Yagi et al. 2003; Schubert et al. 2006). Not surprisingly, amphioxus also possesses genes for retinoic acid synthesis, storage, and degradation (Fig. 4C). These include an ortholog of cellular retinol binding protein (CRBP), but not of its retinoic acid equivalent (CRABP). Although a gene named CRABP has been described in amphioxus (Jackman et al. 2004), it does not group robustly with vertebrate CRABP. Thus, amphioxus lacks some genes involved in retinoid metabolism in vertebrates (such as CRABP), but others have duplicated independently (e.g., ALDH1s/RALDHs and REHs) (Supplemental Table S6).
There are a few more examples for independent expansions of amphioxus endocrine genes. For example, genes in the LXR/FXR group, which control lipid/cholesterol metabolism in vertebrates have been duplicated in amphioxus as well as the genes for the 3βHSD enzymes, which are involved in vertebrate steroidogenesis, and for several SDR enzymes, which are important for processing and detoxification of a wide variety of biological compounds including steroids, alcohols, sugars, aromatic chemicals, and xenobiotics (Supplemental Tables S4, S6). These lineage-specific duplications in amphioxus of transcription factors (NRs of the LXR/FXR group) and enzymes (SDR, 3βHSD, ALDH1s/RALDHs, and REHs) involved in both metabolism and detoxification of various molecules represent an important example for a unique feature that has evolved specifically in the cephalochordate lineage. There are two (not mutually exclusive) evolutionary scenarios that might explain these amphioxus-specific duplications: (1) the signaling networks controlled by the molecules produced directly or indirectly by these duplicated genes have been extensively elaborated in the amphioxus lineage (for example steroid hormone synthesis) and/or (2) novel intracellular protection mechanisms against various kinds of toxins and xenobiotics derived from marine microorganisms or the surrounding environment might have evolved specifically in the amphioxus lineage, possibly as a result of the sedentary, filter feeding lifestyle of amphioxus adults.
Adaptive and innate immunity
Defense against pathogens is a basic requisite of all animals. The immune system, which performs this function, can be broadly divided into two separate, but closely integrated components: germ-line-encoded innate immunity and somatically modified adaptive immunity. Innate immunity in vertebrates relies on a relatively small number of pattern recognition receptors, notably the extracellular TOLL-like receptors (TLRs) and intracellular nucleotide-binding oligomerization domain-like receptors (NLRs) (Boraschi and Tagliabue 2006; Werts et al. 2006; Delbridge and O’Riordan 2007). TLRs consist of a TIR domain combined with leucine-rich repeats (LRRs); NLRs consist of a NACHT domain and LRRs (Martinon and Tschopp 2004; Inohara et al. 2005) (Fig. 5). A search of the amphioxus genome for homologs of innate immunity genes reveals the expansion of receptor gene families in an intriguing parallel to those discovered in the sea urchin genome (Rast et al. 2006) (Fig. 5; Supplemental Table S7). In the amphioxus genome, ab initio gene prediction yields 28 TLR gene models, compared with 10–15 in vertebrates, nine in Drosophila, and over 200 in sea urchin. The amphioxus number increases to 72 if additional ab initio models are considered. Variation in the number, length, and sequence of the LRR domains suggests that the repertoire of receptor shapes in amphioxus far exceeds that reported in other systems. Similar expansions are seen in several families of genes encoding proteins involved in apoptosis, a regulatory strategy in invertebrate host defense as well as in both innate and adaptive immunity in vertebrates.
Figure 5.
Expansion of innate immunity genes in amphioxus (right) compared with the human system (left). Novel domain architectures and (predicted) varied receptor conformations found in amphioxus proteins are expected to increase the specificity of innate immunity in amphioxus. We propose that some TLR-like proteins are localized in the endosome, as also seen in mammals. Domains: TIR (Toll/interleukin-1 receptor) domain, NACHT (NAIP, CIITA, HET-E and TP1), LRR (leucine-rich repeat), CARD (caspase recruitment domain), PYRIN (N-terminal domain of protein pyrin), DD (death domain), DED (death effector domain), SAM (sterile alpha motif) domain, BIR (Baculovirus Inhibitor of apoptosis Repeat).
In addition to TLRs and NLRs, other innate immunity genes are present in the amphioxus genome (Supplemental Tables S7–S9). The complement system of vertebrates is a highly sophisticated defense system that functions to opsonize pathogens through the covalent binding of the central component C3, through a unique intramolecular thioester bond (Law and Dodds 1997; Janssen et al. 2005). Most complement-system gene groups are highly conserved between amphioxus, Ciona, and vertebrates (Nonaka and Kimura 2006). In contrast, the sea urchin genome lacks the MASP/C1r/C1s and TCC (terminal complement component) genes. One unexpected novelty in the amphioxus genome is the presence of more than 39 C1q-like genes, of which 25 are encoded on a single scaffold, indicating an expanded complement-recognition repertoire (Supplemental Table S9).
Adaptive immunity has traditionally been viewed as a jawed vertebrate adaptation in which somatic reorganization of immunoglobulin (Ig) genes in B-lymphocytes or T-cell antigen receptor (TCR) genes in T-lymphocytes, results in the expression of cell surface receptors with unique molecular specificities. In lampreys and hagfish, genetic recombination of sequence elements encoding an LRR-containing receptor, termed a VLR (variable lymphocyte receptor), gives rise to an alternative form of adaptive immunity (Pancer and Cooper 2006). Beside the TLRs and NLRs mentioned above, the amphioxus genome contains at least 406 gene models predicted to encode proteins with multiple LRR modules that may play a role in host defense (Supplemental Table S10). Some of these encode cell surface proteins, with a single LRR domain capped by N- and C-terminal LRRs, a feature of VLRs. Some LRR-containing gene products also encode immunoglobulin (Ig) domains. In jawed vertebrates, the rearrangement mechanism is dependent on the Recombination Activating Genes (RAG) 1 and 2, which are tightly linked in vertebrates. A homologous RAG1/2-like closely linked gene pair is present in the purple sea urchin genome; however, its function is unknown (Fugmann et al. 2006). Only a RAG1-like segment is identified in the amphioxus genome; the adjacent genomic sequences are unresolved, precluding accurate annotation. Many other gene products, mostly recruited from ancient DNA damage repair pathways, also function in the diversification of immune receptors in jawed vertebrates including XRCC4, DNA ligase IV, UNG, RAD18, Artemis, Cernunnos, Pol-η, Pol-ι, Pol-θ, REV1, and KU70-80 (Jung and Alt 2004; Litman et al. 2005). Homologs of these genes are present in both amphioxus and sea urchin; however, neither immunoglobulin nor TCR genes can be identified in either of these species, nor can homologs of MHC I and MHC II, the functional partners of TCR α/β (Supplemental Table S11). Terminal deoxynucleotide transferase (TdT), which catalyzes nontemplated addition of nucleotides at double-strand DNA breaks and accounts for a high degree of somatic variation in Igs and TCRs (Jung and Alt 2004), is a vertebrate homolog of DNA-polμ; our analyses reveal that amphioxus diverged before this duplication. Alternative processing likely gives rise to functionally distinct forms (data not shown). The amphioxus genome assembly does not support a model for full-length APOBEC (I-III) enzyme(s), which effects antiviral defenses in vertebrates or activation-induced cytidine deaminase (AICDA), which is integral to somatic hypermutation, Ig class switching, and gene conversion (Fugmann and Schatz 2002). However, only a short sequence exhibiting identity to APOBEC3 and two regions exhibiting weak sequence identity to the N- and C-terminal regions of human AICDA were identified. Although homologs of Ig and TCR have only been found in jawed vertebrates, highly polymorphic genes termed VCBP (variable region-containing chitin binding protein) have been identified in amphioxus and Ciona, but not in sea urchin (Cannon et al. 2002). Structures similar to the V (variable) regions of VCBPs are attractive candidates for ancestral forms of rearranging Ig-type immune receptors. Collectively, these data suggest that the ancestor of all chordates had a complex diversified innate immune system. Key effector and intermediate molecules, with roles in germ-line change and somatic variation associated with adaptive immunity were also present at the base of the chordates and were co-opted for adaptive immunity in the jawed vertebrate lineage and VLR diversification in jawless vertebrates (Litman et al. 2005; Nagawa et al. 2007; Rogozin et al. 2007).
Conserved noncoding sequences
Although genome-scale comparisons between mammals and teleost fish have revealed up to 3100 conserved noncoding sequences, most of which function as tissue-specific enhancers (Woolfe et al. 2005; Pennacchio et al. 2006), at greater phylogenetic distances, no conservation outside of coding sequences and microRNAs (Prochnik et al. 2007) has previously been identified. However, our comparison of the amphioxus and human genomes revealed a number of conserved noncoding elements (CNEs) near coding sequences, some of which directed tissue-specific expression in both amphioxus and the mouse. Comparisons of the amphioxus and human genomes yielded 27,880 aligned regions with more than 60% identity over 50 bp. However, most of these (99.7%) either overlapped transcribed or repetitive sequences in the human and/or amphioxus genomes or were not conserved with other vertebrate species and were, therefore, excluded from further analysis. We designated the remaining 77 alignments as putative chordate CNEs. Of these, 16 overlap with or are immediately adjacent to the 3′ or 5′ untranslated regions (UTRs) of human genes, and are, therefore, likely conserved UTRs. Four CNEs are adjacent to exons, and likely represent conserved splice enhancers. A single CNE overlapped a highly conserved microRNA (MIRN10B, adjacent to the human HOXD4 gene). The remaining 56 were of unattributable function.
We tested the human orthologs of eight of these 56 CNE's for activity in a mouse enhancer assay (Pennacchio et al. 2006). Four reproducibly drove expression of a lacZ reporter gene in mice. One element, adjacent to the T-domain-containing transcriptional repressor TBX3 in human, drove expression in the limb and tail (4/9 embryos), which are two of the domains that normally express the gene (Chapman et al. 1996; Gibson-Brown et al. 1996). A second element, adjacent to the dominant-negative helix-loop-helix inhibitor of DNA-binding transcription factor ID1, drove expression in the branchial arches, eyes, spine, midbrain, forebrain, and ear (7/8 embryos) (Supplemental Tables S12, S14, S15). The remaining two CNEs with enhancer activity are adjacent to the paralogous zinc finger transcription factors ZNF503 and ZNF703 in human (Supplemental Fig. 8). Orthologs of these two genes occur in all sequenced vertebrate genomes, and correspond to a single locus in the amphioxus genome, amphioxus ZNF503/703. The single amphioxus CNE and the two corresponding human sequences were tested for enhancer activity in both mouse and amphioxus (Fig. 6; Supplemental Tables S12–S16). Although expression of the endogenous ZNF703 gene in the mouse has not been determined, the endogenous ZNF503 (Zfp503, Nolz-1) is very widely expressed in late mouse embryos (Chang et al. 2004), while the native amphioxus ZNF503/703 gene is expressed in the developing central nervous system, somites, notochord, and pharyngeal endoderm (Fig. 6). LacZ reporter constructs of the conserved CNEs from all three genes showed tissue-specific expression in both amphioxus and the mouse (Fig. 6). Taken together, the computational and experimental results are consistent with shared ancestry of putative enhancer function, and suggest subfunctionalization of these noncoding elements following duplication in the vertebrate lineage. This represents the largest evolutionary distance (∼520 million years) identified to date, across which both the sequence and function of tissue-specific, cis-regulatory enhancers have been conserved.
Figure 6.
Amphioxus-human conserved noncoding sequences function as enhancers. (A–G) LacZ reporter gene expression driven by the conserved noncoding element (CNE) near amphioxus ZNF503/703 in mouse (A–C) and amphioxus (D–G). (H–L) Expression driven by the orthologous CNE near human ZNF503 in mouse (H–J) and amphioxus (K,L). (M) Expression of the endogenous amphioxus ZNF503/703 gene in the anterior central nervous system (n), somites (s), pharynx (p), and weakly in the notochord. (N–T) Expression of the orthologous CNE near ZNF703 in mouse (N–P) and amphioxus embryos (Q–T). B, I, and O show dorsal views of the mouse hindbrain; all other views are lateral, with anterior to the left. Mouse embryos shown at 9.5 d post conception. The mouse ZNF503 construct directs expression to the forebrain, midbrain, hindbrain, eye, branchial arches. Blue arrowheads show expression in the eye directed by the two human elements, but not the amphioxus element. The red arrowhead shows expression in a lateral domain that is directed by the amphioxus element, but not the human elements. Purple arrowheads indicate sites of expression observed for all three CNEs. The amphioxus ZNF503/703 CNE construct expresses at a high level in the amphioxus notochord (n) and somites(s) and at a lower level in the ectoderm (e) and central nervous system (nc). The embryo in D has an abnormal head, presumably due to injection trauma. Ectopic expression in necrotic cells in the gut lumen as in G, L, and S is common in amphioxus embryos injected with reporter constructs. Twenty-four-hour embryos (D,E,Q–T), 30 h larvae (K–M), 36 h larvae (F,G).
Reconstructing the deuterostome ancestor
In the half-billion years since the amphioxus and vertebrate lineages split, the amphioxus genome has clearly undergone much less change than the genomes of other chordates. The ancestor of the Olfactores (tunicates + vertebrates) lost several regulatory genes, after which the vertebrate lineage underwent two whole-genome duplication events, while the tunicates experienced many additional gene losses. Moreover, while cis-regulatory enhancers are apparently not conserved between the Ciona and vertebrate genomes, amphioxus and humans share several. Thus, although there have been some independent gene duplications in the cephalochordate lineage (e.g., within opsin genes and innate immunity genes), the amphioxus genome appears to be closer than those of any other extant organism to the genome of the ancestral chordate.
Although amphioxus has no neural crest and, therefore, no homologs of vertebrate organs with major neural crest components, such as the adrenal gland, the last common ancestor of the chordates clearly possessed all of the known genes required for generating neural crest. What happened at the base of the vertebrates was evidently the co-option of pre-existing genes into the network specifying the cells at the edges of the neural plate and the evolution of new genes specific for some neural crest derivatives such as melanocytes. Similarly, amphioxus possesses homologs of vertebrate endocrine organs such as the thyroid and pituitary that are primarily epithelial derivatives (but not those with major contributions from neural crest), and these organs express the same upstream transcription factors as their vertebrate counterparts. The genes that are clearly vertebrate novelties include several coding for the downstream products of endocrine organs. A key vertebrate innovation is the adaptive immune system. In its absence, invertebrates, including amphioxus, rely on innate immunity for defense against pathogens. It is interesting that some gene families involved in innate immunity (e.g., Toll receptors) are greatly expanded in the amphioxus genome. Together with data from the fossil record (Chen et al. 1995) and embryology (Yu et al. 2007), our data indicate that the ancestral chordate was similar in many respects to a modern amphioxus: a small, free-living animal, with segmental muscles derived from somites, a notochord, a dorsal nerve cord, a robust innate immune system, and a basic set of endocrine functions.
Looking deeper in time, we compared the genomes of amphioxus B. floridae, the tunicate C. intestinalis, and the echinoderm S. purpuratus. We found that the 814-mb sea urchin genome and the 520-mb amphioxus genome are more representative of that of the ancestral deuterostome than is the 155-mb genome of C. intestinalis, which has undergone considerable secondary gene loss. Echinoderms and hemichordates (the free-living enteropneusts plus the sessile pterobranchs) are sister groups, and together constitute the Ambulacraria, which, with the Xenoturbellida, constitute the nonchordate deuterostome lineage (Bourlat et al. 2006). Hemichordate genomes have not yet been sequenced, and it is uncertain whether enteropneusts or pterobranchs are basal within the hemichordates (Cameron et al. 2000). Moreover, S. purpuratus is derived within the echinoderms. Therefore, for gene families in which the amphioxus genome differs considerably from those of protostomes (e.g., TLF genes), it is difficult to infer the ancestral deuterostome condition. Nevertheless, by integrating the findings of molecular phylogenetics, genomic data from chordates and an echinoderm, and analyses of axial patterning systems in hemichordates, amphioxus, and vertebrates, we can evaluate alternative theories concerning deuterostome evolution. These data imply that the evolutionary scenarios of Garstang, Romer, and Berrill (for review, see Lacalli 2004), prevalent for much of the 20th century, are highly unlikely. The deuterostome ancestor was almost certainly not a sessile lophophorate-like animal that gave rise to an ancestral ascidian. In turn, this larval ascidian did not give rise to the ancestor of amphioxus and the vertebrates by paedomorphosis. It is much more likely that the ancestral deuterostome was a free-living, bilateral, worm-like animal with a series of pharyngeal gill slits. Enteropneust hemichordates, amphioxus, and even vertebrates retain body plans comparable to this inferred ancestral deuterostome, albeit each with its own specializations, while the body plans of echinoderms and tunicates are considerably more derived.
Methods
The starting point for these analyses was the draft assembly of the amphioxus genome sequence (http://genome.jgi-psf.org/Brafl1/Brafl1.home.html), in which initial gene and protein predictions had been made using the JGI annotation pipeline as described by Putnam et al. (2008). For each gene family analyzed in the present study, the assembly was searched using tBLASTn, and for each putative match, the deduced amphioxus protein sequences were retrieved. Gene annotations were corrected manually, taking into account additional cDNA information and alignment to homologs from other species. In the few cases when previous genomic clone data differed from genome sequence data, putative sequencing or assembly errors were annotated on the genome and corrected sequences used in further analysis. Gene models representing allelic pairs were identified using a combination of similarity of predicted peptide sequence and gene neighborhood context. For phylogenetic analysis, predicted amphioxus proteins were aligned with other family members and with outgroup proteins using ClustalW or MUSCLE, edited manually, and analyzed using neighbor-joining, maximum parsimony, and maximum likelihood algorithms, implemented using PHYLIP, MEGA 3.0, or PhyML. Branch support of the phylogenies was assessed by bootstrap analyses. Isolation of Hox15 cosmid clones and fluorescent in situ hybridization to chromosomes followed published protocols (Castro and Holland 2002).
Computational detection of conserved noncoding sequences
Whole-genome alignments of the amphioxus genome scaffold sequences and human genome reference sequence (NCBI v35) were performed with the VISTA pipeline infrastructure (Frazer et al. 2004) based on efficient combination of glocal and local alignment methods. We first obtained a map of large blocks of conserved synteny between the two species by applying ShuffleLAGAN global chaining algorithm (Brudno et al. 2003) to local alignments produced by translated BLAT (Kent 2002). Then, in each syntenic block, we applied Shuffle-LAGAN again to obtain a more fine-grained map of small-scale rearrangements, such as inversions. Finally, we used VISTA with a similarity cut-off of 60% identity over 50 bp to identify candidate-conserved noncoding sequences. The human genome coordinates of all alignments were used in the UCSC table browser to identify and remove: (1) elements overlapping known genes, human mRNAs, spliced ESTs, Ensembl genes, human repeats, simple repeats, retroposed genes, and high-copy number regions in the human genome (defined by more than one overlapping self-chain hit); (2) elements not overlapping the “most conserved” track in the UCSC genome browser; (3) elements not conserved between human and at least one of the fish, chicken, or frog genomes. The amphioxus genome coordinates of all remaining alignments were used to exclude elements with BLASTX alignments to proteins encoded by the amphioxus genome.
Mouse transgenic enhancer assay
Primers were designed to flank the conserved element by several hundred base pairs using primer332 (Supplemental Tables S12, S13). Enhancer element constructs were PCR amplified from human genomic DNA (BD Biosciences) and directionally cloned into the pENTR/D-TOPO vector (Invitrogen). All inserts were sequence validated and transferred into a Hsp68-LacZ vector33 encompassing a Gateway cassette using LR recombination (Invitrogen). Generation of transgenic mice and embryo staining were as previously described (Pennacchio et al. 2006) in accordance with protocols approved by the Lawrence Berkeley National Laboratory. Transgenic mouse DNA was prepared from yolk sacs that were carefully dissected from embryos, boiled for 5 min in lysis solution (50 mM Tris HCl at pH 8.0, 20 mM NaCl, 1 mM EDTA at pH 8.0, 1% SDS), and then screened by PCR with LacZ primers (LacZ-fwd: 5′-TTTCCATGTTGCCACTCGC; LacZ-Rev: 5′-AACGGCTTGCCGTTCAGCA) for positive transgenic animals. Images were obtained using a Leica MZ16 microscope and DC480 camera, cropped, and level adjusted with Adobe Photoshop. An element was classed as a positive enhancer if at least three independent transgenic embryos showed the same expression pattern.
Amphioxus transgenic enhancer assay
AmPrimers were designed to flank the conserved element by several hundred base pairs using primer332, and an additional sequence containing a HindIII restriction site was added to the 5′ end of each primer. Enhancer element constructs were PCR amplified from amphioxus (B. floridae) genomic DNA. Amplified fragments were digested using HindIII and ligated into HindIII digested pASCID-FoxD vector35 using a Fast-link DNA ligation kit (Epicentre Biotechnologies). Injection of constructs into amphioxus embryos was done as previously described (Holland and Yu 2004; Beaster-Jones et al. 2007).
Acknowledgments
This work was funded by grants from the National Science Foundation, USA (L.Z.H.), National Institutes of Health, USA (G.W.L.), the Wellcome Trust (P.W.H.H.), BBSRC, UK (D.E.K.F., T.B., P.W.H.H.), and MEXT, Japan (H.S., N.S.), Center for Applied Genomics MSMT and Academy of Sciences, Czech Republic (Z.K.), and the 21st Century and Global COEs at Kyoto University (N.S.), from Ministerio de Educación y Ciencia, Spain (J.G.-F.), MIUR Italy, FIRB 2001 BAU01WAFY (S.C., M.P.), JSPS, Japan (H.S.), and MENRT, CNRS and CRESCENDO, a European Union Integrated Project of FP6 (V.L., M.S.).
Footnotes
[Supplemental material is available online at www.genome.org.]
Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.073676.107.
References
- Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., et al. The genome sequence of Drosophila melanogaster. Science. 2000;287:2185–2195. doi: 10.1126/science.287.5461.2185. [DOI] [PubMed] [Google Scholar]
- Aparicio S., Chapman J., Stupka E., Putnam N., Chia J.-m., Dehal P., Christoffels A., Rash S., Hoon S., Smit A., et al. Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes. Science. 2002;297:1301–1310. doi: 10.1126/science.1072104. [DOI] [PubMed] [Google Scholar]
- Arendt D., Tessmar-Raible K., Snyman H., Dorresteijn A.W., Wittbrodt J. Ciliary photoreceptors with a vertebrate-type opsin in an invertebrate brain. Science. 2004;306:869–871. doi: 10.1126/science.1099955. [DOI] [PubMed] [Google Scholar]
- Beaster-Jones L., Schubert M., Holland L.Z. Cis-regulation of the amphioxus engrailed gene: Insights into evolution of a muscle-specific enhancer. Mech. Dev. 2007;124:532–542. doi: 10.1016/j.mod.2007.06.002. [DOI] [PubMed] [Google Scholar]
- Boraschi D., Tagliabue A. The interleukin-1 receptor family. Vitam. Horm. 2006;74:229–254. doi: 10.1016/S0083-6729(06)74009-2. [DOI] [PubMed] [Google Scholar]
- Bourlat S., Juliusdottir T., Lowe C.J., Freeman R., Aronowicz J., Kirschner M., Lander E.S., Thorndyke M., Nakano H., Kohn A.B., et al. Deuterostome phylogeny reveals monophyletic chordates and the new phylum Xenoturbellida. Nature. 2006;444:85–88. doi: 10.1038/nature05241. [DOI] [PubMed] [Google Scholar]
- Brudno M., Malde S., Polikakov A., Do C.B. Glocal alignment: Finding rearrangements during alignment. Bioinformatics. 2003;19:i54–i62. doi: 10.1093/bioinformatics/btg1005. [DOI] [PubMed] [Google Scholar]
- Bürglin T.R. Encyclopedia of molecular cell biology and molecular medicine. Wiley-VCH, Verlag GmbH & Co; Weinheim, Germany: 2005. Homeodomain proteins; pp. 179–222. [Google Scholar]
- C. elegans Sequencing Consortium Genome sequence of the nematode C. elegans: A platform for investigating biology. Science. 1998;282:2012–2018. doi: 10.1126/science.282.5396.2012. [DOI] [PubMed] [Google Scholar]
- Campbell R.K., Satoh N., Degnan B.M. Piecing together evolution of the vertebrate endocrine system. Trends Genet. 2004;20:359–366. doi: 10.1016/j.tig.2004.06.005. [DOI] [PubMed] [Google Scholar]
- Cameron C.B., Garey J.R., Swalla B.J. Evolution of the chordate body plan: New insights from phylogenetic analysis of deuterostome phyla. Proc. Natl. Acad. Sci. 2000;97:4469–4474. doi: 10.1073/pnas.97.9.4469. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cannon J.P., Haire R.N., Litman G.W. Identification of diversified genes that contain immunoglobulin-like variable regions in a protochordate. Nat. Immunol. 2002;3:1200–1207. doi: 10.1038/ni849. [DOI] [PubMed] [Google Scholar]
- Castro L.F., Holland P.W. Fluorescent in situ hybridisation to amphioxus chromosomes. Zoolog. Sci. 2002;19:1349–1353. doi: 10.2108/zsj.19.1349. [DOI] [PubMed] [Google Scholar]
- Chan S.J., Cao Q.P., Steiner D.F. Evolution of the insulin superfamily: Cloning of a hybrid insulin/insulin-like growth factor cDNA from amphioxus. Proc. Natl. Acad. Sci. 1990;87:9319–9323. doi: 10.1073/pnas.87.23.9319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chang C.W., Tsai C.W., Wang H.F., Tsai H.C., Chen H.Y., Tsai T.F., Takahashi H., Li H.Y., Fann M.J., Yang C.W., et al. Identification of a developmentally regulated striatum-enriched zinc-finger gene, Nolz-1, in the mammalian brain. Proc. Natl. Acad. Sci. 2004;101:2613–2618. doi: 10.1073/pnas.0308645100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chapman D.L., Garvey N., Hancock S., Alexiou M., Agulnik S.I., Gibson-Brown J.J., Cebra-Thomas J., Bollag R.J., Silver L.M., Papaioannou V.E. Expression of the T-box family genes, Tbx1-Tbx5, during early mouse development. Dev. Dyn. 1996;206:379–390. doi: 10.1002/(SICI)1097-0177(199608)206:4<379::AID-AJA4>3.0.CO;2-F. [DOI] [PubMed] [Google Scholar]
- Chen J.Y., Dzik J., Edgecombe G.D., Ramskold L., Zhou G.Q. A possible early Cambrian chordate. Nature. 1995;377:720–722. [Google Scholar]
- The Chimpanzee Sequencing and Analysis Consortium Initial sequence of the chimpanzee genome and comparison with the human genome. Nature. 2005;437:69–87. doi: 10.1038/nature04072. [DOI] [PubMed] [Google Scholar]
- Covelli I., Salvatore G., Sena L., Roche J. Sur la formation d’hormones thyroidiennes et de leurs précurseurs par Branchiostoma lanceolatum. C. R. Soc. Biol. Paris. 1960;154:1165–1169. [PubMed] [Google Scholar]
- Dehal P., Boore J.L. Two rounds of genome duplication in the ancestral vertebrate. PLoS Biol. 2005;3:e314. doi: 10.1371/journal.pbio.0030314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dehal P., Satou Y., Campbell R.K., Chapman J., Degnan B., De Tomaso A., Davidson B., Di Gregorio A., Gelpke M., Goodstein D.M., et al. The draft genome of Ciona intestinalis: Insights into chordate and vertebrate origins. Science. 2002;298:2157–2167. doi: 10.1126/science.1080049. [DOI] [PubMed] [Google Scholar]
- Delbridge L.M., O’Riordan M.X. Innate recognition of intracellular bacteria. Curr. Opin. Immunol. 2007;19:10–16. doi: 10.1016/j.coi.2006.11.005. [DOI] [PubMed] [Google Scholar]
- Delsuc F., Brinkmann H., Chourrout D., Philippe H. Tunicates and not cephalochordates are the closest living relatives of vertebrates. Nature. 2006;439:965–968. doi: 10.1038/nature04336. [DOI] [PubMed] [Google Scholar]
- Escriva H., Holland N.D., Groenmeyer H., Laudet V., Holland L.Z. The retinoic acid signaling pathway regulates anterior/posterior patterning in the nerve cord and pharynx of amphioxus, a chordate lacking neural crest. Development. 2002;129:2905–2916. doi: 10.1242/dev.129.12.2905. [DOI] [PubMed] [Google Scholar]
- Fang Y.Q., Weng Y.Z. Neuroendocrine regulation of structure and function in the endostyle of amphioxus, Branchiostoma belcheri. Prog. Nat. Sci. 2002;12:904–908. [Google Scholar]
- Ferrier D.E.K., Minguillón C., Holland P.W.H., Garcia-Fernàndez J. The amphioxus Hox cluster: Deuterostome posterior flexibility and Hox14. Evol. Dev. 2000;2:284–293. doi: 10.1046/j.1525-142x.2000.00070.x. [DOI] [PubMed] [Google Scholar]
- Ferrier D.E.K., Minguillón C., Cebriá C., Garcia-Fernàndez J. Amphioxus EVX genes: Implications for the evolution of the midbrain-hindbrain boundary and the chordate tailbud. Dev. Biol. 2001;237:270–281. doi: 10.1006/dbio.2001.0375. [DOI] [PubMed] [Google Scholar]
- Frazer K.A., Pachter L., Poliakov A., Rubin E.M., Dubchak I. VISTA: Computational tools for comparative genomics. Nucleic Acids Res. 2004;32:W273–W279. doi: 10.1093/nar/gkh458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fredriksson G., Ericson L.E., Olsson R. Iodine binding in the endostyle of larval Branchiostoma lanceolatum (Cephalochordata) Gen. Comp. Endocrinol. 1984;56:177–184. doi: 10.1016/0016-6480(84)90028-5. [DOI] [PubMed] [Google Scholar]
- Fugmann S.D., Schatz D.G. Immunology. One AID to unite them all. Science. 2002;295:1244–1245. doi: 10.1126/science.1070023. [DOI] [PubMed] [Google Scholar]
- Fugmann S.D., Messier C., Novack L.A., Cameron R.A., Rast J.P. An ancient evolutionary origin of the Rag1/2 gene locus. Proc. Natl. Acad. Sci. 2006;103:3728–3733. doi: 10.1073/pnas.0509720103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garcia-Fernàndez J., Holland P.W.H. Archetypal organization of the amphioxus Hox gene cluster. Nature. 1994;370:563–566. doi: 10.1038/370563a0. [DOI] [PubMed] [Google Scholar]
- Gaskell W.H., MacBride E.W., Starling E.H., Goodrich E.S., Gadow H., Smith Woodward A., Dendy A., Lankester E.R., Chalmers Mitchell P., Gardiner J.S., et al. Discussion on the origin of vertebrates. Proc. Linn. Soc. London. 1910;122:9–50. [Google Scholar]
- Gibson-Brown J.J., Agulnik S.I., Chapman D.L., Alexiou M., Garvey N., Silver L.M., Papaioannou V.E. Evidence of a role for T-box genes in the evolution of limb morphogenesis and the specification of forelimb/hindlimb identity. Mech. Dev. 1996;56:93–101. doi: 10.1016/0925-4773(96)00514-x. [DOI] [PubMed] [Google Scholar]
- Hauser F., Williamson M., Cazzamali G., Grimmelikhuijzen C.J. Identifying neuropeptide and protein hormone receptors in Drosophila melanogaster by exploiting genomic data. Brief. Funct. Genomic Proteomics. 2006;4:321–330. doi: 10.1093/bfgp/eli003. [DOI] [PubMed] [Google Scholar]
- Holland L.Z., Yu J.K. Cephalochordate (amphioxus) embryos: Procurement, culture, and basic methods. Methods Cell Biol. 2004;74:195–215. doi: 10.1016/s0091-679x(04)74009-1. [DOI] [PubMed] [Google Scholar]
- Holland N.D., Panganiban G., Henyey E.L., Holland L.Z. Sequence and developmental expression of AmphiDll, an amphioxus Distal-less gene transcribed in the ectoderm, epidermis and nervous system: Insights into evolution of craniate forebrain and neural crest. Development. 1996;122:2911–2920. doi: 10.1242/dev.122.9.2911. [DOI] [PubMed] [Google Scholar]
- Holland P.W.H., Patton S.J., Brooke N.M., Garcia-Fernàndez J. In: Advances in comparative endocrinology: XIII International Congress of Comparative Endocrinology. Kawashima S., Kikuyama S., editors. Monduzzi; Bologna, Italy: 1997. pp. 247–252. [Google Scholar]
- Holland P.W.H., Booth H.A.F., Bruford E. Classification and nomenclature of all human homeobox genes. BMC Biol. 2007;5:47. doi: 10.1186/1741-7007-5-47. [DOI] [PMC free article] [PubMed] [Google Scholar]
- The Honeybee Genome Sequencing Consortium Insights into social insects from the genome of the honeybee Apis mellifera. Nature. 2006;443:931–949. doi: 10.1038/nature05260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Inohara N., Chamaillard M., McDonald C., Nuñez G. NOD-LRR proteins: Role in host-microbial interactions and inflammatory disease. Annu. Rev. Biochem. 2005;74:355–383. doi: 10.1146/annurev.biochem.74.082803.133347. [DOI] [PubMed] [Google Scholar]
- International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. doi: 10.1038/35057062. [DOI] [PubMed] [Google Scholar]
- Jackman W.R., Mougey J.M., Panopoulou G.D., Kimmel C.B. Crabp and maf highlight the novelty of the amphioxus club-shaped gland. Acta Zool. Stockholm. 2004;85:91–99. [Google Scholar]
- Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N., Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., et al. Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype. Nature. 2004;431:946–957. doi: 10.1038/nature03025. [DOI] [PubMed] [Google Scholar]
- Janssen B.J., Huizinga E.G., Raaijmakers H.C., Roos A., Daha M.R., Nilsson-Ekdahl K., Nilsson B., Gros P. Structures of complement component C3 provide insights into the function and evolution of immunity. Nature. 2005;437:505–511. doi: 10.1038/nature04005. [DOI] [PubMed] [Google Scholar]
- Jeffery W.R., Strickler A.G., Yamamoto Y. Migratory neural crest-like cells form body pigmentation in a urochordate embryo. Nature. 2004;431:696–699. doi: 10.1038/nature02975. [DOI] [PubMed] [Google Scholar]
- Jung D., Alt F.W. Unraveling V(D)J recombination; insights into gene regulation. Cell. 2004;116:299–311. doi: 10.1016/s0092-8674(04)00039-x. [DOI] [PubMed] [Google Scholar]
- Kent W.J. BLAT—The BLAST-like alignment tool. Genome Res. 2002;12:656–664. doi: 10.1101/gr.229202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koyanagi M., Kubokawa K., Tsukamoto H., Shichida Y., Terakita A. Cephalochordate melanopsin: Evolutionary linkage between invertebrate visual cells and vertebrate photosensitive retinal ganglion cells. Curr. Biol. 2005;15:1065–1069. doi: 10.1016/j.cub.2005.04.063. [DOI] [PubMed] [Google Scholar]
- Kusakabe T., Kusakabe R., Kawakami I., Satou Y., Satoh N., Tsuda M. Ci-opsin1, a vertebrate-type opsin gene, expressed in the larval ocellus of the ascidian Ciona intestinalis. FEBS Lett. 2001;506:69–72. doi: 10.1016/s0014-5793(01)02877-0. [DOI] [PubMed] [Google Scholar]
- Lacalli T.C. Sensory systems in amphioxus: A window on the ancestral chordate condition. Brain Behav. Evol. 2004;64:148–162. doi: 10.1159/000079744. [DOI] [PubMed] [Google Scholar]
- Law S.K., Dodds A.W. The internal thioester and the covalent binding properties of the complement proteins C3 and C4. Protein Sci. 1997;6:263–274. doi: 10.1002/pro.5560060201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leung P.C.K., Cheng C.K., Zhu X.-M. Multi-factorial role of GnRH-I and GnRH-II in the human ovary. Mol. Cell. Endocrinol. 2003;202:145–153. doi: 10.1016/s0303-7207(03)00076-5. [DOI] [PubMed] [Google Scholar]
- Litman G.W., Cannon J.P., Dishaw L.J. Reconstructing immune phylogeny: New perspectives. Nat. Rev. Immunol. 2005;5:866–879. doi: 10.1038/nri1712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luke G.N., Castro L.F.C., McLay K., Bird C., Coulson A., Holland P.W.H. Dispersal of NK homeobox gene clusters in amphioxus and humans. Proc. Natl. Acad. Sci. 2003;100:5292–5295. doi: 10.1073/pnas.0836141100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martinon F., Tschopp J. Inflammatory caspases: Linking an intracellular innate immune system to autoinflammatory diseases. Cell. 2004;117:561–574. doi: 10.1016/j.cell.2004.05.004. [DOI] [PubMed] [Google Scholar]
- Meulemans D., Bronner-Fraser M. Gene-regulatory interactions in neural crest evolution and development. Dev. Cell. 2004;7:291–299. doi: 10.1016/j.devcel.2004.08.007. [DOI] [PubMed] [Google Scholar]
- Minguillón C., Ferrier D.E., Cebrián C., Garcia-Fernàndez J. Gene duplications in the prototypical cephalochordate amphioxus. Gene. 2002;287:121–128. doi: 10.1016/s0378-1119(01)00828-9. [DOI] [PubMed] [Google Scholar]
- Mongin E., Louis C., Holt R.A., Birney E., Collins F.H. The Anopheles gambiae genome: An update. Trends Parasitol. 2004;20:49–52. doi: 10.1016/j.pt.2003.11.003. [DOI] [PubMed] [Google Scholar]
- Nagawa F., Kishishita N., Shimizu K., Hirose S., Miyoshi M., Nezu J., Nishimura T., Nishizumi H., Takahashi Y., Hashimoto S., et al. Antigen-receptor genes of the agnathan lamprey are assembled by a process involving copy choice. Nat. Immunol. 2007;8:206–213. doi: 10.1038/ni1419. [DOI] [PubMed] [Google Scholar]
- Nonaka M., Kimura A. Genomic view of the evolution of the complement system. Immunogenetics. 2006;58:701–713. doi: 10.1007/s00251-006-0142-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Olinski R.P., Lundin L.-G., Hallböök F. Conserved synteny between the Ciona genome and human paralogons identifies large duplication events in the molecular evolution of the insulin-relaxin gene family. Mol. Biol. Evol. 2006;23:10–22. doi: 10.1093/molbev/msj002. [DOI] [PubMed] [Google Scholar]
- Pancer Z., Cooper M.D. The evolution of adaptive immunity. Annu. Rev. Immunol. 2006;24:497–518. doi: 10.1146/annurev.immunol.24.021605.090542. [DOI] [PubMed] [Google Scholar]
- Pennacchio L.A., Ahituv N., Moses A.M., Prabhakar S., Nobrega M.A., Shoukry M., Minovitsky S., Dubchak I., Holt A., Lewis K.D., et al. In vivo enhancer analysis of human conserved non-coding P sequences. Nature. 2006;444:499–502. doi: 10.1038/nature05295. [DOI] [PubMed] [Google Scholar]
- Powers T.P., Amemiya C.T. Evidence for a Hox14 paralog group in vertebrates. Curr. Biol. 2004;14:R183–R184. doi: 10.1016/j.cub.2004.02.015. [DOI] [PubMed] [Google Scholar]
- Prochnik S.E., Rokhsar D.S., Aboobaker A.A. Evidence for a microRNA expansion in the bilaterian ancestor. Dev. Genes Evol. 2007;217:73–77. doi: 10.1007/s00427-006-0116-1. [DOI] [PubMed] [Google Scholar]
- Putnam N.H., Blow M.J., Butts T., Ferrier D.E.K., Furlong R.F., Hellsten U., Kawashima T., Robinson-Rechavi M., Shoguchi D., Terry A., et al. The amphioxus genome and the evolution of the chordate karyotype. Nature. 2008 doi: 10.1038/nature06967. (in press) [DOI] [PubMed] [Google Scholar]
- Raible F., Tessmar-Raible K., Arboleda E., Kaller T., Bork P., Arendt D., Arnone M.I. Opsins and clusters of sensory G-protein-coupled receptors in the sea urchin genome. Dev. Biol. 2006;300:461–475. doi: 10.1016/j.ydbio.2006.08.070. [DOI] [PubMed] [Google Scholar]
- Rast J.P., Smith L.C., Loza-Coll M., Hibino T., Litman G.W. Genomic insights into the immune system of the sea urchin. Science. 2006;314:952–956. doi: 10.1126/science.1134301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rogozin I.B., Iyer L.M., Liang L., Glazko G.V., Liston V.G., Pavlov Y.I., Aravind L., Pancer Z. Evolution and diversification of lamprey antigen receptors: Evidence for involvement of an AID-APOBEC family cytosine deaminase. Nat. Immunol. 2007;8:647–656. doi: 10.1038/ni1463. [DOI] [PubMed] [Google Scholar]
- Schubert M., Holland N.D., Laudet V., Holland L.Z. A retinoic acid-Hox hierarchy controls both anterior/posterior patterning and neuronal specification in the developing central nervous system of the cephalochordate amphioxus. Dev. Biol. 2006;296:190–202. doi: 10.1016/j.ydbio.2006.04.457. [DOI] [PubMed] [Google Scholar]
- Sembrat K. Effect of the endostyle of the lancelet (Branchiostoma lanceolatum Pall.) on the metamorphosis of axolotl. Zoolog. Polon. 1953;6:3–19. [Google Scholar]
- Shimeld S.M., Holland P.W.H. Vertebrate innovations. Proc. Natl. Acad. Sci. 2000;97:4449–4452. doi: 10.1073/pnas.97.9.4449. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sodergren E., Weinstock G.M., Davidson E.H., Cameron R.A., Gibbs R.A., Angerer R.C., Angerer L.M., Arnone M.I., Burgess D.R., Burke R.D., et al. The genome of the sea urchin Strongylocentrotus purpuratus. Science. 2006;314:941–952. doi: 10.1126/science.1133609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stein L.D., Bao Z., Blasiar D., Blumenthal T., Brent M.R., Chen N., Chinwalla A., Clarke L., Clee C., Coghlan A., et al. The genome sequence of Caenorhabditis briggsae: A platform for comparative genomics. PLoS Biol. 2003;1:e45. doi: 10.1371/journal.pbio.0000045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sudo S., Kuwabara Y., Park J.-I., Hsu S.Y., Hsueh A.J.W. Heterodimeric fly glycoprotein hormone-alpha2 (GPA2) and glycoprotein hormone-beta5 (GPB5) activate fly leucine-rich repeat-containing G protein-coupled receptor-1 (DLGR1) and stimulation of human thyrotropin receptors by chimeric fly GPA2 and human GPB5. Endocrinology. 2005;146:3596–3604. doi: 10.1210/en.2005-0317. [DOI] [PubMed] [Google Scholar]
- Tarttelin E.E., Bellingham J., Hankins M.W., Foster R.G., Lucas R.J. Neuropsin (Opn5): A novel opsin identified in mammalian neural tissue. FEBS Lett. 2003;554:410–416. doi: 10.1016/s0014-5793(03)01212-2. [DOI] [PubMed] [Google Scholar]
- Tello J.A., Rivier J.E., Sherwood N.M. Tunicate gonadotropin-releasing hormone GnRH peptides selectively activate Ciona intestinalis GnRH receptors and the green monkey type II GnRH receptor. Endocrinology. 2005;146:4061–4073. doi: 10.1210/en.2004-1558. [DOI] [PubMed] [Google Scholar]
- Terakita A. The opsins. Genome Biol. 2005;6:213. doi: 10.1186/gb-2005-6-3-213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tong W., Kerkof P., Chaikoff I.L. Identification of labeled thyroxine and triiodothyronine in amphioxus treated with 131-I. Biochim. Biophys. Acta. 1962;56:326–331. doi: 10.1016/0006-3002(62)90570-x. [DOI] [PubMed] [Google Scholar]
- Venter J.C., Adams M.D., Myers E.W., Li P.W., Mural R.J., Sutton G.G., Smith H.O., Yandell M., Evans C.A., Holt R.A., et al. The sequence of the human genome. Science. 2001;291:1304–1351. doi: 10.1126/science.1058040. [DOI] [PubMed] [Google Scholar]
- Waterston R.H., Lindblad-Toh K., Birney E., Rogers J., Abril J.F., Agarwal P., Agarwala R., Ainscough R., Alexandersson M., An P., et al. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002;420:520–562. doi: 10.1038/nature01262. [DOI] [PubMed] [Google Scholar]
- Weng Y.-Z., Song H.-X., Fang Y.-Q. Distribution of prolactin and PRL receptor-like immunoreactivities in the nervous system, Hatschek's pit and other tissues of amphioxus Branchiostoma belcheri. Acta Zool. Sinica. 2006;52:907–915. [Google Scholar]
- Werts C., Girardin S.E., Philpott D.J. TIR, CARD and PYRIN: Three domains for an antimicrobial triad. Cell Death Differ. 2006;13:798–815. doi: 10.1038/sj.cdd.4401890. [DOI] [PubMed] [Google Scholar]
- Woolfe A., Goodson M., Goode D.K., Snell P., McEwen G.K., Vavouri T., Smith S.F., North P., Callaway H., Kelly K., et al. 2005Highly conserved non-coding sequences are associated with vertebrate development PLoS Biol. 3e7 . 10.1371/journal.pbio.0030007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yagi K., Satou Y., Mazet F., Shimeld S.M., Degnan B., Rokhsar D., Levine M., Kohara Y., Satoh N.A. Genomewide survey of developmentally relevant genes in Ciona intestinalis III. Genes for Fox, ETS, nuclear receptors and NFκB. Dev. Genes Evol. 2003;213:235–244. doi: 10.1007/s00427-003-0322-z. [DOI] [PubMed] [Google Scholar]
- Yokoyama S. Molecular evolution of retinal and nonretinal opsins. Genes Cells. 1996;1:787–794. doi: 10.1046/j.1365-2443.1996.d01-273.x. [DOI] [PubMed] [Google Scholar]
- Yu J.-K., Satou Y., Holland N.D., Shin-I T., Kohara Y., Satoh N., Bronner-Fraser M., Holland L.Z. Axial patterning in cephalochordates and the evolution of the organizer. Nature. 2007;445:613–617. doi: 10.1038/nature05472. [DOI] [PubMed] [Google Scholar]