ABSTRACT
Evolutionary developmental biology has grown historically from the capacity to relate patterns of evolution in anatomy to patterns of evolution of expression of specific genes, whether between very distantly related species, or very closely related species or populations. Scaling up such studies by taking advantage of modern transcriptomics brings promising improvements, allowing us to estimate the overall impact and molecular mechanisms of convergence, constraint or innovation in anatomy and development. But it also presents major challenges, including the computational definitions of anatomical homology and of organ function, the criteria for the comparison of developmental stages, the annotation of transcriptomics data to proper anatomical and developmental terms, and the statistical methods to compare transcriptomic data between species to highlight significant conservation or changes. In this article, we review these challenges, and the ongoing efforts to address them, which are emerging from bioinformatics work on ontologies, evolutionary statistics, and data curation, with a focus on their implementation in the context of the development of our database Bgee (http://bgee.org). J. Exp. Zool. (Mol. Dev. Evol.) 324B: 372–382, 2015. © 2015 The Authors. J. Exp. Zool. (Mol. Dev. Evol.) published by Wiley Periodicals, Inc.
Variation in animal or plant morphology is caused by variation in their genomes and in their environment (phenotypic plasticity). Evolutionary developmental biology, or Evo‐Devo, aims at understanding the origin of morphological variations, whether between closely related species [e.g., the variation in wings shapes and colors between sister species of butterflies (Brunetti et al., 2001; Beldade and Brakefield, 2002; Monteiro and Podlaha, 2009)], or between very distant lineages [e.g., the divergence between insect and vertebrate body‐plans (Raff, 2000; De Robertis, 2008)], and how genomic variation translates into morphological variation at all scales. Anatomical differences between species are studied in the context of the developmental processes generating species anatomy, and the role of genes in these developmental processes (Bolker, 2014). The involvement of genes and their function in the developmental process is traditionally determined by comparing their expression patterns between species. A particular emphasis is put on genes which exhibit either rapid changes in expression, which have been linked to numerous interesting developmental differences, or deep conservation of expression patterns, which have been show to underlie developmental similarities on unexpectedly large evolutionary scales (Raff, 2000; Carroll, 2005). This has long been done on a small scale, a few genes at a time, using techniques such as in situ hybridization or quantitative RT‐PCR. With microarrays, then RNA‐seq, it has become possible to study expression patterns on a genome‐wide scale. Yet, application of these techniques to questions from Evo‐Devo has lagged somewhat behind other fields. This stands in contrast to the rapid and productive use that was made of the increasingly available genome sequences, illustrated for example by the in‐depth study of the evolution of Hox genes (Hoegg and Meyer, 2005), or enhancers sequences (Holland et al., 2008).
Although there has been recent progress in the comparison of transcriptomes of closely related species (Gallego Romero et al., 2012; Necsulea and Kaessmann, 2014) the difficulties in answering many of the questions posed in Evo‐Devo remains. How do changes in expression patterns translate to changes in the function of homologous organs? What is the molecular basis for morphological innovation? What can deep conservation of expression tell us about homology relations between distant species? How much of transcriptome evolution is convergent versus parallel? Here, we discuss some of these issues and emerging solutions. We believe that the next years will be a very fruitful time for the application of transcriptomics to Evo‐Devo, and that this will yield important insights into the evolution of development and of anatomy.
WHAT TO COMPARE? THE PROBLEM OF ANATOMICAL HOMOLOGY AND FUNCTIONAL EQUIVALENCE
A major problem in comparing expression patterns on a large scale between species is the definition of what is comparable. In comparative genomics, orthologous genes can be readily identified by sequence comparison, and reliable data are now available from numerous databases (Sonnhammer et al., 2014). But expression patterns are much more complex combinations of genes and of anatomical and temporal patterns. Thus criteria to guide the comparison between these structures are necessary.
The most obvious comparison criterion is homology, as is used for the genes themselves. Homology is most commonly taken in the sense of “historical homology,” a definition in which the common evolutionary origin has to derive from common descent, that is, the common ancestor of both species possessed the anatomical structure, and both present day species being compared inherited it from this ancestor. But the interpretations of the homology concept have changed with progresses in biology, and different subfields operate with different definitions (Roux and Robinson‐Rechavi, 2010). Notably, the “biological homology” concept considers organs homologous if they share a set of developmental constraints (Wagner, 1989). This process‐oriented and more mechanistic definition, mostly used in Evo‐Devo, encompasses repeated parts in the same organism (e.g., somites), as well as sexually differentiated parts of individuals of the same species (e.g., testis and ovaries), which do not pass the common ancestry criterion, yet are important to our understanding of the relation between development and the evolution of form.
For transcriptome comparisons between very closely related species, such distinctions are usually not an issue: homologous structures are mostly obvious and comply with most definitions of the homology concept. For example, in comparisons among Great Apes, such as between humans and chimpanzees [see Gallego Romero et al. (2012) for a review], it is mostly trivial to accept that structures with the same name are homologous, and that moreover they essentially perform the same function. The situation is similar for the comparison of domesticated plants and their wild relatives (Koenig et al., 2013), or even two slightly more divergent C3 and C4 Cleomaceae species (Kulahoglu et al., 2014). In the latter case, there is for example little doubt that the sepals or petals are homologous over the ca. 10–20 million years of divergence investigated.
But when more distant species are compared, which anatomical structures should be compared becomes less obvious. Moreover, it is not always clear whether the most biologically relevant comparisons should involve homologous or functionally equivalent structures. Indeed, some homologies are well established, but correspond to organs which are widely divergent in function and structure [e.g., tetrapod lung and actinopterygian fish swim bladder (Zheng et al., 2011)]. For other structures, such as the vomeronasal organ in different tetrapods, or the ovary between teleost fishes and mammals, similarity of function is clear but homology is debated [for a discussion see Niknejad et al. (2012)]. Similarly, the homology of segmentation mechanisms among bilaterian lineages remains unclear (De Robertis, 2008).
To take into account the complexity of homology on a large scale, we have been developing computational representations of homology, through ontologies (Parmentier et al., 2010; Roux and Robinson‐Rechavi, 2010; Niknejad et al., 2012), and using them in the annotation of transcriptome data to anatomical information. The distribution of the ontologies (Noy et al., 2009), and the recent merger of our efforts into Uberon (Mungall et al., 2012; Haendel et al., 2014), allows any comparative transcriptomics or comparative functional genomics project with an interest in homology to built upon this work (Robinson and Webber, 2014).
It is worth noting that the comparison of homologous organs may not be the relevant criterion if the aim is to detect sets of genes whose expression is responsible for the emergence or maintenance of particular functions (Bolker, 2014). It is generally assumed that the expression of core genes responsible for conserved functions should be itself conserved (Levine and Davidson, 2005). But in homologous structures, the signal of functional constraint might be difficult to characterize from the background similarity in gene expression due to their common origin. The task of isolating the common molecular signals responsible for functional similarity could thus be facilitated by comparing non‐homologous organs with similarity in function, for example, tetrapod lung versus fish gills, both used as respiratory organs, or structures used in pregnancies of female mammals versus male Syngnathids (seahorses and pipefishes) (Stölting and Wilson, 2007). Homologous organs that present functions that were not inherited from the common ancestor but have evolved in parallel can also be used, for example, body parts of cichlid fishes which evolved in parallel in African or Central American lakes to perform the same functions, in species that colonized similar ecological niches (Elmer and Meyer, 2011). In this context many studies have successfully related patterns of phenotypic convergence to patterns of gene expression, at the scale of a few genes.
Between closely related species, it was notably shown that independent genetic changes could be responsible for regulatory changes, often of homologous genes, leading to the parallel evolution of similar phenotypes (Martin and Orgogozo, 2013). Examples include the evolution of black spots on wings of different Drosophila species, involving the regulation of the yellow gene (Gompel et al., 2005; Prud'homme et al., 2006), or the parallel evolution of opsins expression affecting color perception in cichlids (Hofmann et al., 2010; O'Quin et al., 2010). Between marine and freshwater populations of the stickleback fish, differences in the number of armor plates have been associated to an increased linkage disequilibrium within intron 2 of the EDA gene (Colosimo et al., 2005), likely to influence the regulation of this signaling gene between the two morphs (Knecht et al., 2007). A recent genome‐wide study confirmed this finding, and found that across all genomic variants implicated in the repeated adaptations of marine populations to freshwater, regulatory changes likely played a substantially more prominent role than coding changes (Jones et al., 2012). Interestingly, parallel patterns of regulatory changes in the same genes in different species or populations are often caused by nucleotide substitutions affecting the same cis‐regulatory elements (Elmer and Meyer, 2011).
There are still few transcriptome‐wide studies of convergent anatomical structures, but recent work highlights the diversity of signatures underlying phenotypic convergence. For example, the convergent evolution of bacterial photophores of two squid species extends beyond anatomy, to highly convergent transcriptome profiles (Pankey et al., 2014). Whereas the convergent evolution of external genitalia in different amniotes is mirrored in patterns of transcriptomes which relate more to the tissue of origin of these new structures than to the evolution of their functional equivalence (Tschopp et al., 2014). It remains to be seen which case is more general: higher transcriptome similarity between convergent structures, or higher transcriptome similarity between homologous (yet functionally divergent) structures. And it remains to be seen which factors could lead to deep reorganization of transcriptomes with the evolution of new functions.
At larger evolutionary distances, such as between insects and vertebrates, there are few unambiguous homologous anatomical structures between species, and most shared functions are the result of convergence or parallel evolution. But fascinating, and unexpected, examples of gene expression patterns similarity were uncovered by Evo‐Devo studies. The most famous example is probably the case of vertebrate and arthropods eyes, which are not homologous, but express similar transcription factors during their developmental cascade [including Pax6 (Halder et al., 1995)]. The concept of “deep homology” (Shubin et al., 2009) has been proposed to describe the relation between structures which share some homology in their development, notably in their key patterning genes, also share similarity of function and of structure, yet are not homologous at the anatomical level, certainly not in the sense of historical homology. Deep homology is formalized in our HOM ontology (Roux and Robinson‐Rechavi, 2010) (http://purl.obolibrary.org/obo/HOM_0000044) as a special case of parallelism, which like convergence is a case of homoplasy. Other examples illustrate such a co‐option of common, reusable toolkits, such as the developmental program of arthropod and vertebrate appendages, implicating the transcription factor Dll/Dlx (Nielsen and Martinez, 2003). Interestingly, this gene is also expressed in the distal tips of “outgrowths” in diverse animals, for example, the horns of beetles (Moczek, 2006). Interestingly, it seems that some orthologous loci behave as mutational “hotspots” for parallel quantitative variation of phenotypes, over phylogenetic distances spanning populations to phyla (e.g., insects and nematodes, or mollusks and vertebrates) (Martin and Orgogozo, 2013).
Conversely, convergent or parallel phenotypes can evolve with non‐homologous gene expression. For example, the convergent evolution of coat color in different subspecies of beach mice probably evolved through changes in expression of different genes (Manceau et al., 2010). More striking, homologous structures are not always patterned by conserved expression patterns. For example, the vulva of different species of nematodes are patterned by non‐homologous pathways (Schlager et al., 2006) and the proteins expressed in the crystalline lens of different vertebrate species are unrelated (Piatigorsky and Wistow, 1991).
Overall, the correspondence between homology at the levels of gene expression and of anatomy is complex (Hall, 1994), yet with proper experimental design can be very informative. Genes can be co‐opted to conduct the same function in non‐homologous structures, while vastly divergent expression can underlie homologous structures. Determining which anatomical comparisons are likely to be informative in the comparison of transcriptomes between species is not easy, especially between very distant species, and requires good anatomical and developmental knowledge. A challenge of future Evo‐Devo transcriptomics is to integrate such knowledge with genome‐scale studies to provide relevant insight into the relation between the evolution of organ function and homology.
WHAT TO COMPARE? THE PROBLEM OF HETEROCHRONY
For developmental processes, as for anatomy, we need to define what is comparable if we are to perform meaningful transcriptome comparisons. But unlike for anatomical structures, there are no clear homology relations between developmental stages. This is mostly because of heterochrony—differences in the relative timing of developing organs between species. For example, the human and rat developmental stages at which heart reaches the same developmental milestone might differ, and other structures in these two species can reach their developmental milestones in a different order [detailed in Jeffery et al. (2002)]. Thus, even a pair of very similar developmental stages between two species will usually include some organs or tissues whose development is more advanced in one species than the other.
Accordingly, heterochrony patterns are recovered in transcriptome data. For example, clear examples of timing differences in the expression of several gene pathways were found in a whole body comparison of closely related Xenopus species (Yanai et al., 2011). In a more anatomically targeted study focusing on the development of human, chimpanzee and rhesus macaque prefrontal cortex, as many as 71% of genes were found to change in timing of expression (Somel et al., 2009). In both of these examples, closely related species were compared. For more distant species, very broad developmental periods can be compared, to diminish the impact of heterochrony and ensure some level of comparability. For example, in Comte et al. (2010), we calculated the conservation of expression of orthologous genes between zebrafish and mouse for seven broad developmental stages, which lacked fine resolution, but was sufficient to detect significant differences between early development (up to neurula) and late development. These broad bilaterian “meta‐stages” are available in our database of gene expression evolution Bgee (Bastian et al., 2008). Alternatively, transcriptome data themselves can be used to decide which developmental time points should be compared. The expression levels of genes possess information on the temporal order of samples from different developmental stages (Anavy et al., 2014; Snoek et al., 2014). It is thus possible to perform all‐against‐all comparisons of transcriptome time series from pairs of species, to align their developmental stages and define the “most comparable” ones. This approach has allowed to identify interesting patterns of expression conservation or divergence over ontogeny (Parikh et al., 2010; Irie and Kuratani, 2011), notably a minimum temporal divergence between Drosophila species at mid development (Kalinka et al., 2010; Gerstein et al., 2014; Li et al., 2014).
CAN WE COMPARE LEVELS OF EXPRESSION BETWEEN SPECIES ON A LARGE‐SCALE?
In classical small‐scale Evo‐Devo studies, gene expression patterns have been typically compared in terms of presence or absence (Carroll, 2005). Yet expression levels can be key in understanding changes in ontogeny. This is for example illustrated by patterning genes whose expression levels influence developmental phenotypes—for example, Ubx for the formation of Drosophila halteres (de Navas et al., 2011)—but also by gradient genes whose expression levels control growth and differentiation during development—for example, Dpp for Drosophila eye morphogenesis (Wartlick et al., 2014), retinoic acid and fibroblast growth factor signaling for vertebrate neurogenesis (Pera et al., 2014), or KNOX genes for tomato ripening (Nadakuduti et al., 2014).
Such changes in expression levels have been shown to be involved in the emergence of developmental innovations during evolution. For example, differences in metamorphosis among vertebrates, including heterochronic shifts, seem to be driven in large part by changes in the expression level of thyroid hormone receptors (Laudet, 2011). Other examples include floral pigmentation differences between plant species, determined in part by changes in expression level of genes from the anthocyanin pathway (Sobel and Streisfeld, 2013), wing pigmentation differences across Drosophila species, controlled by the interplay between the expression levels of the yellow, ebony, and Dll genes (Wittkopp et al., 2002; Arnoult et al., 2013), and the fin‐to‐limb transition in evolution, involving changes in the complex interactions between Hox, Shh, Fgf8, and Wnt3a signals (Yano and Tamura, 2013).
Thus it is clear that the level of gene expression, and its evolution, plays a role in morphogenesis and its evolution. To understand the extent of this role, large‐scale quantitative comparisons are necessary. A comparison of expression levels in mammalian organs identified numerous modules of genes with conserved tissue‐specific expression, but whose absolute expression levels shifted in particular taxonomic groups, for example, primates (Brawand et al., 2011; Necsulea and Kaessmann, 2014). And the comparison of developmental transcriptomes of Xenopus species revealed that the dominant mode of divergence between species was through changes in levels of expression, which occurred at a much higher frequency than heterochronic shifts in expression (Yanai et al., 2011). It is thus relevant to consider expression levels as a primary evolving phenotype.
Microarrays allowed the first large‐scale studies of the evolution of gene expression between species, but their analysis in an evolutionary context poses specific problems. Most microarray datasets whose comparison would be interesting were generated by different studies carried out separately in different species. Absolute expression levels are thus difficult to compare directly in such cases because they suffer from important batch effects. There can also be large biases in signal between microarray platforms and technologies used in different species, for example because the hybridization strength depends on probes and on gene sequence composition. Thus, to detect conserved or divergent levels of expression between species, post‐processing approaches are often used. It is for example possible to compare lists of co‐expressed genes, or modules, derived individually in each species (Bergmann et al., 2003; Stuart et al., 2003; Lu et al., 2009; Piasecka et al., 2012a). It is also possible to compare lists of differentially expressed genes between different conditions in each species, or to compare lists of functional categories of genes enriched for differentially expressed genes in each species (Lu et al., 2009).
Correlative methods have also been used to compare the hybridization signals across species. Gene expression is considered conserved if the orthologs are strongly correlated across different conditions or tissues between species. The most commonly used methods are Pearson or Spearman correlations and Euclidian distance (Su et al., 2004; Liao and Zhang, 2006; Xing et al., 2007; Parikh et al., 2010). However, the results of these approaches have been shown to differ greatly when different measures or data normalization schemes are used, and they behave differently for tissue‐specific genes or broadly expressed (housekeeping) genes (Piasecka et al., 2012b), which complicates their interpretation. Additionally, correlative methods rarely take into account the variability of expression levels within species.
Direct comparing of hybridization signal is more legitimate when technical problems of microarrays in comparative studies are minimized or circumvented, for example, when the hybridization of samples from different species is performed within the same study, on the same microarray platform (Ranz et al., 2003; Rifkin et al., 2003; Nuzhdin et al., 2004; Ometto et al., 2011). Unfortunately, such an approach only works for the comparison of very closely related species where the hybridization of cDNAs on probes of the microarray is not perturbed by the low nucleotide divergence in the sequence of orthologous genes (Lu et al., 2009). Otherwise, probes that do not match exactly all orthologous target genes have to be ignored (Khaitovich et al., 2005; Kalinka et al., 2010). Customized multi‐species microarrays have also been designed to eliminate the sequence mismatch effects. Samples from two species are competitively hybridized to the probes and intensity ratios are estimated by averaging signal on probes from the different species (Gilad et al., 2005; Vallee et al., 2006; Oshlack et al., 2007). However these approaches are dependent on having a high quality genome annotation for each species in order to locate sequence differences and to account for their potential effect.
RNA‐seq is now allowing major progresses in describing gene expression variation between species. This technique has a larger dynamic range than microarrays, and can also be used to study differences in exon usage and alternative splicing (Gallego Romero et al., 2012). Importantly, it allows the study of non‐model species in the absence of a sequenced genome (Grabherr et al., 2011; Perry et al., 2012), or when the genome sequence is of poor quality. The advantages of RNA‐seq allow more straightforward direct comparisons of expression levels between species, and interesting insights have been provided by the first evolutionary studies using this technology (Blekhman et al., 2010; Brawand et al., 2011; Perry et al., 2012; Khan et al., 2013).
When expression levels are directly compared across species, heuristic approaches have mostly been used to identify patterns of conservation or of directional selection (Fraser, 2011). Expression divergence between species is compared to the divergence within biological replicates of the same species. This procedure is similar to the McDonald and Kreitman test (MK test) of positive selection, widely used to characterize selection on protein‐coding sequences (McDonald and Kreitman, 1991). Genes under stabilizing selection display limited expression variation both within and between species, whereas genes under directional selection in a given lineage display a shift in mean expression between species, while maintaining a low variation within species. Of note, alternative explanations for the latter pattern are possible, such as a relaxation of selective constraints, although this scenario should be accompanied with an increase of within species variation. Progress has recently been made towards the development of formal models to test for positive selection on expression levels (Rohlfs et al., 2014). Still, these approaches cannot control for differences in environmental input to gene regulation between species (Gallego Romero et al., 2012).
Even though many technological challenges have been overcome in the past years, the comparison of expression levels between species holds numerous remaining difficulties (Dunn et al., 2013), particularly when the species compared are distant. First, expression levels measured in a given tissue are a mix of expression levels of the cell types constituting this tissue. Tissues of different species often display differences in cellular composition, notably in the proportion and localization of different cell types. This factor can result in artificial differences in expression levels and bias the comparison between species (Pantalacci and Semon, 2014). For example, brain, blood or pancreatic samples were shown to typically vary substantially in their cellular composition between even closely related species (Hill and Walsh, 2005; Magalhaes et al., 2010; Steiner et al., 2010).
Second, a basic source of measurement noise both in microarray and RNA‐seq transcriptomic studies comes from alternative splicing (Barbosa‐Morais et al., 2012; Merkin et al., 2012; McManus et al., 2014). Alternative‐splicing patterns have been shown to evolve fast, and in consequence to be mostly species‐specific. A difference of the isoforms expressed between species can affect the measure of expression level because of differences in length (e.g., longer transcripts produce more reads in RNA‐seq) and in exon structure (e.g., microarray probes may be designed to bind only some exons). Although RNA‐seq facilitates the reconstruction of the pool of all isoforms for each gene, which then could ideally be examined in isolation, this task still proves to be very challenging (Hayer et al., 2014).
Third, the heterogeneity of annotation may bias expression comparison between species. Gene models for orthologs may contain fragments that are not orthologous. A solution to this problem is to carefully align the nucleotide sequences of orthologs sets to compare, and restrict further analysis only to fragments that are strictly orthologous (Blekhman et al., 2010; Brawand et al., 2011). The difference in the genomic context of orthologous genes between species can also be an issue. Different genomes contain different numbers of paralogous genes and pseudogenes, which may both be sources of spurious reads or of unspecific hybridization background. This challenges the comparison of genomic regions whose sequences are unique in the genome of some species but not others, leading to differential mappability of RNA‐seq reads.
Although several recent papers illustrate the power of comparative RNA‐seq and other functional genomics approaches between close species, and even try to extend them to the comparison of distant species (Boyle et al., 2014; Chen et al., 2014; Gerstein et al., 2014; Li et al., 2014), these studies remain limited in the anatomical and developmental complexity which is covered. In consequence their applications to fundamental Evo‐Devo questions remains limited. A notable study focused on the evolution of gene expression in distant Dictyostelium species, which was facilitated by the conserved morphology of these species (Parikh et al., 2010). But to link functional genomics to more fine questions of Evo‐Devo, more detailed transcriptome profiles are needed, which will in turn raise the issues of homology and heterochrony which have so far been circumvented.
DATA SETS FOR COMPARATIVE TRANSCRIPTOMICS ARE INCREASINGLY AVAILABLE AND INTEGRATED
Transcriptomics data, whether from microarrays or RNA‐seq experiments, is increasing exponentially in public databases (Rustici et al., 2013). Although it is mostly generated to answer biomedical and other practical questions, it covers an increasing number of organs, tissues, developmental stages, and species, of interest for Evo‐Devo studies. The challenge is to identify, recover and organize the relevant data among the tens of thousands of tumor and cell line samples. While there are several ongoing efforts to organize transcriptome data [notably the Gene Expression Atlas (Kapushesky et al., 2010) or the Gene Expression Barcode (McCall et al., 2014); see also Rung and Brazma (2013)], we will focus here on our database Bgee (http://bgee.org) (Bastian et al., 2008), which is the only effort to our knowledge which focuses on Evo‐Devo concerns: inter‐species comparisons using anatomical homology, detailed developmental stage annotation, and present/absent calls for different expression data types. We provide an overview of some major transcriptomic datasets of interest for Evo‐Devo, whether they are already integrated into Bgee or will be in the near future.
Bgee was the first database to integrate together RNA‐seq, microarray and in situ hybridization data. Bgee release 13 (December 2014) integrates more than 500 Illumina RNA‐seq libraries from 15 animal species: human (Homo sapiens), chimpanzee (Pan troglodytes), bonobo (Pan paniscus), gorilla (Gorilla gorilla), macaque (Macaca mulatta), mouse (Mus musculus), rat (Rattus norvegicus), pig (Sus scrofa), cow (Bos taurus), opossum (Musca domestica), platypus (Ornithorhynchus anatinus), chicken (Gallus gallus), lizard (Anolis carolinensis), frog (Xenopus tropicalis), and worm (Caenorhabditis elegans). A high proportion of these RNA‐seq samples comes from a few large‐scale comparative studies, such as Brawand et al. (2011), which contains samples from brain, cerebellum, heart, kidney, liver, and testis from nine species of placental mammals (great apes, rhesus macaque, and mouse), marsupials (gray short‐tailed opossum) and monotremes (platypus); Barbosa‐Morais et al. (2012), which complemented the above dataset with samples from 5 tissues of lizard and frog; and Merkin et al. (2012), which generated RNA‐seq samples from nine tissues from four mammals (rat, mouse, cow, and macaque), and one bird (chicken). The value of such comparative RNA‐seq datasets is reflected by their rapid reuse in independent comparative studies (Barbosa‐Morais et al., 2012; Gokcumen et al., 2013; Reyes et al., 2013; Hong et al., 2014; Washietl et al., 2014; Yang et al., 2014). For model organisms, these RNA‐seq data are completed by microarray data; the high number of collected experiments, and available methods for processing and analysis, still make microarrays a valuable resource for meta‐analyses. Bgee release 13 integrates more than 13,000 Affymetrix arrays, mapped on more than 400 anatomical structures in five model species: mouse, human, zebrafish (Danio rerio), fruit fly (Drosophila melanogaster) and worm.
All microarray and RNA‐seq datasets come from public repositories such Gene Expression Omnibus (GEO) (Barrett et al., 2013) or ArrayExpress (Rustici et al., 2013) and are carefully annotated by Bgee curators to appropriate anatomical structures and developmental stages classes of the ontologies (Haendel et al., 2014). All selected datasets and samples were collected in normal (non‐treated) conditions, and from wild‐type animals. This latter condition is critical for evolutionary studies, where expression should be as much as possible relevant to the evolutionary history and wild‐type selective pressures experienced by the species compared.
It is clear that the integration of datasets for non‐model species obtained with RNA‐seq is much easier than equivalent microarray datasets prepared using different technologies and often even with in‐house designed platforms. Such data, from diverse species, is increasingly available, and much of it is not yet integrated into Bgee. For example, a newly emerging source of transcriptomic data from many primate species is the non‐human primate reference transcriptome resource (NHPRTR). Their full RNA‐seq dataset consists of 157 libraries across 14 species or subspecies: Chimpanzee (P. troglodytes), Gorilla (G. gorilla), Indochinese Cynomolgus macaque (Macaca fascicularis), Mauritian Cynomolgus Macaque (M. fascicularis), Japanese macaque (Macaca fuscata), Chinese Rhesus macaque (M. mulatta), Indian Rhesus macaque (M. mulatta), Pig‐tailed macaque (Macaca leonina), Olive baboon (Papio anubis), Common marmoset (Callithrix jacchus), Ring‐tailed lemur (Lemur Catta), Mouse lemur (Microcebus sp.), Sooty mangabey (Cercocebus atys), and Squirrel monkey (Saimiri sp.), across up to 14 different tissues (Pipes et al., 2013).
Additionally, the ENCODE and modENCODE consortia have recently released large amounts of RNA‐seq data from mouse, human, worm and fly, sampling numerous developmental stages, tissues and cell types, many of them highly relevant to Evo‐Devo studies. Notably, more than 140 RNA‐seq worm samples and 250 fly samples cover their development at great resolution (35 developmental stages for worm and 30 developmental stages for fly) (Graveley et al., 2011; Gerstein et al., 2014; Li et al., 2014). Although the majority of the human transcriptomic data from the ENCODE project comes from cancer or immortalized cell lines, which limits its use for evolutionary analysis, more than 200 human RNA‐seq libraries were recently used along with worm and fly data for the identification of evolutionary conserved transcription modules between these distantly related species (Gerstein et al., 2014).
On a final note, while RNA‐seq and microarrays provide genome‐wide information, they often lack the fine resolution of in situ hybridizations, which are mostly small‐scale. Yet, there exist a few large‐scale projects aimed at generating in situ hybridizations for every gene of a species at different developmental time points, for example for mouse (Diez‐Roux et al., 2011) or for zebrafish (Thisse et al., 2004). Apart from these systematic efforts, a collection of many small‐scale studies can sum up to a large valuable resource, as reflected by the success of the Gene Ontology functional annotations (du Plessis et al., 2011). The abundance of in situ hybridizations integrated from published studies now provides an almost genomic overview of expression patterns in some species (Kassahn et al., 2009). Bgee integrates in situ hybridization data from four databases: BDGP for fruit fly (Hammonds et al., 2013), MGI for mouse (Smith et al., 2014), Xenbase for frog (Bowes et al., 2010), and ZFIN for zebrafish (Bradford et al., 2011). Thus it includes information from tens of thousands of independent in situ hybridization experiments. Unlike for microarrays and RNA‐seq data, the primary extraction and annotation of information from papers is done by the curators of each of these respective databases, while Bgee provides mapping to the bilaterian anatomy ontology Uberon (Haendel et al., 2014), and to ontologies of developmental stages. These data cover anatomical complexity in an extremely detailed manner.
Thus overall there exist abundant transcriptome and gene expression data, which are relevant to the questions posed by Evo‐Devo, and modern database efforts make these data increasingly available for genome‐wide investigation into evolutionary developmental biology.
FUTURE DIRECTIONS
A challenge as well as an opportunity of comparative transcriptomics for Evo‐Devo is relating large scale quantitative trends with morphological observations. The main limitation of comparative approaches so far is that, while it is relatively easy to establish significant conservation of expression, it is very difficult at a large scale to characterize significant changes of evolutionary relevance. Expression can differ because of many experimental reasons, from environmental conditions to measurement errors, as well as because of a lack of stabilizing selection. Yet it can also differ because of developmental innovations, convergent evolution, or other evolutionary scenarios of interest to understanding morphological variety. In the absence of a good baseline, similar to the synonymous substitution rate of protein‐coding gene codon models (Yang, 2006), calling such relevant changes in transcriptomic studies remains a major challenge. In the end, the evolutionary scenarios that are the most difficult to unveil are often the most fascinating, which motivates further methodological work into evolutionary transcriptomics.
Another important challenge is establishing causality: mechanistic causality, as in determining which gene expression changes determine changes in morphology; and evolutionary causality, as in determining which expression patterns are fixed for their role in morphological innovations or constraints. For this, we will probably need to obtain more transcriptomic data of variation both within and between species, and combine it with other functional genomics information, such as transcription factor binding patterns (ChIP‐seq) and large‐scale mutant screens.
Despite the challenges and limitations, we feel that we are at a privileged moment in the long pursuit of the goal of linking genome evolution to the evolution of form, with its constraints and its adaptive roles. First, the progress of RNA‐seq is unlocking the function of the genomes of many species beyond the classical model organisms. Second, new transcriptomics techniques are arriving which promise to combine genomic scale with anatomical precision. The development of single‐cell RNA‐seq has opened a wide array of possibilities for the study of the transcriptomics signatures of precise sets of cells extracted from developing organs (Tang et al., 2011). For example it has been recently used to study the evolutionary history of the endoderm germ layer (Hashimshony et al., 2014). There has been recent progress towards retrieving full transcriptomes of individual cells in situ, which proves particularly useful to study rare cell populations without compromising the quality of their transcriptomes (Battich et al., 2013; Avital et al., 2014; Lee et al., 2014; Lovatt et al., 2014; Crosetto et al., 2015). Finally, the improvement of ontologies and bioinformatics methods allow us to increasingly take advantage of these genomic datasets to answer long standing questions of Evo‐Devo.
ACKNOWLEDGEMENTS
J.R. is supported by Marie Curie fellowship PIOF‐GA‐2010‐273290. Work on Bgee and comparative transcriptomics in the M.R.‐R. lab is supported by the Swiss Institute of Bioinformatics, by the Swiss National Science Foundation [grants number 31003A 133011/1 and 31003A_153341/1], by SystemsX.ch project AgingX, and by Etat de Vaud.
Roux J, Rosikiewicz M, Robinson‐Rechavi M. 2015. What to compare and how: Comparative transcriptomics for Evo‐Devo. J. Exp. Zool. (Mol. Dev. Evol.) 324B:372–382.
LITERATURE CITED
- Anavy L, Levin M, Khair S, et al. 2014. BLIND ordering of large‐scale transcriptomic developmental timecourses. Development 141:1161–1166. [DOI] [PubMed] [Google Scholar]
- Arnoult L, Su KFY, Manoel D, et al. 2013. Emergence and diversification of fly pigmentation through evolution of a gene regulatory module. Science 339:1423–1426. [DOI] [PubMed] [Google Scholar]
- Avital G, Hashimshony T, Yanai I. 2014. Seeing is believing: new methods for in situ single‐cell transcriptomics. Genome Biol 15:110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barbosa‐Morais NL, Irimia M, Pan Q, et al. 2012. The evolutionary landscape of alternative splicing in vertebrate species. Science 338:1587–1593. [DOI] [PubMed] [Google Scholar]
- Barrett T, Wilhite SE, Ledoux P, et al. 2013. NCBI GEO: archive for functional genomics data sets‐update. Nucleic Acids Res 41:D991–D995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bastian F, Parmentier G, Roux J, et al. 2008. Bgee: integrating and comparing heterogeneous transcriptome data among species. Data integration in the life sciences. Lect Notes Comput Sci 5109:124–131. [Google Scholar]
- Battich N, Stoeger T, Pelkmans L. 2013. Image‐based transcriptomics in thousands of single human cells at single‐molecule resolution. Nat Methods 10:1127–1133. [DOI] [PubMed] [Google Scholar]
- Beldade P, Brakefield PM. 2002. The genetics and evo‐devo of butterfly wing patterns. Nat Rev Genet 3:442–452. [DOI] [PubMed] [Google Scholar]
- Bergmann S, Ihmels J, Barkai N. 2003. Similarities and differences in genome‐wide expression data of six organisms. PLoS Biol 2:e9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blekhman R, Marioni JC, Zumbo P, Stephens M, Gilad Y. 2010. Sex‐specific and lineage‐specific alternative splicing in primates. Genome Res 20:180–189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bolker JA. 2014. Model species in evo‐devo: a philosophical perspective. Evol Dev 16:49–56. [DOI] [PubMed] [Google Scholar]
- Bowes JB, Snyder KA, Segerdell E, et al. 2010. Xenbase: gene expression and improved integration. Nucleic Acids Res 38:D607–D612. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boyle AP, Araya CL, Brdlik C, et al. 2014. Comparative analysis of regulatory information and circuits across distant species. Nature 512:453–456. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bradford Y, Conlin T, Dunn N, et al. 2011. ZFIN: enhancements and updates to the zebrafish model organism database. Nucleic Acids Res 39:D822–D829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brawand D, Soumillon M, Necsulea A, et al. 2011. The evolution of gene expression levels in mammalian organs. Nature 478:343–348. [DOI] [PubMed] [Google Scholar]
- Brunetti CR, Selegue JE, Monteiro A, et al. 2001. The generation and diversification of butterfly eyespot color patterns. Curr Biol 11:1578–1585. [DOI] [PubMed] [Google Scholar]
- Carroll SB. 2005. Endless forms most beautiful: the new science of evo devo and the making of the animal kingdom. New York: WW Norton & Company. [Google Scholar]
- Chen Z‐X, Sturgill D, Qu J, et al. 2014. Comparative validation of the D. melanogaster modENCODE transcriptome annotation. Genome Res 24:1209–1223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Colosimo PF, Hosemann KE, Balabhadra S, et al. 2005. Widespread parallel evolution in sticklebacks by repeated fixation of ectodysplasin alleles. Science 307:1928–1933. [DOI] [PubMed] [Google Scholar]
- Comte A, Roux J, Robinson‐Rechavi M. 2010. Molecular signaling in zebrafish development and the vertebrate phylotypic period. Evol Dev 12:144–156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Crosetto N, Bienko M, van Oudenaarden A. 2015. Spatially resolved transcriptomics and beyond. Nat Rev Genet 16:57–66. [DOI] [PubMed] [Google Scholar]
- de Navas LF, Reed H, Akam M, et al. 2011. Integration of RNA processing and expression level control modulates the function of the Drosophila Hox gene Ultrabithorax during adult development. Development 138:107–116. [DOI] [PubMed] [Google Scholar]
- De Robertis EM. 2008. The molecular ancestry of segmentation mechanisms. Proc Natl Acad Sci USA 105:16411–16412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Diez‐Roux G, Banfi S, Sultan M, et al. 2011. A high‐resolution anatomical atlas of the transcriptome in the mouse embryo. PLoS Biol 9:e1000582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- du Plessis L, Škunca N, Dessimoz C. 2011. The what, where, how and why of gene ontology—a primer for bioinformaticians. Briefings Bioinformatics 12:723–735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dunn CW, Luo X, Wu Z. 2013. Phylogenetic analysis of gene expression. Integr Comp Biol 53:847–856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Elmer KR, Meyer A. 2011. Adaptation in the age of ecological genomics: insights from parallelism and convergence. Trends Ecol Evol 26:298–306. [DOI] [PubMed] [Google Scholar]
- Fraser HB. 2011. Genome‐wide approaches to the study of adaptive gene expression evolution. BioEssays 33:469–477. [DOI] [PubMed] [Google Scholar]
- Gallego Romero I, Ruvinsky I, Gilad Y. 2012. Comparative studies of gene expression and the evolution of gene regulation. Nat Rev Genet 13:505–516. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gerstein MB, Rozowsky J, Yan KK, et al. 2014. Comparative analysis of the transcriptome across distant species. Nature 512:445–448. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gilad Y, Rifkin SA, Bertone P, Gerstein M, White KP. 2005. Multi‐species microarrays reveal the effect of sequence divergence on gene expression profiles. Genome Res 15:674–680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gokcumen O, Tischler V, Tica J, et al. 2013. Primate genome architecture influences structural variation mechanisms and functional consequences. Proc Natl Acad Sci USA 110:15764–15769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gompel N, Prud'homme B, Wittkopp PJ, Kassner VA, Carroll SB. 2005. Chance caught on the wing: cis‐regulatory evolution and the origin of pigment patterns in Drosophila. Nature 433:481–487. [DOI] [PubMed] [Google Scholar]
- Grabherr MG, Haas BJ, Yassour M, et al. 2011. Full‐length transcriptome assembly from RNA‐Seq data without a reference genome. Nat Biotechnol 29:644–652. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Graveley BR, Brooks AN, Carlson JW, et al. 2011. The developmental transcriptome of Drosophila melanogaster . Nature 471:473–479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haendel M, Balhoff J, Bastian F, et al. 2014. Unification of multi‐species vertebrate anatomy ontologies for comparative biology in Uberon. J Biomed Semantics 5:21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Halder G, Callaerts P, Gehring WJ. 1995. Induction of ectopic eyes by targeted expression of the eyeless gene in Drosophila. Science 267:1788–1792. [DOI] [PubMed] [Google Scholar]
- Hall BK. 1994. Homology: the hierarchical basis of comparative biology. San Diego: Academic Press. [Google Scholar]
- Hammonds A, Bristow C, Fisher W, et al. 2013. Spatial expression of transcription factors in Drosophila embryonic organ development. Genome Biol 14:R140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hashimshony T, Feder M, Levin M, Hall BK, Yanai I. 2014. Spatiotemporal transcriptomics reveals the evolutionary history of the endoderm germ layer. Nature [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hayer K, Pizzaro A, Lahens NL, Hogenesch JB, Grant GR. 2014. Benchmark Analysis of Algorithms for Determining and Quantifying Full‐length mRNA Splice Forms from RNA‐Seq Data. [DOI] [PMC free article] [PubMed]
- Hill RS, Walsh CA. 2005. Molecular insights into human brain evolution. Nature 437:64–67. [DOI] [PubMed] [Google Scholar]
- Hoegg S, Meyer A. 2005. Hox clusters as models for vertebrate genome evolution. Trends Genet 21:421–424. [DOI] [PubMed] [Google Scholar]
- Hofmann CM, O'Quin KE, Smith AR, Carleton KL. 2010. Plasticity of opsin gene expression in cichlids from Lake Malawi. Mol Ecol 19:2064–2074. [DOI] [PubMed] [Google Scholar]
- Holland LZ, Albalat R, Azumi K, et al. 2008. The amphioxus genome illuminates vertebrate origins and cephalochordate biology. Genome Res 18:1100–1111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hong SE, Song HK, Kim DH. 2014. Identification of tissue‐enriched novel transcripts and novel exons in mice. BMC Genomics 15:592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Irie N, Kuratani S. 2011. Comparative transcriptome analysis reveals vertebrate phylotypic period during organogenesis. Nat Commun 2:248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jeffery JE, Bininda‐Emonds ORP, Coates MI, Richardson MK. 2002. Analyzing evolutionary patterns in amniote embryonic development. Evol Dev 4:292–302. [DOI] [PubMed] [Google Scholar]
- Jones FC, Grabherr MG, Chan YF, et al. 2012. The genomic basis of adaptive evolution in threespine sticklebacks. Nature 484:55–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kalinka AT, Varga KM, Gerrard DT, et al. 2010. Gene expression divergence recapitulates the developmental hourglass model. Nature 468:811–814. [DOI] [PubMed] [Google Scholar]
- Kapushesky M, Emam I, Holloway E, et al. 2010. Gene expression atlas at the European bioinformatics institute. Nucleic Acids Res 38:D690–698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kassahn KS, Dang VT, Wilkins SJ, Perkins AC, Ragan MA. 2009. Evolution of gene function and regulatory control after whole‐genome duplication: comparative analyses in vertebrates. Genome Res 19:1404–1418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khaitovich P, Hellmann I, Enard W, et al. 2005. Parallel patterns of evolution in the genomes and transcriptomes of humans and chimpanzees. Science 309:1850–1854. [DOI] [PubMed] [Google Scholar]
- Khan Z, Ford MJ, Cusanovich DA, et al. 2013. Primate transcript and protein expression levels evolve under compensatory selection pressures. Science 342:1100–1104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Knecht AK, Hosemann KE, Kingsley DM. 2007. Constraints on utilization of the EDA‐signaling pathway in threespine stickleback evolution. Evol Dev 9:141–154. [DOI] [PubMed] [Google Scholar]
- Koenig D, Jimenez‐Gomez JM, Kimura S, et al. 2013. Comparative transcriptomics reveals patterns of selection in domesticated and wild tomato. Proc Natl Acad USA 110:E2655–E2662. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kulahoglu C, Denton AK, Sommer M, et al. 2014. Comparative transcriptome atlases reveal altered gene expression modules between two Cleomaceae C3 and C4 plant species. Plant Cell 26:3243–3260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Laudet V. 2011. The origins and evolution of vertebrate metamorphosis. Curr Biol 21:R726–R737. [DOI] [PubMed] [Google Scholar]
- Lee JH, Daugharthy ER, Scheiman J, et al. 2014. Highly multiplexed subcellular RNA sequencing in situ. Science 343:1360–1363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Levine M, Davidson EH. 2005. Gene regulatory networks for development. Proc Natl Acad Sci USA 102:4936–4942. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li JJ, Huang H, Bickel PJ, Brenner SE. 2014. Comparison of D. melanogaster and C. elegans developmental stages, tissues, and cells by modENCODE RNA‐seq data. Genome Res 24:1086–1101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liao BY, Zhang J. 2006. Evolutionary conservation of expression profiles between human and mouse orthologous genes. Mol Biol Evol 23:530–540. [DOI] [PubMed] [Google Scholar]
- Lovatt D, Ruble BK, Lee J, et al. 2014. Transcriptome in vivo analysis (TIVA) of spatially defined single cells in live tissue. Nat Methods 11:190–196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lu Y, Huggins P, Bar‐Joseph Z. 2009. Cross species analysis of microarray expression data. Bioinformatics 25:1476–1483. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Magalhaes I, Vudattu NK, Ahmed RK, et al. 2010. High content cellular immune profiling reveals differences between rhesus monkeys and men. Immunology 131:128–140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Manceau M, Domingues VS, Linnen CR, Rosenblum EB, Hoekstra HE. 2010. Convergence in pigmentation at multiple levels: mutations, genes and function. Philos Trans R Soc Lond B Biol Sci 365:2439–2450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin A, Orgogozo V. 2013. The Loci of repeated evolution: a catalog of genetic hotspots of phenotypic variation. Evol Int J Org Evol 67:1235–1250. [DOI] [PubMed] [Google Scholar]
- McCall MN, Jaffee HA, Zelisko SJ, et al. 2014. The Gene Expression Barcode 3.0: improved data processing and mining tools. Nucleic Acids Res 42:D938–D943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McDonald JH, Kreitman M. 1991. Adaptive protein evolution at the Adh locus in Drosophila. Nature 351:652–654. [DOI] [PubMed] [Google Scholar]
- McManus CJ, Coolon JD, Eipper‐Mains J, Wittkopp PJ, Graveley BR. 2014. Evolution of splicing regulatory networks in Drosophila. Genome Res 24:786–796. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Merkin J, Russell C, Chen P, Burge CB. 2012. Evolutionary dynamics of gene and isoform regulation in Mammalian tissues. Science 338:1593–1599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moczek AP. 2006. Integrating micro‐ and macroevolution of development through the study of horned beetles. Heredity 97:168–178. [DOI] [PubMed] [Google Scholar]
- Monteiro A, Podlaha O. 2009. Wings, horns, and butterfly eyespots: how do complex traits evolve? PLoS Biol 7:e1000037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mungall C, Torniai C, Gkoutos G, Lewis S, Haendel M. 2012. Uberon, an integrative multi‐species anatomy ontology. Genome Biol 13:R5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nadakuduti SS, Holdsworth WL, Klein CL, Barry CS. 2014. KNOX genes influence a gradient of fruit chloroplast development through regulation of GOLDEN2‐LIKE expression in tomato. Plant J 78:1022–1033. [DOI] [PubMed] [Google Scholar]
- Necsulea A, Kaessmann H. 2014. Evolutionary dynamics of coding and non‐coding transcriptomes. Nat Rev Genet 15:734–748. [DOI] [PubMed] [Google Scholar]
- Nielsen C, Martinez P. 2003. Patterns of gene expression: homology or homocracy? Dev Genes Evol 213:149–154. [DOI] [PubMed] [Google Scholar]
- Niknejad A, Comte A, Parmentier G, et al. 2012. vHOG, a multi‐species vertebrate ontology of homologous organs groups. Bioinformatics 28:1017–1020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Noy NF, Shah NH, Whetzel PL, et al. 2009. BioPortal: ontologies and integrated data resources at the click of a mouse. Nucleic Acids Res 37:W170–W173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nuzhdin SV, Wayne ML, Harmon KL, McIntyre LM. 2004. Common pattern of evolution of gene expression level and protein sequence in Drosophila. Mol Biol Evol 21:1308–1317. [DOI] [PubMed] [Google Scholar]
- O'Quin KE, Hofmann CM, Hofmann HA, Carleton KL. 2010. Parallel evolution of opsin gene expression in African cichlid fishes. Mol Biol Evol 27:2839–2854. [DOI] [PubMed] [Google Scholar]
- Ometto L, Shoemaker D, Ross KG, Keller L. 2011. Evolution of gene expression in fire ants: the effects of developmental stage, caste, and species. Mol Biol Evol 28:1381–1392. [DOI] [PubMed] [Google Scholar]
- Oshlack A, Chabot AE, Smyth GK, Gilad Y. 2007. Using DNA microarrays to study gene expression in closely related species. Bioinformatics 23:1235–1242. [DOI] [PubMed] [Google Scholar]
- Pankey MS, Minin VN, Imholte GC, Suchard MA, Oakley TH. 2014. Predictable transcriptome evolution in the convergent and complex bioluminescent organs of squid. Proc Natl Acad Sci USA 111:E4736–E4742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pantalacci S, Semon M. 2014. Transcriptomics of developing embryos and organs: a raising tool for evo‐devo. J Exp Zool B Mol Dev Evol. Available online at: http://onlinelibrary.wiley.com/doi/10.1002/jez.b.22595/abstract [DOI] [PubMed] [Google Scholar]
- Parikh A, Miranda ER, Katoh‐Kurasawa M, et al. 2010. Conserved developmental transcriptomes in evolutionarily divergent species. Genome Biol 11:R35. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Parmentier G, Bastian FB, Robinson‐Rechavi M. 2010. Homolonto: generating homology relationships by pairwise alignment of ontologies and application to vertebrate anatomy. Bioinformatics 26:1766–1771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pera EM, Acosta H, Gouignard N, Climent M, Arregi I. 2014. Active signals, gradient formation and regional specificity in neural induction. Exp Cell Res 321:25–31. [DOI] [PubMed] [Google Scholar]
- Perry GH, Melsted P, Marioni JC, et al. 2012. Comparative RNA sequencing reveals substantial genetic variation in endangered primates. Genome Res 22:602–610. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Piasecka B, Kutalik Z, Roux J, Bergmann S, Robinson‐Rechavi M. 2012a. Comparative modular analysis of gene expression in vertebrate organs. BMC Genomics 13:124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Piasecka B, Robinson‐Rechavi M, Bergmann S. 2012b. Correcting for the bias due to expression specificity improves the estimation of constrained evolution of expression between mouse and human. Bioinformatics 28:1865–1872. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Piatigorsky J, Wistow G. 1991. The recruitment of crystallins: new functions precede gene duplication. Science 252:1078–1079. [DOI] [PubMed] [Google Scholar]
- Pipes L, Li S, Bozinoski M, et al. 2013. The non‐human primate reference transcriptome resource (NHPRTR) for comparative functional genomics. Nucleic Acids Res 41:D906–D914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prud'homme B, Gompel N, Rokas A, et al. 2006. Repeated morphological evolution through cis‐regulatory changes in a pleiotropic gene. Nature 440:1050–1053. [DOI] [PubMed] [Google Scholar]
- Raff RA. 2000. Evo‐devo: the evolution of a new discipline. Nat Rev Genet 1:74–79. [DOI] [PubMed] [Google Scholar]
- Ranz JM, Castillo‐Davis CI, Meiklejohn CD, Hartl DL. 2003. Sex‐dependent gene expression and evolution of the Drosophila transcriptome. Science 300:1742–1745. [DOI] [PubMed] [Google Scholar]
- Reyes A, Anders S, Weatheritt RJ, et al. 2013. Drift and conservation of differential exon usage across tissues in primate species. Proc Natl Acad Sci USA 110:15377–15382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rifkin SA, Kim J, White KP. 2003. Evolution of gene expression in the Drosophila melanogaster subgroup. Nat Genet 33:138–144. [DOI] [PubMed] [Google Scholar]
- Robinson PN, Webber C. 2014. Phenotype ontologies and cross‐species analysis for translational research. PLoS Genet 10:e1004268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rohlfs RV, Harrigan P, Nielsen R. 2014. Modeling gene expression evolution with an extended Ornstein‐Uhlenbeck process accounting for within‐species variation. Mol Biol Evol 31:201–211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roux J, Robinson‐Rechavi M. 2010. An ontology to clarify homology‐related concepts. Trends Genet 26:99–102. [DOI] [PubMed] [Google Scholar]
- Rung J, Brazma A. 2013. Reuse of public genome‐wide gene expression data. Nat Rev Genet 14:89–99. [DOI] [PubMed] [Google Scholar]
- Rustici G, Kolesnikov N, Brandizi M, et al. 2013. ArrayExpress update‐trends in database growth and links to data analysis tools. Nucleic Acids Res 41:D987–D990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schlager B, Röseler W, Zheng M, Gutierrez A, Sommer RJ. 2006. HAIRY‐like transcription factors and the evolution of the nematode vulva equivalence group. Curr Biol 16:1386–1394. [DOI] [PubMed] [Google Scholar]
- Shubin N, Tabin C, Carroll S. 2009. Deep homology and the origins of evolutionary novelty. Nature 457:818–823. [DOI] [PubMed] [Google Scholar]
- Smith CM, Finger JH, Hayamizu TF, et al. 2014. The mouse Gene Expression Database (GXD): 2014 update. Nucleic Acids Res 42:D818–D824. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Snoek LB, Sterken MG, Volkers RJ, et al. 2014. A rapid and massive gene expression shift marking adolescent transition in C. elegans . Sci Rep 4:3912. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sobel JM, Streisfeld MA. 2013. Flower color as a model system for studies of plant evo‐devo. Front Plant Sci 4:321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Somel M, Franz H, Yan Z, et al. 2009. Transcriptional neoteny in the human brain. Proc Natl Acad Sci USA 106:5743–5748. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sonnhammer ELL, Gabaldón T, Sousa da Silva AW, et al. 2014. Big data and other challenges in the quest for orthologs. Bioinformatics 30:2993–2998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steiner DJ, Kim A, Miller K, Hara M. 2010. Pancreatic islet plasticity: interspecies comparison of islet architecture and composition. Islets 2:135–145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stölting KN, Wilson AB. 2007. Male pregnancy in seahorses and pipefish: beyond the mammalian model. BioEssays 29:884–896. [DOI] [PubMed] [Google Scholar]
- Stuart JM, Segal E, Koller D, Kim SK. 2003. A gene‐coexpression network for global discovery of conserved genetic modules. Science 302:249–255. [DOI] [PubMed] [Google Scholar]
- Su AI, Wiltshire T, Batalov S, et al. 2004. A gene atlas of the mouse and human protein‐encoding transcriptomes. Proc Natl Acad Sci USA 101:6062–6067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tang F, Lao K, Surani MA. 2011. Development and applications of single cell transcriptome analysis. Nat Methods 8:S6–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thisse B, Heyer V, Lux A, et al. 2004. Spatial and temporal expression of the zebrafish genome by large‐scale in situ hybridization screening. Methods Cell Biol 77:505–519. [DOI] [PubMed] [Google Scholar]
- Tschopp P, Sherratt E, Sanger TJ, et al. 2014. A relative shift in cloacal location repositions external genitalia in amniote evolution. Nature 516:391–394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vallee M, Robert C, Methot S, Palin MF, Sirard MA. 2006. Cross‐species hybridizations on a multi‐species cDNA microarray to identify evolutionarily conserved genes expressed in oocytes. BMC Genomics 7:113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wagner GP. 1989. The biological homology concept. Annu Rev Ecol Syst 20:51–569. [Google Scholar]
- Wartlick O, Jülicher F, Gonzalez‐Gaitan M. 2014. Growth control by a moving morphogen gradient during Drosophila eye development. Development 141:1884–1893. [DOI] [PubMed] [Google Scholar]
- Washietl S, Kellis M, Garber M. 2014. Evolutionary dynamics and tissue specificity of human long noncoding RNAs in six mammals. Genome Res 24:616–628. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wittkopp PJ, True JR, Carroll SB. 2002. Reciprocal functions of the Drosophila Yellow and Ebony proteins in the development and evolution of pigment patterns. Development 129:1849–1858. [DOI] [PubMed] [Google Scholar]
- Xing Y, Ouyang Z, Kapur K, Scott MP, Wong WH. 2007. Assessing the conservation of mammalian gene expression using high‐density exon arrays. Mol Biol Evol 24:1283–1285. [DOI] [PubMed] [Google Scholar]
- Yanai I, Peshkin L, Jorgensen P, Kirschner M. 2011. Mapping gene expression in two xenopus species: evolutionary constraints and developmental flexibility. Dev Cell 20:483–496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang Z. 2006. Computational molecular evolution. USA: Oxford University Press. [Google Scholar]
- Yang H, Li DW, Cheng C. 2014. Relating gene expression evolution with CpG content changes. BMC Genomics 15:693. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yano T, Tamura K. 2013. The making of differences between fins and limbs. J Anat 222:100–113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zheng W, Wang Z, Collins JE, et al. 2011. Comparative transcriptome analyses indicate molecular homology of zebrafish swimbladder and mammalian lung. PLoS ONE 6:e24019. [DOI] [PMC free article] [PubMed] [Google Scholar]