Abstract
Why gene order is conserved over long evolutionary timespans remains elusive. A common interpretation is that gene order conservation might reflect the existence of functional constraints that are important for organismal performance. Alteration of the integrity of genomic regions, and therefore of those constraints, would result in detrimental effects. This notion seems especially plausible in those genomes that can easily accommodate gene reshuffling via chromosomal inversions since genomic regions free of constraints are likely to have been disrupted in one or more lineages. Nevertheless, no empirical test has been performed to this notion. Here, we disrupt one of the largest conserved genomic regions of the Drosophila genome by chromosome engineering and examine the phenotypic consequences derived from such disruption. The targeted region exhibits multiple patterns of functional enrichment suggestive of the presence of constraints. The carriers of the disrupted collinear block show no defects in their viability, fertility, and parameters of general homeostasis, although their odorant perception is altered. This change in odorant perception does not correlate with modifications of the level of expression and sex bias of the genes within the genomic region disrupted. Our results indicate that even in highly rearranged genomes, like those of Diptera, unusually high levels of gene order conservation cannot be systematically attributed to functional constraints, which raises the possibility that other mechanisms can be in place and therefore the underpinnings of the maintenance of gene organization might be more diverse than previously thought.
Author Summary
Eukaryotic genomes have been reshaped by chromosomal rearrangements during evolution. However, the comparison of distantly related species has uncovered unusually large genomic regions with conserved gene organization. A widely accepted explanation is that, in those regions, there exist genes with joint and/or intricate regulation, which, if altered, might affect the performance of the carriers. We used a system that allowed us to precisely disrupt one of the largest genomic regions that has been conserved in the fruit flies since ∼63 million years ago and performed a variety of assays devised to detect differences in organismal performance and anomalous gene expression. Despite the overrepresentation of genes with expression profiles related to male fertility and detection of chemical stimuli in this genomic region, as well as the presence of genes expressed during development and putatively regulated by long distantly located sequences, we do not find evidence of diminished viability and fertility or of anomalous levels of expression. These results lead us to propose that regulatory constraints might not suffice to explain the maintenance of the integrity of some of the best candidate regions in one of the most dynamic eukaryotic genomes.
Introduction
Collinearity conservation, i.e. conservation of local gene order, across distantly related phyla is often viewed as the result of functional constraints that prevent the occurrence of breaks of chromosomal rearrangements during evolution. The nature of these constraints is still poorly understood. They may merely reflect the presence of yet-to-be annotated protein and nonprotein-coding genes in intergenic regions [1]–[3]. A second type of constraints may be linked to the existence of regulatory domains, i.e. genomic regions associated with complex regulatory inputs. These regulatory domains can adopt at least two forms. One common form of complex regulatory inputs corresponds to genes that are coordinately expressed or repressed [4]–[7]. Alternatively, regulatory constraints can adopt the form of long-range regulatory interactions, which often involve the interdigitation of cis-regulatory sequences with genes that are not their targets [8]–[10]. These regulatory domains are enriched for noncoding DNA (highly conserved noncoding elements or HCNEs) with putative regulatory potential [11]–[16]. These HCNEs tend to be found in the vicinity of protein-coding genes that participate in key processes during development, such as regulation of gene expression and signal transduction [10], [12], [17]. Disruption of genomic regions under constraints can be accompanied by alteration of gene activity, as illustrated by chromosomal rearrangements that modify gene expression as a result of the separation of a gene from its regulatory sequences [18]–[20]. These alterations in gene activity may have a detrimental effect, which would lead to conservation of gene organization [19], [21].
Highly rearranged genomes, such as those of the Diptera, are especially suitable for analyzing the presence of regulatory-based constraints that preserve collinearity since regions free of them are likely to have been disrupted in one or more lineages. Gene order comparisons have helped to delineate collinear blocks [22] across nine Drosophila species that represent ∼380 million years (myr) of total divergence time [23], [24]. A minimum of ∼6,100 chromosomal breakpoints have been estimated to have occurred in 2,688 intergenic regions [22] scattered across the main chromosomal elements (the so-called Muller's elements A–E) that constitute the Drosophila genome [25], [26]. The analysis of the expected patterns of evolution of gene organization under different evolutionary scenarios indicate that fragile regions, i.e. those that accumulate most chromosomal breaks during evolution [27], are the main factor that explains the patterns of gene arrangement across Drosophila species [22]. Constraints nevertheless may be responsible for ∼15% of gene order conservation and their presence is positively correlated with the size of the collinear block [22]. The top 1% largest collinear blocks, or ultraconserved regions [22], are enriched for genes associated with particular expression profiles, but the functional signature most prominently found in ultraconserved regions are stretches of DNA with multiple HCNEs (14.5% of the 145 HCNE peaks mapped as compared to 6.7% expected).
To our knowledge, only two empirical tests for the presence of functional constraints have been performed in eukaryotic genomes [20], [28]. In both cases, engineered chromosomal inversions were induced to disrupt clusters of genes with shared expression attributes and, subsequently, the phenotypic consequences of such disruptions monitored. For example, the disruption of the mouse Hoxd cluster, which is conserved across vertebrates, helped determine the presence of two functional subdomains and two long-range enhancers on either side of the cluster. This functional organization of the Hoxd cluster was proposed as the underlying cause that kept the cluster intact during the evolution of vertebrate lineages. In Diptera, three gene neighborhoods expressed in testes and one gene neighborhood expressed during embryogenesis of D. melanogaster have been disrupted [28], but no modification of the expression of the genes included in the neighborhoods examined was detected. These neighborhoods are conserved within the D. melanogaster species subgroup but disrupted in some other Drosophila lineages (Table S1), which does not clarify the phylogenetic scope of the putative constraints tested. The phenotypic consequences of disrupting a collinear region conserved throughout the genus Drosophila, which is presumably maintained by constraints, are presently unknown.
Here, we used chromosome engineering to disrupt an ultraconserved region located on Muller's element C of the Drosophila genome (Figure 1A). This ultraconserved region is delimited by the genes CG15121 and CG16894 and is ∼701 kb long in D. melanogaster [22]. Importantly, this ultraconserved region is, based on current information, the one with the largest number of functional signatures, suggestive of the presence of regulatory-based constraints [22]. After disrupting the ultraconserved region CG15121–CG16894, we examined the resulting phenotypic consequences both by performing a variety of genetic and competition experiments that tested for differences in viability, fertility, and relevant parameters of general homeostasis and by characterizing levels of mRNA abundance in both sexes. Our results indicate that, contrary to the prevalent view, the extraordinary conservation of some of the largest collinear blocks in eukaryotic genomes might not necessarily derive only from functional constraints that result in severe detrimental effects.
Results
Functional and comparative organization of the ultraconserved region CG15121–CG16894
The ultraconserved region CG15121–CG16894 ranked first in length and eleventh in the number of genes encompassed among 2,683 regions of conserved collinearity across nine Drosophila species [22]. Data from another comparative analysis on gene organization in the genus Drosophila [29] are consistent with the overall maintenance of the collinearity in this genomic region. In addition, this region shows statistically significant enrichment for genes encoding proteins involved in chemosensory perception and for genes preferentially expressed in males, many of them showing this same trend across multiple Drosophila species (Figure 1B and Table S2). This higher-than-expected local density of genes with coherent patterns of expression supports the presence of a male-biased gene expression neighborhood, which is intertwined with a smaller chemosensory perception gene neighborhood.
This ultraconserved region is spanned by four HCNE peaks [12] (Figure 1C), more than any other collinear block. Genes responsive to HCNEs have been postulated to be preferentially associated with a particular kind of core promoters. Specifically, using promoter predictions for 42% of the protein-coding genes of D. melanogaster, a significant overrepresentation of genes with some kind of Inr-motif (Inr only, Inr/DPE, or Inr/TATA) was found in HCNE-dense regions [12]. We screened 500 nt upstream of each protein-coding gene in the region under study using McPromoter and obtained reliable predictions for eight genes. Six of these genes were predicted to have a core promoter responsive to HCNEs. These genes are found scattered along the region (Figure 1B and Figure S1) and encode proteins involved in the regulation of gene expression, from mRNA binding proteins (sm), to signal transducers (Toll-7, 18-w), to ubiquitins (Isopeptidase-T-3), and also to others that we did not predict a priori (Obp56a and CG8654). Given the large number of genes for which a prediction was not provided by McPromoter, we checked for the presence of genes that are expressed during key developmental stages using the expression profile characterization generated by modENCODE [2]. Fifteen out of 36 protein-coding genes show a preferential pattern of expression during embryogenesis; among them, there is a cluster of eight genes mostly displaying high levels of expression during the first 16 h of development and moderate expression during the larva-pupa transition (Figure S1). Six of these eight genes are predicted to have core promoters of the Inr-motif type and four are associated with lethal phenotypes, the latter underscoring their functional relevance prior to imago emergence (Figure S1) [30], [31]. Together, these functional features associated with some of the genes in the region are suggestive of regulation by HCNEs.
Examination of the organization of the ultraconserved region CG15121–CG16894 in Anopheles gambiae revealed the presence of orthologues in six different locations (Figure S2A and Table S3). This degree of dispersion is not surprising given the fast differentiation of the Drosophila and Anopheles genomes via the accumulation of chromosomal rearrangements [32]–[34] and the lack of common constraints reflected in the pronounced differences in development, morphology, and ecology of these two Diptera [33], [35], [36]. Nevertheless, we detected a conserved gene arrangement including the gene Toll-7 and the Obp genes, an association that is also present in A. aegypti (Figure S2B). Phylogenetic analyses [37] unambiguously support the close phylogenetic relationship among several Obp genes in the ultraconserved region under study (Obp56a, Obp56d, and Obp56e) and some that are adjacent to Toll-7 in A. gambiae (OBP23, OBP25, OBP26, OBP28), which in turn are closely related to those that cluster nearby Toll-7 in A. aegypti ([38] and this work; Figure S2C). Interestingly, the genes Obp56a, Obp56d and Toll-7 are found within the same expression cluster in D. melanogaster and two of them have core promoter types presumably responsive to HCNE-mediated regulation (Figure S1).
To sum up, the existence of two intertwined gene neighborhoods associated with very marked expression profiles, the enrichment for HCNE peaks and presence of their putative targets, and the detection of a region conserved across Diptera reinforces the possibility that one or more regulatory-based constraints might exist in the ultraconserved region CG15121–CG16894.
Disruption of the ultraconserved region CG15121–CG16894
To assess the importance of the integrity of the ultraconserved region CG15121–CG16894, we aimed to disrupt it and characterize the resulting phenotypic effects. We examined the existence of stocks carrying isolated naturally occurring inversions disrupting the ultraconserved region and none was found. Thus, we generated a disruption of the ultraconserved region CG15121–CG16894 by inducing the inversion In(2R)51F11-56E2 in D. melanogaster, a species in which nonallelic homologous recombination (NAHR) events can be mediated between FLP recombination target-bearing transposable elements (FRT-bearing TEs hereafter) via activation of a heat-inducible flippase-recombinase [28], [39]–[42] (Figure S3, S4, S5). For that, we used two TEs bearing FRT sites in opposite orientation: P{RS5}5-HA-1995, which is inside the ultraconserved region CG15121–CG16894; and P{RS3}CB-0236-3, which is located 4.35 Mb upstream (Figure 1A). The outer element is virtually terminal within a collinear block of 15 genes and is inserted into a naturally occurring copy of the TE 1360{}835 [43].
We adopted several measures to avoid confounding effects that could overlay those of the intended disruption of the ultraconserved region CG15121–CG16894. First, TEs were selected to avoid disrupting any known regulatory sequences of flanking genes and those presumably embedded in HCNE peaks, thus preventing the generation of artifactual position effects. The comparison of the size and sex ratio of the progeny of flies homozygous for each of the TEs confirmed the absence of any obvious detrimental effect associated with particular TE insertions (Figure S6). Second, in addition to strains carrying the inversion In(2R)51F11-56E2 (INV1 and INV2), we generated several control strains to account for further mutations that could have been incidentally generated by our approach [44]. Specifically, three kinds of control strains were constructed: strains carrying two FRT-bearing TEs in cis (REC), i.e. just before inducing the NAHR event that mediates the inversion; strains carrying the standard arrangement as a result of failed induced NAHR events but that were exposed to the same experimental conditions as the INV strains -SIMultaneous controls- (SIM1, SIM2, and SIM3); and strains in which the inverted segment is reverted back to its original orientation -REVertant controls- (REV1 and REV2) (Figure 2A, Figure S3 and S4). The main molecular changes at the inversion breakpoints of all relevant strains, plus those carrying the original FRT-bearing TEs, are depicted in Figure S5. In the absence of any secondary effect of our procedure, SIM, REV, and REC should perform likewise as measured by the size and sex ratio in the progeny of low-density crosses with homozygous flies (Figure S7). INV strains for which no SIM and/or REV control lines could be generated were discarded. For the remaining strains, their putative karyotype was verified at the cytological level and the expected molecular organization at their breakpoint regions confirmed by PCR and Sanger sequencing (Figure 2B–2C and 8). Further expression profiling confirmed the absence of local position effects at the inversion breakpoints (see below). Lastly, engineered chromosomes were maintained in homozygosis thus preventing the otherwise unavoidable accumulation of detrimental mutations if kept in heterozygosis over a balancer chromosome. Most of the strains that were not discarded were included in one or more downstream analyses.
Phenotypic consequences of disrupting the ultraconserved region CG15121–CG16894
The disruption of the ultraconserved region CG15121–CG16894 did not lead to viability impairment in the progeny of the carriers (INV1, INV2) as compared to that of non-carriers based on two proxies examined (Figure S7). Cursory examination of embryos, larva, and pupa did not detect any obvious morphological defect either. We then explored the possibility of a detrimental effect in heterozygous condition before reaching adult eclosion, either because of the disruption of the collinear block, meiotic distortion due to the presence of a chromosomal inverted rearrangement, or both. The comparison of different chromosome combinations revealed no departures from the expected Mendelian ratios (1∶2∶1) and absence of sex-specific effects (Figure S9). Further, we assessed differential viability during early stages (i.e. prior to imago emergence), when competition among individuals is specially intense [45], [46], and when most of the genes putatively targeted by HCNEs exhibit high levels of expression. Differences among carriers of different 2R chromosomes were detected in frequency-dependent competition experiments involving different pairwise combinations of embryos, but those differences were not consistently shown by the carriers of the disrupted ultraconserved region (INV1, INV2) compared to those that carry it in its intact form (REC) across different genetic backgrounds (Figure S10). Together, these results do not point out to any obvious detrimental effect on viability, and therefore on stages of the life cycle that encompass key developmental transitions, as a result of disrupting the ultraconserved region CG15121–CG16894.
Next, we examined the effect of disrupting the ultraconserved region CG15121–CG16894 after imago emergence since many of the protein-coding genes included in the region under study are expressed during adulthood, often in a sex-dependent fashion (Figure S1). Specifically, the region under study is populated with male-biased genes in expression, 11 of them preferentially expressed in testes [22] and four, all of them Obp genes, present in the seminal fluid (Figure 1; Table S2). We tested for differences in several parameters of male fertility: progeny size, sperm performance, and mating ability. The comparison of males from the strains INV1, INV2, and REC, which are all red-eyed so that differences due to differential pigmentation can be factored out, revealed that although there are differences in progeny size (Figure S11A), strains INV1 and INV2 are more different from each other than either of them is to strain REC. Double-mating experiments did not reveal any substantial difference in sperm performance (Figure S11B) and mating ability (Figure S12). Further, given the presence of nine odorant-binding and one odorant-receptor protein-coding genes in the ultraconserved region CG15121–CG16894, we analyzed the odorant abilities of the different strains. We examined the response to three volatile compounds (ethanol, acetone, and benzaldehyde) and found statistically significant differences in five out of 12 sex-by-strain combinations. Importantly, for females exposed to ethanol at a concentration of 10−3 and males exposed to acetone at a concentration of 10−4.5, strains INV1 and INV2 exhibited a coherent pattern of differentiation from strain REC (Figure 3; Table S14), involving in all cases an attenuated attraction to the chemical in question. This attenuated response does not result from an overall impairment of the odorant abilities of the flies as shown by the response to the repellent compound benzaldehyde. Analyses of different proxies for the general adult homeostasis (negative gravitaxis, heat-shock resistance, desiccation resistance, and starvation resistance) did not uncover any other difference between the strains with and without the disrupted ultraconserved region CG15121–CG16894 (Figure S13).
Lastly, we tested whether the disruption of the ultraconserved region CG15121–CG16894 resulted in a perturbation of gene expression. We performed a microarray-based characterization of the transcriptome of six lines (REC, INV1, INV2, SIM1, REV1, and REV2) during adulthood, the stage in which we found evidence of phenotypic differences associated with the disruption generated. At FDR 0.01, we found a very limited number of differentially expressed transcripts both in males and females (0.07% -11/16,637- and 6.2% -1,033/16,637-, respectively; Dataset S1 and Table S16). Further analyses confirmed the similarity of the expression profiles between equivalent strains generated by our procedure (first three planned contrasts in Table S17; Datasets S1 and S2). Likewise, these analyses indicated that the most statistically significant differences in mRNA abundance found are associated with differences in pigmentation (last three planned contrasts in Table S17; Dataset S2), in good agreement with the clustering of expression profiles among strains (Figure 4). The inspection of the chromosomal distribution of the differences in gene expression showed that a few of them were related to genes in the ultraconserved region. However, these alterations in mRNA levels are in fact the result of pigmentation differences, given that these alterations were found invariably between red-eyed and white-eyed strains, regardless of whether the former carry the disrupted (INV1 and INV2) or the intact form (REC) of the ultraconserved region (Figures S14 and S15). Searches for biologically coherent patterns among differentially expressed genes indicated that, for example for females, statistically significant enrichment was found for functional classes related to perception of visual stimuli (Table S18). Further, patterns of sex bias in gene expression were not affected either (Figure S16; Dataset S3). Hence, no discernible effect on the levels of mRNA of genes both inside and outside of the ultraconserved region CG15121–CG16894 was detected as a result of its disruption.
Discussion
The interplay among organization, function, and evolution of eukaryotic chromosomes is still poorly understood. Collinearity conservation is an important genomic feature that can conflate all three aspects, especially in genomes characterized by their ease in accommodating structural variation. Up to date, no empirical evaluation has been performed on the effects of disrupting the integrity of one of the largest genomic regions whose overall gene organization has been preserved over a large time scale. Importantly, this conservation entails species with very different behaviors and ecologies and therefore subject to very different selective pressures [46], [47]. Our results show an absence of detrimental effects on the carriers of the disrupted ultraconserved region for a variety of traits associated with viability and fertility. Although the absence of detrimental phenotypic effects could result from a limited ability to detect differences as statistically significant, the comparison of actual and estimated ideal samples sizes shows that this potential limitation could be the explanation in only a few cases (Table S19). Therefore, at least in Diptera genomes, our results show that an unusually high degree of collinearity conservation coupled with enrichment for functional coherent patterns is not necessarily associated with severe detrimental effects upon perturbation [20], [48].
The ultraconserved region CG15121–CG16894 harbors at least three sets of protein-coding genes that may be associated with regulatory-based constraints. The genes fall into the following broad categories: detection of chemical stimuli; sperm manufacturing and performance; and developmental processes. The only phenotype detected in association with our disruption was the more attenuated odorant response to attractant volatile compounds. Although variation in olfactory response occurs both within and between species [49], [50], it has been shown that the altered function of Obp genes, including some in the region studied, can affect fitness components in D. melanogaster [51]. Transcriptome characterization of adult flies did not uncover any obvious consequences in the activity of the Obp genes due to our disruption as measured by mRNA abundance, and magnitude and direction of sex bias. This lack of evidence for misregulation cannot rule out though that some other expression attributes, such as the spatial distribution of transcripts, had not been altered since they would have gone unnoticed by our approach. The evolutionary relevance of the maintenance of the clustering of Obp genes throughout the genus Drosophila is reinforced by the presence of an orthologous arrangement in different mosquito species to an extent not explained by chance [37]. In fact, the maintenance of the arrangement that includes a core cluster of Obp genes and the gene Toll-7 for ∼970 myr was unexpected due to the extent of rearrangement undergone by the genomes of the Diptera involved [22], [33], [34] (Figure S2B). Regardless, the conservation of the cluster of Obp genes and Toll-7 would not explain the collinearity conservation observed elsewhere in the ultraconserved region under study.
In relation to the two other biological signatures suggestive of constraints, we detected no evidence that our perturbation resulted in a detectable phenotype. The genes included in the two intertwined gene expression neighborhoods of the ultraconserved region CG15121–CG16894 do not show evidence of altered mRNA levels or malfunction that could result in impaired male fertility. Whether the constituent genes are under a tight coordinated regulation [52] that could affect the long-term stability of the region does not seem to be upheld by our results. This conclusion would be reinforced by the genus-wide lability shown by clusters of male-biased genes in expression [22]. The absence of misregulation, as measured by mRNA abundance, is consistent with the results obtained when D. melanogaster male-specific gene neighborhoods spanning over hundred of kb were disrupted [28].
Further, protein-coding genes presumably responsive to long-range regulation mediated by HCNEs during development are found scattered across the ultraconserved region CG15121–CG16894 at both sides of the disruption. The lack of a detrimental phenotype could reflect that the region is a composite of several autonomous genomic regulatory domains, each of them under the control of a particular HCNE peak. Under this scenario, our disruption would have separated different genomic regulatory domains without affecting any long-range interaction between HCNEs and their targets. Nevertheless, marked autonomy among HCNE peaks should result in certain regionalization of the expression profiles, i.e. physically closer HCNE targets should exhibit more similar expression profiles. In fact, we find putative targets of HCNEs located at both sides of the disruption, such as CG9854 and CG8896, showing very similar expression profiles (Figure 1 and Figure S1). If this similarity denotes the existence of long-range interactions, the latter are not associated with detrimental effects that explain the collinearity conservation. Alternatively, some degree of regulatory redundancy could exist at both sides of the disruption, which would explain the lack of phenotypic effect but not the collinearity conservation.
A different view of collinearity conservation entails factors other than functional constraints. DNA stretches with sufficient sequence identity can mediate NAHR events that give rise to chromosomal rearrangements [53]. Large collinear regions could be depleted of this type of sequences thus explaining why this ultraconserved region has maintained its integrity. A search for well-annotated sequences with the potential to mediate NAHR events [54] in the studied region across Drosophila species revealed that these sequences are present (Table S20). These sequences include ncRNA genes such as tRNAs [55], [56], rRNAs genes [57], [58], and snoRNAs [59]. Equivalent comparative analysis focusing on TEs, once properly annotated, will enable to test whether the region under study is particularly depleted for these sequences, which would decrease its propensity of being rearranged. Further, recent findings indicate that orthologous landmarks harboring genes that bind to the nuclear periphery are significantly larger than those that do not harbor any suggesting that particular intranuclear localizations might provide molecular environments associated with higher levels of genome stability [60]. The ultraconserved region CG15121–CG16894 is known to establish some contact with the nuclear periphery [61]. Specifically, it contains at least five protein-coding genes (CG16716, CG13872, CG10822, CG8654, CG16898) that exhibit statistically significant association with the B-type Lamin protein, a key component of the inner nuclear membrane [60], [62]. This pattern leaves open the possibility that some regulatory-based constraints had evolved under the enhanced evolutionary stability enjoyed by genomic regions associated with the nuclear periphery, and therefore functional constraints would not be the only mechanism contributing to their collinearity conservation.
Regardless how extensive the characterization of individuals carrying engineered genomic regions could be, our ability to detect phenotypic effects will always be contingent to the experimental setting used and the timescale in which the assays are performed. Nevertheless, our results raise the possibility that, at least in Diptera genomes, the mechanistic basis of collinearity conservation might be much more subtle and diverse than previously thought and that regulatory-based interactions might not suffice to account for the patterns of extensive conservation seen in some genomic regions [22]. Only further empirical tests for this and other ultraconserved regions can shed light on the scope of our observations.
Materials and Methods
Fly husbandry
Table S4 describes the strains used. Fly cultures were grown and maintained on dextrose-cornmeal-yeast medium at room temperature. Unless otherwise stated, all phenotypic assays were performed at 25°C in a temperature-controlled chamber and fly manipulation, sorting, and scoring were carried out under CO2 anesthesia. Strains generated are available upon request.
Inversion generation
Strains carrying the original FRT-bearing TE insertions (5-HA-1996 and CB-0236-3) were selected from the DrosDel collection and examined at different levels prior to their use in the generation of the chromosomal inversion In(2R)51F11-56E2. Sanger sequencing confirmed the insertion point of the FRT-bearing TEs used. Low-density crosses using homozygous flies were also performed to test for potential detrimental effects associated with the TE insertions. Briefly, six sexually mature individuals (three four-day-old females and three two-day-old males) were used per cross and strain; each cross was replicated five times. Males were discarded after 24 hours while females were transferred twice every three days until discarded on the ninth day.
To generate the inversion In(2R)51F11-56E2, we followed essentially [42] with slight modifications (John Roote, pers. comm.; Figure S3A–S3B). Three types of strains are generated under this procedure (REC, INV, and SIM); a fourth type of strain (REV; Figure S3C) was generated by reverting the inversion In(2R)51F11-56E2 [40]. The strains generated were scrutinized for evidence of side effect associated with our procedure (see below). Individuals from strains deemed as valid were crossed with those from the strain w1118, which possesses the standard arrangement for all chromosomes, and the third-instar salivary gland polytene chromosomes of the progeny examined. Chromosome squashes were stained with orcein and inspected with a Zeiss AX10 Imager M1 microscope. Cytological analysis was performed using the photographic polytene maps of D. melanogaster as a reference [63]. Diagnostic DNA stretches at the breakpoint regions were PCR amplified and their identity verified by Sanger sequencing; sequences were deposited at GenBank (http://www.ncbi.nlm.nih.gov/genbank/; JN805541–JN805602). Amplicons C, D, E, and F confirmed the presence of FRT-bearing TEs and their derivatives at the breakpoint regions. Amplicon G, or G′ after excision of one of the exons of the reporter gene, and H correlate with the absence and presence of the inversion, respectively (Tables S5 and S6 for further details on primers used and amplicons). Genomic DNA used in PCR genotyping was extracted from 50 individuals of each strain as described [39] and quantified using a NanoDrop 8000 Spectrophotometer. Takara Taq and Takara Ex Taq, depending on the size of the DNA fragment to be amplified, were used according to manufacturer conditions. PCR products were resolved on 1% agarose gels and visualized in an AlphaImager HP system. Amplicon sequences were examined for point mutations and indels that could have been generated incidentally during the course of our procedure.
Viability tests
Low-density crosses using homozygous flies were set up for each strain, as described above for the evaluation of the TE insertions, to confirm absence of differences among control strains and to test for differences among the latter and the strains carrying the disrupted ultraconserved region. Additional crosses, five per strain, evaluated the effects of the disruption in heterozygous condition with different chromosomes carrying the standard arrangement (SIM1, REV1, REV2); 10 sexually mature individuals of each sex were used per cross. Further, adapted frequency-dependent experiments [64] evaluated differences in viability of mixtures of embryos for two different genotypes. Two types of mixtures were prepared. In both, w+ individuals (INV1, INV2, or REC; the tested chromosome) compete with w− individuals carrying the ultraconserved region in its intact form. The only difference between the mixtures was the condition of the tested chromosome (heterozygosis, e.g. INV1/SIM1 versus SIM1/SIM1; homozygosis, e.g. INV1/INV1 versus SIM1/SIM1). For both types of mixtures, three sets of experiments were done varying the standard arrangement of the competing embryo (SIM1, REV1, REV2; the tester embryo), which was always in homozygosis. Embryos were collected from grape juice-sucrose-agar plates supplemented with yeast paste using FlyStuff Small Embryo Collection Cages. Using dissecting needles, 100 embryos were deposited on small grape juice-sucrose-agar cube according to one of three starting proportions for the two competing embryos (1∶3, 1∶1, 3∶1). The cube was subsequently introduced into a vial of fresh medium; each assay was replicated 10 times. Separate previous pilot experiments for the strains REC, SIM1, and INV1, using 100 embryos, indicated that the rate of survival was approximately 40% and therefore enough to detect differences between the strains to be compared; each assay was replicated 10 times. In total, 42 different competition settings were set up (7 tester by tested combinations×2 genotype conditions for the tested chromosome×3 starting proportions). Given the number of embryos used across competition experiments (42,000), only those involving the same tester w− embryo (e.g. SIM1/SIM1) were performed simultaneously, which determined how the contrasts were done (three for each genotype condition of the tested chromosome and particular starting proportion, i.e. 18 in total). No bias in sex ratio was assumed in all cases. Progenies from each cross were scored after 15 days to ensure the emergence of all surviving imagoes and the relative viability between the two competing genotypes estimated as (n1 ′×n2)/(n1×n2 ′), where n1 and n2, and n1 ′ and n2 ′, are the number of embryos and imagoes, respectively, of the two competing genotypes. Values of relative viability were log2-transformed.
Male fertility tests
The effects of the induced disruption on male fertility were assayed by examining progeny size, sperm performance, and mating ability. For the first test, we exposed single four-day-old virgin females carrying the ultraconserved region CG15121–CG16894 in its intact form (SIM1, REV1, REV2) to single two-day-old males with (INV1, INV2) or without (REC) the disrupted ultraconserved region. Males were discarded after 24 hours while females were transferred daily to vials with fresh food over a 10-day period. The number of replicates ranged from seven to nine. To evaluate sperm performance in the carriers of the disrupted ultraconserved region, we followed a similar experimental design to that for monitoring progeny size with the exception that the females mated in the first day were exposed to single males of their own strain in subsequent days. The progeny sired by the first and second males was scored based on eye color. For those days during which progeny from both parents were detected, i.e. those oviposited roughly the same day, we estimated the fraction of red-eyed individuals (necessarily sired by INV1, INV2, or REC males) in relation to the total. To account for the effect of the order of the males used, we performed identical experiments but this time the first male possessed the same genotype as the female while the second male was from the strains INV1, INV2, or REC. The number of replicates for each combination of genotypes ranged from four to nine. As for the comparison of mating abilities, we exposed single two-day-old males to 10 four-day-old virgin females for different time periods (1 hr, 3 hr, 6 hr). Afterwards, the females were transferred individually into vials with fresh food. After 15 days, 10 vials were examined for the presence of progeny, which indicates that at least one successful fertilization event occurred, and the number of females successfully fertilized recorded. Ten replicates were done per strain and time period combination.
Response to volatile compounds
The “dipstick” method was used [65]. Briefly, virgin individuals from INV and REC strains were separated by sex and transferred by aspiration in groups of five to marked empty plastic vials (O.D.×H: 25×95 mm) 24 hr after emergence. The vials were marked at 3 and 6 cm from the bottom. Fisherbrand Q-tips dipped into the odorant dilutions to be tested were introduced into the vials up to the 6 cm mark, and secured with a cotton plug to avoid contact with the walls of the vial. After a 15-second recovery period, the number of flies in the bottom compartment was recorded 10 times every five seconds and the avoidance scored estimated as the average over those 10 measurements. A score of 2.5 indicates indifference to the odorant tested while values >2.5 and <2.5 indicate repulsion and attraction, respectively. All tests were performed between 2 and 6 pm after starving the flies for no less than four hours, and the vials were always placed sideways to prevent interfering with the geotactic response. For each strain, 10 groups of five individuals for each of the sexes were analyzed. We tested three odorants, two of them usually considered to elicit an attracting response (ethanol, Gold Shield; acetone, Fisher Chemical) while the other is considered to be a repellant (benzaldehyde; Sigma Aldrich B1334 Benzaldehyde-ReagentPlus). Since odorant response can be concentration dependent, we tested two concentrations deferring by several orders of magnitude. These concentrations (vol/vol) were: ethanol, 10−3 and 10−0.5; acetone, 10−4.5 and 10−1.5; and benzaldehyde, 10−3 and 10−0.5. Only fresh and thoroughly mixed dilutions were used to prevent oxidation, which is particularly relevant in the case of benzaldehyde. As a control, we used distilled water, which is known to attract starved flies [65].
General homeostasis tests
Four proxies, including gravity response and survival to three stressors (heat-shock, desiccation, and starvation), were assayed. In all cases, males and females were separated after emergence and left for 24 hours in vials with fresh food. Subsequently, all individuals were transferred to new vials by aspiration for performing the pertinent tests. Negative gravitaxis was measured essentially as reported [66]. Twenty flies per sex and strain were transferred individually into a 250 ml glass volumetric cylinder. Flies were knocked down by tapping the cylinder ten times on a pad, and the height reached by each fly in 20 s recorded using the volumetric scale, which was divided by the maximum height. In our experience, flies performed more consistently after some training, reason why only measurements from a third trial were recorded. For the heat-shock resistance test [67], five groups of 20 flies per sex and strain were transferred in pairs into 3 ml Pyrex vials and then incubated in water baths at 35°C for 30 m (as a pre-conditioning step) and then at 39°C for another 30 m. The mobility of the flies was restricted to the submerged portion of the vial with cotton plugs. After the heat-shock treatment, flies were collected in vials with food that had been incubated at 25°C overnight. Next morning, the fraction of flies alive within each group was recorded. For the desiccation resistance test [68], groups of five flies, from a total of 20 per sex and strain, were transferred into empty vials. The mobility of the flies was restricted to the lower third of the vial using foam plugs, over which 3 g of Drierite desiccant were added. Next, the vials were sealed with Parafilm to maintain low humidity. Flies were checked every hour and the elapsed time-to-death recorded. As for the starvation resistance test, 20 flies per sex and strain were transferred into glass vials containing 2 ml of 1% agar dissolved in water to ensure normal humidity conditions. Flies were examined every 12 hr and the elapsed time-to death recorded.
Statistical analysis of non-molecular phenotypes
For each phenotype, normality of the data was visually inspected using normal quantile plots and precisely determined with the Shapiro-Wilk test. Homogeneity of variances was estimated using the Levene's test. Parametric tests were used if the departure from the assumptions of normality and homoscedasticity was absent or negligible. The Welch ANOVA was used if only heterodasticity was detected. Different transformations (log2 or arcsine square root, depending on the test) of the measurements were calculated for some phenotypes to improve fit to normality although this had a very little effect. Alternatively, non-parametric tests (e.g. Kruskal-Wallis) were used. When multiple post-hoc contrasts were necessary, appropriate tests that account for multiple comparisons were used (e.g. Steel-Dwass). In the case of the departure from expected Mendelian ratios in frequency-dependent competition experiments, the G-test for goodness of fit, upon applying the William's correction, was used. Statistical contrasts were performed with JMP 4.1 (SAS Institute); the evaluation of the sensitivity to detect statistically significant differences was done with GPower 3.1.3 when needed [69].
Expression data and analysis
Three low-density crosses were set up for the strains REC, INV1, INV2, SIM1, REV1, and REV2. The resulting virgin progeny was collected, separated by sex, and allowed to age for 5–7 days. Fifty individuals for each sex were snap frozen in liquid nitrogen at the same time of the day within a 2 hours window and subsequently stored at −80°C. Total RNA from biological samples was extracted using TRIzol Reagent (Invitrogen) and purified using the RNeasy Mini Kit (Qiagen). Concentration, quality and integrity of the RNA samples were estimated using a NanoDrop 8000 Spectrophotometer and the RNA 6000 Nano Chip Kit (Agilent Technologies) with an Agilent 2100 Bioanalyzer. Ten µg of total RNA were reverse transcribed into cDNA using the SuperScript Double-Stranded cDNA Labeling Kit (Invitrogen). Probe labeling, hybridization, array scanning, and data extraction were performed by Roche NimbleGen Service Group in Iceland. We used the oligonucleotide NimbleGen 12×135k D. melanogaster arrays, which contain 135,000 probes including different types of controls and 16,637 transcripts of protein-coding genes as annotated in release 5.7. The experiment consisted of 36 one-color hybridizations (6 samples×2 sexes×3 biological replicates). Raw fluorescence intensity values of probe pairs were summarized for each transcript using the median value after log transformation. Subsequent data analysis was performed using the tools implemented in the online pipeline WebArrayDB [70]. Data were normalized between arrays using the scale method [71] implemented in the LIMMA package [72]. Data for males and females were analyzed separately. Statistically significant differences across strains were assessed using a fixed-effect model ANOVA and multiple testing was performed with the Benjamini-Hochberg correction [73]. Similarity in expression profiles across genes and strains was assessed by hierarchical clustering using Ward's minimum variances as a distance metric. The first principal component was used to assist in the sorting. Six biologically meaningful planned contrasts defined a priori, all of them orthogonal, were done likewise by pooling the appropriately expression data of different strains as necessary (Table S17). Scrutiny of the differences in post-hoc comparisons among strains was done using the “multcomp” R package [74]. Functional information for relevant genes was obtained from FlyBase [30] and enrichment for particular Gene Ontology term categories (biological processes, molecular functions, and cellular components), KEGG pathways, and InterPro protein domains was evaluated using DAVID [75]. Benjamini-Hochberg correction [73] was applied to account for multiple tests. We proceeded likewise, but for each strain separately, to evaluate statistically significant differences between the sexes. Raw microarray data were deposited at the Gene Expression Omnibus database (GSE31120). In relation to the characterization of the expression profiles of the genes in the region under study during the life cycle of D. melanogaster, RPKM expression values across 30 timepoints and conditions [2] were extracted from FlyBase [30], log transformed, and compared using hierarchical clustering as above.
Promoter analysis
Core promoter predictions for all the transcripts of protein-coding genes examined were done with McPromoter using the most stringent parameter values [76]. We inspected 500 nt upstream of the 5′ UTR start of each transcripts as annotated in FlyBase [30]. In the absence of an annotated 5′UTR, a stretch of DNA of equal length upstream of the first nucleotide annotated was examined. Categorization of genes as responsive to HCNEs was based on the prediction of having an Inr core promoter type.
Comparative organization of the ultraconserved region CG15121–CG16894
We retrieved mapping coordinates of protein-coding genes included in the ultraconserved region under study in A. gambiae (AgamP3 assembly) using Biomart [77]. Any orthologous mapping information that did not conform to a one-to-one relationship between species was discarded. The global gene organization nearby the ortholog of Toll-7 in A. gambiae and A. aegypti was examined through VectorBase [78]. Phylogenetic relationships among Obp-related amino acid sequences encoded by genes in the same region that harbors the gene Toll-7 from the two mosquito species were conducted in MEGA 5.0 [79]. Amino acid sequences were downloaded from VectorBase [78] and aligned with CLUSTALW [80]. The best evolutionary model of amino acid evolution was found to be WAG and the consensus tree was built using the Maximum Likelihood method [81]. A discrete Gamma distribution was used to model evolutionary rate differences among sites (+G; 4 categories). Bootstrapping was performed to determine the confidence of the branches (1,000 replicates).
Supporting Information
Acknowledgments
We thank John Roote for providing the protocol to generate inversions; Bradford Hawkins, Arnaud Martin, Ricardo Pappa, Rita Petersen, Robert Reed, and Andrei Tatarenkov for access to their instrumentation and technical help; and Michael Ashburner, Francisco Ayala, Brandon Gaut, Laurence Muller, John Parsch, Julio Rozas, Igor Sharakhov, Shudan Yeh, Marcin von Grotthuss, and three reviewers for comments.
Footnotes
The authors have declared that no competing interests exist.
This work was supported by start-up funds and a grant from NSF (DEB-0949365) to JMR. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Kapranov P, Cheng J, Dike S, Nix DA, Duttagupta R, et al. RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science. 2007;316:1484–1488. doi: 10.1126/science.1138341. [DOI] [PubMed] [Google Scholar]
- 2.Graveley BR, Brooks AN, Carlson JW, Duff MO, Landolin JM, et al. The developmental transcriptome of Drosophila melanogaster. Nature. 2010 doi: 10.1038/nature09715. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Roy S, Ernst J, Kharchenko PV, Kheradpour P, Negre N, et al. Identification of functional elements and regulatory circuits by Drosophila modENCODE. Science. 2010;330:1787–1797. doi: 10.1126/science.1198374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Sharpe J, Nonchev S, Gould A, Whiting J, Krumlauf R. Selectivity, sharing and competitive interactions in the regulation of Hoxb genes. Embo J. 1998;17:1788–1798. doi: 10.1093/emboj/17.6.1788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Spitz F, Gonzalez F, Duboule D. A global control region defines a chromosomal regulatory landscape containing the HoxD cluster. Cell. 2003;113:405–417. doi: 10.1016/s0092-8674(03)00310-6. [DOI] [PubMed] [Google Scholar]
- 6.Bulger M, Groudine M. Looping versus linking: toward a model for long-distance gene activation. Genes Dev. 1999;13:2465–2477. doi: 10.1101/gad.13.19.2465. [DOI] [PubMed] [Google Scholar]
- 7.Cavodeassi F, Modolell J, Gomez-Skarmeta JL. The Iroquois family of genes: from body building to neural patterning. Development. 2001;128:2847–2855. doi: 10.1242/dev.128.15.2847. [DOI] [PubMed] [Google Scholar]
- 8.Mackenzie A, Miller KA, Collinson JM. Is there a functional link between gene interdigitation and multi-species conservation of synteny blocks? Bioessays. 2004;26:1217–1224. doi: 10.1002/bies.20117. [DOI] [PubMed] [Google Scholar]
- 9.Spitz F, Duboule D. Global control regions and regulatory landscapes in vertebrate development and evolution. Adv Genet. 2008;61:175–205. doi: 10.1016/S0065-2660(07)00006-5. [DOI] [PubMed] [Google Scholar]
- 10.Mongin E, Dewar K, Blanchette M. Long-range regulation is a major driving force in maintaining genome integrity. BMC Evol Biol. 2009;9:203. doi: 10.1186/1471-2148-9-203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kikuta H, Laplante M, Navratilova P, Komisarczuk AZ, Engstrom PG, et al. Genomic regulatory blocks encompass multiple neighboring genes and maintain conserved synteny in vertebrates. Genome Res. 2007;17:545–555. doi: 10.1101/gr.6086307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Engstrom PG, Ho Sui SJ, Drivenes O, Becker TS, Lenhard B. Genomic regulatory blocks underlie extensive microsynteny conservation in insects. Genome Res. 2007;17:1898–1908. doi: 10.1101/gr.6669607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hufton AL, Mathia S, Braun H, Georgi U, Lehrach H, et al. Deeply conserved chordate noncoding sequences preserve genome synteny but do not drive gene duplicate retention. Genome Res. 2009;19:2036–2051. doi: 10.1101/gr.093237.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.de la Calle-Mustienes E, Feijoo CG, Manzanares M, Tena JJ, Rodriguez-Seguel E, et al. A functional survey of the enhancer activity of conserved non-coding sequences from vertebrate Iroquois cluster gene deserts. Genome Res. 2005;15:1061–1072. doi: 10.1101/gr.4004805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Pennacchio LA, Ahituv N, Moses AM, Prabhakar S, Nobrega MA, et al. In vivo enhancer analysis of human conserved non-coding sequences. Nature. 2006;444:499–502. doi: 10.1038/nature05295. [DOI] [PubMed] [Google Scholar]
- 16.Ellingsen S, Laplante MA, Konig M, Kikuta H, Furmanek T, et al. Large-scale enhancer detection in the zebrafish genome. Development. 2005;132:3799–3811. doi: 10.1242/dev.01951. [DOI] [PubMed] [Google Scholar]
- 17.Woolfe A, Goodson M, Goode DK, Snell P, McEwen GK, et al. Highly conserved non-coding sequences are associated with vertebrate development. PLoS Biol. 2005;3:e7. doi: 10.1371/journal.pbio.0030007. doi: 10.1371/journal.pbio.0030007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Zuniga A, Michos O, Spitz F, Haramis AP, Panman L, et al. Mouse limb deformity mutations disrupt a global control region within the large regulatory landscape required for Gremlin expression. Genes Dev. 2004;18:1553–1564. doi: 10.1101/gad.299904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Kleinjan DA, van Heyningen V. Long-range control of gene expression: emerging mechanisms and disruption in disease. Am J Hum Genet. 2005;76:8–32. doi: 10.1086/426833. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Spitz F, Herkenne C, Morris MA, Duboule D. Inversion-induced disruption of the Hoxd cluster leads to the partition of regulatory landscapes. Nat Genet. 2005;37:889–893. doi: 10.1038/ng1597. [DOI] [PubMed] [Google Scholar]
- 22.von Grotthuss M, Ashburner M, Ranz JM. Fragile regions and not functional constraints predominate in shaping gene organization in the genus Drosophila. Genome Res. 2010;20:1084–1096. doi: 10.1101/gr.103713.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Tamura K, Subramanian S, Kumar S. Temporal patterns of fruit fly (Drosophila) evolution revealed by mutation clocks. Mol Biol Evol. 2004;21:36–44. doi: 10.1093/molbev/msg236. [DOI] [PubMed] [Google Scholar]
- 24.Powell JR, DeSalle R. Drosophila molecular phylogenies and their uses. In: Hecht MK, MacIntire RJ, Clegg MT, editors. Evolutionary Biology. New York: Plenum; 1995. pp. 87–138. [Google Scholar]
- 25.Muller HJ. Bearings of the Drosophila work on systematics. In: Huxley J, editor. The New Systematics. Oxford: Clarendon Press; 1940. pp. 185–268. [Google Scholar]
- 26.Sturtevant AH, Novitski E. The homologies of the chromosome elements in the genus Drosophila. Genetics. 1941;26:517–541. doi: 10.1093/genetics/26.5.517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Pevzner P, Tesler G. Human and mouse genomic sequences reveal extensive breakpoint reuse in mammalian evolution. Proc Natl Acad Sci U S A. 2003;100:7672–7677. doi: 10.1073/pnas.1330369100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Meadows LA, Chan YS, Roote J, Russell S. Neighbourhood continuity is not required for correct testis gene expression in Drosophila. PLoS Biol. 2010;8:e1000552. doi: 10.1371/journal.pbio.1000552. doi: 10.1371/journal.pbio.1000552. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Bhutkar A, Schaeffer SW, Russo SM, Xu M, Smith TF, et al. Chromosomal rearrangement inferred from comparisons of 12 Drosophila genomes. Genetics. 2008;179:1657–1680. doi: 10.1534/genetics.107.086108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Tweedie S, Ashburner M, Falls K, Leyland P, McQuilton P, et al. FlyBase: enhancing Drosophila Gene Ontology annotations. Nucleic Acids Res. 2009;37:D555–559. doi: 10.1093/nar/gkn788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Swarup S, Williams TI, Anholt RR. Functional Dissection of Odorant Binding Protein Genes in Drosophila melanogaster. Genes Brain Behav. 2011 doi: 10.1111/j.1601-183X.2011.00704.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ranz JM, Casals F, Ruiz A. How malleable is the eukaryotic genome? Extreme rate of chromosomal rearrangement in the genus Drosophila. Genome Res. 2001;11:230–239. doi: 10.1101/gr.162901. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zdobnov EM, Von Mering C, Letunic I, Torrents D, Suyama M, et al. Comparative genome and proteome analysis of Anopheles gambiae and Drosophila melanogaster. Science. 2002;298:149–159. doi: 10.1126/science.1077061. [DOI] [PubMed] [Google Scholar]
- 34.Zdobnov EM, Bork P. Quantification of insect genome divergence. Trends Genet. 2007;23:16–20. doi: 10.1016/j.tig.2006.10.004. [DOI] [PubMed] [Google Scholar]
- 35.Holt RA, Subramanian GM, Halpern A, Sutton GG, Charlab R, et al. The genome sequence of the malaria mosquito Anopheles gambiae. Science. 2002;298:129–149. doi: 10.1126/science.1076181. [DOI] [PubMed] [Google Scholar]
- 36.Papatsenko D, Levine M, Goltsev Y. Clusters of temporal discordances reveal distinct embryonic patterning mechanisms in Drosophila and anopheles. PLoS Biol. 2011;9:e1000584. doi: 10.1371/journal.pbio.1000584. doi: 10.1371/journal.pbio.1000584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Vieira FG, Rozas J. Comparative Genomics of the Odorant-Binding and Chemosensory Protein Gene Families across the Arthropoda: Origin and evolutionary history of the chemosensory system. Genome Biol Evol. 2011 doi: 10.1093/gbe/evr033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Zhou JJ, He XL, Pickett JA, Field LM. Identification of odorant-binding proteins of the yellow fever mosquito Aedes aegypti: genome annotation and comparative analyses. Insect Mol Biol. 2008;17:147–163. doi: 10.1111/j.1365-2583.2007.00789.x. [DOI] [PubMed] [Google Scholar]
- 39.Parks AL, Cook KR, Belvin M, Dompe NA, Fawcett R, et al. Systematic generation of high-resolution deletion coverage of the Drosophila melanogaster genome. Nat Genet. 2004;36:288–292. doi: 10.1038/ng1312. [DOI] [PubMed] [Google Scholar]
- 40.Golic KG, Golic MM. Engineering the Drosophila genome: chromosome rearrangements by design. Genetics. 1996;144:1693–1711. doi: 10.1093/genetics/144.4.1693. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Ryder E, Blows F, Ashburner M, Bautista-Llacer R, Coulson D, et al. The DrosDel collection: a set of P-element insertions for generating custom chromosomal aberrations in Drosophila melanogaster. Genetics. 2004;167:797–813. doi: 10.1534/genetics.104.026658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Ryder E, Ashburner M, Bautista-Llacer R, Drummond J, Webster J, et al. The DrosDel deletion collection: a Drosophila genomewide chromosomal deficiency resource. Genetics. 2007;177:615–629. doi: 10.1534/genetics.107.076216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Adams MD, Celniker SE, Holt RA, Evans CA, Gocayne JD, et al. The genome sequence of Drosophila melanogaster. Science. 2000;287:2185–2195. doi: 10.1126/science.287.5461.2185. [DOI] [PubMed] [Google Scholar]
- 44.O'Keefe LV, Smibert P, Colella A, Chataway TK, Saint R, et al. Know thy fly. Trends Genet. 2007;23:238–242. doi: 10.1016/j.tig.2007.03.007. [DOI] [PubMed] [Google Scholar]
- 45.Marks RW. Genetic Variability for Density Sensitivity of Three Components of Fitness in DROSOPHILA MELANOGASTER. Genetics. 1982;101:301–316. doi: 10.1093/genetics/101.2.301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Powell JR. Progress and prospects in evolutionary biology: the Drosophila model. Oxford: Oxford University Press; 1997. 576 [Google Scholar]
- 47.Ashburner MA, Golic KG, Hawley RS. Drosophila: A Laboratory Handbook. 2005. Cold Spring Harbor Cold Spring Harbor Laboratory Press.
- 48.Jeong Y, El-Jaick K, Roessler E, Muenke M, Epstein DJ. A functional screen for sonic hedgehog regulatory elements across a 1 Mb interval identifies long-range ventral forebrain enhancers. Development. 2006;133:761–772. doi: 10.1242/dev.02239. [DOI] [PubMed] [Google Scholar]
- 49.Dekker T, Ibba I, Siju KP, Stensmyr MC, Hansson BS. Olfactory shifts parallel superspecialism for toxic fruit in Drosophila melanogaster sibling, D. sechellia. Curr Biol. 2006;16:101–109. doi: 10.1016/j.cub.2005.11.075. [DOI] [PubMed] [Google Scholar]
- 50.Ruebenbauer A, Schlyter F, Hansson BS, Lofstedt C, Larsson MC. Genetic variability and robustness of host odor preference in Drosophila melanogaster. Curr Biol. 2008;18:1438–1443. doi: 10.1016/j.cub.2008.08.062. [DOI] [PubMed] [Google Scholar]
- 51.Mackay TF, Heinsohn SL, Lyman RF, Moehring AJ, Morgan TJ, et al. Genetics and genomics of Drosophila mating behavior. Proc Natl Acad Sci U S A. 2005;102(Suppl 1):6622–6629. doi: 10.1073/pnas.0501986102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Shevelyov YY, Lavrov SA, Mikhaylova LM, Nurminsky ID, Kulathinal RJ, et al. The B-type lamin is required for somatic repression of testis-specific gene clusters. Proc Natl Acad Sci U S A. 2009;106:3282–3287. doi: 10.1073/pnas.0811933106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Lupski JR. Genomic disorders: structural features of the genome can lead to DNA rearrangements and human disease traits. Trends Genet. 1998;14:417–422. doi: 10.1016/s0168-9525(98)01555-8. [DOI] [PubMed] [Google Scholar]
- 54.Stankiewicz P, Lupski JR. Genome architecture, rearrangements and genomic disorders. Trends Genet. 2002;18:74–82. doi: 10.1016/s0168-9525(02)02592-1. [DOI] [PubMed] [Google Scholar]
- 55.Szankasi P, Gysler C, Zehntner U, Leupold U, Kohli J, et al. Mitotic recombination between dispersed but related rRNA genes of Schizosaccharomyces pombe generates a reciprocal translocation. Molecular General Genetics. 1986;202:394–402. [Google Scholar]
- 56.Kellis M, Patterson N, Endrizzi M, Birren B, Lander ES. Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature. 2003;423:241–254. doi: 10.1038/nature01644. [DOI] [PubMed] [Google Scholar]
- 57.Hill CW, Gray JA. Effects of chromosomal inversion on cell fitness in Escherichia coli K-12. Genetics. 1988;119:771–778. doi: 10.1093/genetics/119.4.771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Liu SL, Sanderson KE. Rearrangements in the genome of the bacterium Salmonella typhi. Proc Natl Acad Sci U S A. 1995;92:1018–1022. doi: 10.1073/pnas.92.4.1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Gonzalez J, Casals F, Ruiz A. Duplicative and conservative transpositions of larval serum protein 1 genes in the genus Drosophila. Genetics. 2004;168:253–264. doi: 10.1534/genetics.103.025916. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Ranz JM, Díaz-Castillo C, Petersen R. Conserved gene order at the nuclear periphery in Drosophila. Mol Biol Evol. 2011 doi: 10.1093/molbev/msr178. [DOI] [PubMed] [Google Scholar]
- 61.Mathog D, Sedat JW. The three-dimensional organization of polytene nuclei in male Drosophila melanogaster with compound XY or ring X chromosomes. Genetics. 1989;121:293–311. doi: 10.1093/genetics/121.2.293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Pickersgill H, Kalverda B, de Wit E, Talhout W, Fornerod M, et al. Characterization of the Drosophila melanogaster genome at the nuclear lamina. Nat Genet. 2006;38:1005–1014. doi: 10.1038/ng1852. [DOI] [PubMed] [Google Scholar]
- 63.Lefevre G. A photographic representation and interpretation of the polytene chromosomes of Drosophila melanogaster salivary glands. In: Ashburner MA, Novitski E, editors. The Genetics and Biology of Drosophila. London: Academic Press; 1976. pp. 31–66. [Google Scholar]
- 64.Anderson WW, Arnold J, Sammons SA, Yardley DG. Frequency-dependent viabilities of Drosophila pseudoobscura karyotypes. Heredity. 1986;56:7–17. [Google Scholar]
- 65.Anholt RR, Lyman RF, Mackay TF. Effects of single P-element insertions on olfactory behavior in Drosophila melanogaster. Genetics. 1996;143:293–301. doi: 10.1093/genetics/143.1.293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Miquel J, Lundgren PR, Bensch KG, Atlan H. Effects of temperature on the life span, vitality and fine structure of Drosophila melanogaster. Mech Ageing Dev. 1976;5:347–370. doi: 10.1016/0047-6374(76)90034-8. [DOI] [PubMed] [Google Scholar]
- 67.Gong WJ, Golic KG. Loss of Hsp70 in Drosophila is pleiotropic, with effects on thermotolerance, recovery from heat shock and neurodegeneration. Genetics. 2006;172:275–286. doi: 10.1534/genetics.105.048793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Gibbs AG, Chippindale AK, Rose MR. Physiological mechanisms of evolved desiccation resistance in Drosophila melanogaster. J Exp Biol. 1997;200:1821–1832. doi: 10.1242/jeb.200.12.1821. [DOI] [PubMed] [Google Scholar]
- 69.Faul F, Erdfelder E, Lang AG, Buchner A. G*Power 3: a flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav Res Methods. 2007;39:175–191. doi: 10.3758/bf03193146. [DOI] [PubMed] [Google Scholar]
- 70.Xia XQ, McClelland M, Porwollik S, Song W, Cong X, et al. WebArrayDB: cross-platform microarray data analysis and public data repository. Bioinformatics. 2009;25:2425–2429. doi: 10.1093/bioinformatics/btp430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Yang YH, Dudoit S, Luu P, Lin DM, Peng V, et al. Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res. 2002;30:e15. doi: 10.1093/nar/30.4.e15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Smyth GK. Limma: linear models for microarray data. In: Gentleman R, Carey V, Dudoit S, Irizarry R, Huber W, editors. Bioinformatics and Computational Biology Solutions using R and Bioconductor. New York: Springer; 2005. pp. 397–420. [Google Scholar]
- 73.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Statist Soc Ser B. 1995;57:289–300. [Google Scholar]
- 74.Hothorn T, Bretz F, Westfall P. Simultaneous inference in general parametric models. Biom J. 2008;50:346–363. doi: 10.1002/bimj.200810425. [DOI] [PubMed] [Google Scholar]
- 75.Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44–57. doi: 10.1038/nprot.2008.211. [DOI] [PubMed] [Google Scholar]
- 76.Ohler U. Identification of core promoter modules in Drosophila and their application in accurate transcription start site prediction. Nucleic Acids Res. 2006;34:5943–5950. doi: 10.1093/nar/gkl608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Haider S, Ballester B, Smedley D, Zhang J, Rice P, et al. BioMart Central Portal–unified access to biological data. Nucleic Acids Res. 2009;37:W23–27. doi: 10.1093/nar/gkp265. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Lawson D, Arensburger P, Atkinson P, Besansky NJ, Bruggner RV, et al. VectorBase: a data resource for invertebrate vector genomics. Nucleic Acids Res. 2009;37:D583–587. doi: 10.1093/nar/gkn857. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Tamura K, Peterson D, Peterson N, Stecher G, Nei M, et al. MEGA5: Molecular Evolutionary Genetics Analysis using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods. Mol Biol Evol. 2011 doi: 10.1093/molbev/msr121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22:4673–4680. doi: 10.1093/nar/22.22.4673. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Whelan S, Goldman N. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol. 2001;18:691–699. doi: 10.1093/oxfordjournals.molbev.a003851. [DOI] [PubMed] [Google Scholar]
- 82.Engstrom PG, Fredman D, Lenhard B. Ancora: a web resource for exploring highly conserved noncoding elements and their association with developmental regulatory genes. Genome Biol. 2008;9:R34. doi: 10.1186/gb-2008-9-2-r34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Gaunt MW, Miles MA. An insect molecular clock dates the origin of the insects and accords with palaeontological and biogeographic landmarks. Mol Biol Evol. 2002;19:748–761. doi: 10.1093/oxfordjournals.molbev.a004133. [DOI] [PubMed] [Google Scholar]
- 84.Krzywinski J, Grushko OG, Besansky NJ. Analysis of the complete mitochondrial DNA from Anopheles funestus: an improved dipteran mitochondrial genome annotation and a temporal dimension of mosquito evolution. Mol Phylogenet Evol. 2006;39:417–423. doi: 10.1016/j.ympev.2006.01.006. [DOI] [PubMed] [Google Scholar]
- 85.Vieira FG, Sanchez-Gracia A, Rozas J. Comparative genomic analysis of the odorant-binding protein family in 12 Drosophila genomes: purifying selection and birth-and-death evolution. Genome Biol. 2007;8:R235. doi: 10.1186/gb-2007-8-11-r235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Klemenz R, Weber U, Gehring WJ. The white gene as a marker in a new P-element vector for gene transfer in Drosophila. Nucleic Acids Res. 1987;15:3947–3959. doi: 10.1093/nar/15.10.3947. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.