Abstract
Distinguishing subpopulations in group behavioral experiments can reveal the impact of differences in genetic, pharmacological and life-histories on social interactions and decision-making. Here we describe Fluorescence Behavioral Imaging (FBI), a toolkit that uses transgenic fluorescence to discriminate subpopulations, imaging hardware that simultaneously records behavior and fluorescence expression, and open-source software for automated, high-accuracy determination of genetic identity. Using FBI, we measure courtship partner choice in genetically mixed groups of Drosophila.
Introduction
Natural behavior has evolved in the context of social interactions between conspecifics as well as between species. This is most apparent in the courtship rituals and aggression behaviors observed across the animal kingdom, including in the fruitfly, Drosophila melanogaster [1]. Interactions within groups of individuals must therefore be taken into account for a complete understanding of how behavior unfolds. Drosophila is poised to reveal important insights in the study of group behaviors as substantial progress in the precision of behavioral quantification has recently been made: Ctrax [2], Cadabra [3], and other software packages enable the semi-automated tracking and analysis of groups and pairs of fruitflies [4]. These tools dramatically expand the potential resolution and sophistication of behavioral studies. However, tracking methods relying on morphological criteria have so far only been able to discriminate large differences between animals, for example smaller males from females [2]. Moreover, morphology is an ambiguous metric because of size variability between strains due to genetic background or culture conditions. Identifying differences using other criteria would bridge a wide methodological gap in Drosophila, an organism whose strength lies in the ease of genetic manipulations, by revealing social behaviors and decision-making within groups consisting of individuals of different genotypes and life histories.
Here we describe Fluorescence Behavioral Imaging (FBI), a toolkit that complements tracking methods by enabling the discrimination of subpopulations within heterogeneous groups of freely behaving flies. FBI bookends behavioral experiments (Figure 1A), making it independent of advances in position/orientation tracking. To discriminate individuals, FBI exploits the expression of a fluorescent protein in a subpopulation, drawing inspiration from physical tagging approaches that are used in larger insects [5] and leveraging the power of Drosophila genetics. By analogy with clonal cellular analyses using fluorescent markers in Drosophila [6], FBI also confers the advantages inherent in allowing the phenotypic comparison of two distinct populations of animals within the same experiment. Although this approach is imminently scalable to discriminate many subpopulations using multiple fluorophores, here we illustrate the distinction of two subgroups of flies using a single fluorophore, enhanced Green Fluorescent Protein (eGFP).
Results and Discussion
Tools for Fluorescence Behavioral Imaging (FBI)
To tag one subgroup of flies we generated transgenic animals expressing eGFP under the control of the actin88F promoter, which drives expression in indirect flight muscles of the thorax [7] (Figure 1B). This approach confers an advantage over a ubiquitous expression strategy since ubiquitous expression of eGFP using a Tubulin promoter sequence can result in changes in basal locomotion (data not shown) while the translational velocity, angular velocity, and courtship duration of Actin88F:eGFP flies and control flies are indistinguishable (Figure S1). Homozygous Actin88F:eGFP flies are fecund and able to fly but assay specific controls should not be neglected in new experimental scenarios requiring high precision behavioral measurements. Another advantage of this transgene expression pattern is that the spatial intensity distribution of thoracic eGFP fluorescence is distinct from typical cuticular auto-fluorescence, facilitating discrimination of eGFP-expressing flies (GFP) from those lacking this transgene (non-GFP) flies.
To simultaneously access genetic and positional information, we developed a macroscopic imaging system for synchronous fluorescence and infrared (IR) backlight video recording (Figure 1C). Due to the effect of visible light on locomotion [8], it is less intrusive to perform fluorescence imaging only after the completion of each experiment (Figure 1D). However, to explore the robustness of our method under multiple conditions, we synchronously recorded images from IR backlight illumination (Figure 2A) and visible eGFP excitation (Figure 2B) for a brief period (10 seconds) both prior to and following each experiment. To coordinate the timing of LED activation and camera acquisition we developed open-source software called sQuid (available at: http://lis.epfl.ch/squid), which permits the control of multiple cameras and a computer output interface with millisecond temporal precision. Using these tools, we recorded groups of eighteen freely walking flies in an enclosed arena for up to five minutes. Following each experiment IR videos were tracked using Ctrax [2]. Using tracking data (Figure 2C), we could delineate regions of interest (ROI) for each fly in each fluorescence image (Figure 2D). For subsequent analysis this region (Total ROI) was divided into a subregion containing the head and thorax (Front ROI; Figure 2D, green) and a second subregion containing the abdomen (Rear ROI; Figure 2D, blue).
Automation of Genotypic Identification
While in principle these images can be used to discriminate between GFP and non-GFP subpopulations by eye (Figure 2E; GFP in blue, non-GFP in red), such an approach is very time consuming and susceptible to human error. We therefore developed FBI post-processing Matlab scripts for automatically discriminating genetic identity with high accuracy (see Text S1 for details; scripts available at http://lis.epfl.ch/FBI). We began by measuring the range of fluorescence values for GFP or non-GFP flies. After recording IR and fluorescence videos of genetically homogeneous groups, pixel values were extracted from Front and Rear ROIs for each fly in fluorescence images. Next we evaluated fifteen quantitative metrics for their accuracy (Figure S2) in processing fluorescence pixel values to produce a result that is above a threshold (Figure S3) for GFP flies and below this threshold for non-GFP flies. We identified two metrics that most effectively separated pixel value histograms for GFP and non-GFP flies into non-overlapping distributions (Figure 3A, B). The first metric, Max 5% Ratio, is the mean of the brightest 5% of pixel values in the Front ROI divided by the mean of the brightest 5% of pixel values in the Rear ROI (Figure 3A). This ratiometric normalization reduces the impact of variability in GFP excitation and expression levels. The second metric, Skewness, is a statistical measure of the pixel value distribution for the Total ROI (Figure 3B, see Materials and Methods for mathematical formulation). These two metrics are dimensionless making them more robust to hardware and illumination differences across experimental platforms.
We discovered that these two metrics were also complementary: each provided optimal discrimination for different mixtures of fly genders and fluorescence expression (Figure S2). Such complementarity suggested that these metrics might be even more effective when used in combination. By systematically testing different proportions of the two metrics with different discrimination thresholds on all genotypic mixture combinations, we confirmed that higher and more robust discrimination accuracy could be achieved with a combination of both metrics rather than one alone (Figures S4 & S5).
To test this automated approach for discriminating genetic identity in heterogeneous groups of flies, we performed experiments using GFP and non-GFP females or males together (female-female: n = 123 GFP flies and 125 non-GFP flies from 14 experiments; male-male: n = 142 GFP flies and 136 non-GFP flies from 15 experiments). Using optimal proportions and discrimination thresholds derived from homogeneous group experiments (Figure S4), we could accurately identify GFP expression in heterogeneous groups of flies. To achieve >90% discrimination accuracy in both experiments, only 4 images were needed, while 20 images brought accuracy to above 95% (Figure S6 insets). However, achieving >99% discrimination accuracy required 602 images for female flies (Figure S6A, more than males due to abdominal autofluorescence) and 386 images in male flies (Figure S6B). Such high performance might therefore require prohibitively long eGFP excitation periods for light-sensitive experiments at our frame-rate of 20 frames per second.
To overcome this problem, we reasoned that incorporating prior information of the expected number of GFP and non-GFP flies might reduce the number of images needed for high accuracy discrimination. We used this additional information by sorting processed fluorescence values for each fly in descending order and then dividing this list in two. The top portion denoted putative GFP flies (based on the expected number) while the lower denoted putative non-GFP flies. By exploring the dependence of discrimination accuracy on the weighting of each metric and the number of images used, we observed that this strategy could reach >99% discrimination accuracy with fewer images (102 images in females and 4 images in males) and using a wide range of metric weightings (Figure 3C, D). Importantly, >99% discrimination accuracy could also be achieved with FBI only after each experiment (134 images in females and 2 images in males, Figure S7) precluding the requirement for blue light illumination prior to experimental recordings, which could potentially influence locomotor and other behaviors [8]. In summary, using a combination of complementary pixel value metrics as well as prior knowledge of the proportion of labelled flies, FBI post-processing scripts can achieve high accuracy automated identification requiring only a brief period of fluorescence imaging at the beginning and/or end of each experiment.
Measuring Courtship Choice Using FBI
Tracking algorithms allow high-throughput quantitative measurements of behavior but cannot resolve differences in genotype or life-history. Consequently, large-scale studies requiring mixed populations such as those measuring social decision-making are out of reach. To illustrate how FBI overcomes this limitation, we studied courtship choice in genetically heterogeneous groups of male flies. We examined the initial chasing/orienting steps of the courtship ritual in males mutant for fruitless (fru−/−), which lack an important genetic determinant of sexual behavior [9]. fru−/− males have altered sexual orientation, and court other males. It is not known whether fru−/− mutants prefer to court wild-type males (which normally rebuff homosexual advances) or other fru−/− mutants, which might be more receptive to courtship. We therefore tested whether fru−/− males preferred to court wild-type males over other fru−/− males when mixed in groups of twelve (n = 10 experiments). We confirmed that fru−/− males court other males, sometimes forming “chains” that incorporate both wild-type and fru−/− mutant animals (Figure 4A). This can also be visualized in encounter density plots [2] showing a dramatically high proportion of fru−/− male encounters occurring near the head since sensory cues promoting courtship are detected by neurons on the head or forelegs (Figure 4B, right). When we quantified the proportion of courtship events (Figure 4C, left) and courtship time (Figure 4C, right) as well as courtship event duration (Figure 4D), we observed that fru−/− males courted wild-type and fru−/− flies with similar intensity (Student’s t-test, P = 0.37 for events and P = 0.23 for time compared to chance; Wilcoxon rank-sum test, P = 0.14 for event duration). These data suggest that at least the initial courtship decisions of fru−/− males are not strongly influenced by partner behavior and that receptivity of males to initial advances by other males is not altered in fru−/− mutants.
Discussion
FBI can be used to complement tracking methods, providing a general way to link quantitative Drosophila group behavior with subgroup specific experimental perturbations such as genetic mutations or life-history modifications such as drug treatments. We envision that this approach could be easily applied to the behavioral analysis of any species amenable to transgenesis and tracking (e.g. mosquitoes [10], C. elegans [11], zebrafish [12], and mice [13]). Additionally, it could be modified to incorporate a wealth of fluorescent tools towards the study of behavior. For example, one might tag more than two subgroups using multiple fluorophores [14], measure gene expression during behavior [15], use fluorophore photo-activation [16] for behavior-triggered marking, or study real-time feeding by measuring the ingestion of synthetic fluorescent dyes.
Materials and Methods
Molecular Biology
For the Actin88F:eGFP construct, a 2053 bp region immediately upstream of the actin88F gene was amplified by the Expand High Fidelity PLUS PCR system (Roche) from Oregon-R genomic DNA using the following forward primer containing a BmtI site: 5′-GCT AGC ATG CAC AAT AGG CAA ATT TAG TT-3′ and reverse primer containing an EcoRI site: 5′-GAA TTC CTT GGC AGT TGT TTA TCT GGA A-3′. eGFP was similarly amplified using the following forward primer containing a KpnI site: 5′-GGT ACC ATG GTG AGC AAG GGC GA-3′ and reverse primer containing an XbaI site: 5′-TCT AGA TTA CTT GTA CAG CTC GTC CAT GC-3′. PCR products were T:A cloned into pGEM-T Easy (Promega), sequenced, and subcloned with BmtI and EcoRI into the pattB vector [17]. eGFP was subsequently amplified and subcloned downstream of this promoter fragment.
Drosophila Strains
Transgenic Actin88F:eGFP strains (“GFP” flies) were generated (Genetic Services, Inc., Cambridge, Massachusetts, USA) with the phiC31-based integration system using attP40 (second chromosome) or attP2 (third chromosome) landing sites [18]. To examine a worst-case-scenario for discrimination analysis, potentially encountered in the context of experiments with flies carrying other transgenes, in most experiments we used “non-GFP” flies with a Minos transposable element insertion in the IR64a locus (IR64ami) [19] whose marker drives GFP expression in the ocelli and eyes. fruitless mutant flies (fru−/−) were homozygous fru Gal4 [20]. All flies were back-crossed to w1118 for five generations and self-crossed to achieve homozygosity. For courtship control experiments, GFP males were compared to w1118 males.
Fluorescence Behavioral Imaging (FBI) System
The experimental arena consisted of an 80 mm×20 mm enclosure with a height of 1.3 mm restricting flies to walking in two-dimensions (custom designed and machined from polyoxymethylene and acrylic glass). To achieve spectral separation of the two channels for each camera (Allied Vision Technologies, Stadtroda, Germany), we used a 580 nm long-pass dichroic filter (F38-580 HC beamsplitter BS 580, AHF analysentechnik, Germany) to pass infrared (IR) photons emitted from back-light illuminating 850 nm IR LEDs (IR-1WS-850-w/Star, Super Bright LEDs Inc. St. Louis Missouri, USA) through a diffusing glass (ThorLabs, USA) to a camera bearing a 785 nm IR long-pass filter (F76-787 Edge Basic Long Pass, AHF analysentechnik, Germany). This dichroic also reflected photons below 580 nm into a camera bearing a GFP band-pass filter (AHF analysentechnik, Germany). GFP was excited using a panel of blue super-bright 470 nm LEDs (LED470-66-60, Roithner Lasertechnik GmbH, Germany) placed incident to the behavioral arena.
Behavioral Experiments
All experiments were performed on 2 day post-eclosion adult Drosophila raised at 25°C on a 12 h light:12 h dark cycle. Experiments were performed in a temperature-controlled room at 25°C.
Homogeneous group FBI experiments
These experiments (Figures 3A, B; S1A, B; S2, S3, S4, S5) used 18 flies (either all male or all female; either all GFP or all non-GFP) and were performed as follows: GFP/IR imaging (10 s) – IR imaging (1 min) – GFP/IR imaging (10 s).
Heterogeneous group FBI experiments
These experiments (Figures 3C, D; S6, S7) used 18 flies (either all male or all female; half GFP and half non-GFP) and were performed as follows: GFP/IR imaging (1 min).
fru−/−/wild-type group FBI experiments
These experiments (Figure 4) used 12 flies (all male; half fru−/− and half GFP wild-type) and were performed as follows: GFP/IR imaging (10 s) – IR imaging (5 min) – GFP/IR imaging (10 s).
Courtship control experiments
These experiments (Figure S1C) used 2 flies (1 intact male and 1 headless female as in [21]) and were performed as follows: IR imaging (20 min). Male courtship behavior (defined as proximity/licking, wing-extension, or mounting) was manually scored.
Following all FBI experiments, Ctrax [2] was run on IR video data to obtain the position, orientation, and size of each fly. These data were then used to construct rectangular regions of interest (ROIs) on GFP fly images for subsequent analyses using custom-written shell scripts and Matlab scripts (The Mathworks, Natick, Massachusetts, USA). These scripts are freely available at freely available at http://lis.epfl.ch/FBI.
Automation Metric Evaluation Using Homogeneous Group Data
Homogeneous groups of flies were used for metric evaluations (Figure S2, S3, S4, S5) to ensure genotype identity and to provide a model for the distribution of data values. After tracking, a vector of pixel values from Total, Front, and Rear ROIs were extracted for each fly in each image (Figure 2D). Metrics were used to process these pixel values. One-thousand threshold values within the possible range were tested on the output of each metric. Flies with metric values above a given threshold were assigned the identity of GFP fly while those below this threshold were assigned the identity of non-GFP fly. These assignments were tested against the known genotype of each fly to determine the error or, inversely, the discrimination accuracy (100% - error%). Our comprehensive evaluations yielded two metrics with best discrimination accuracy: Max 5% Ratio and Skewness (Figure S2). Max 5% Ratio is the time-averaged mean of the maximum 5% pixel values in the Front ROI divided by the time-averaged mean of the maximum 5% pixel values in the Rear ROI. The ratio of single maximum pixel values (Maximum Front ROI/Maximum Rear ROI) performed equally well but was not selected due to low robustness against pixel value noise. Skewness was measured using the Matlab function of the same name and is defined as follows:
Where the skewness s, is defined by the mean of the data x, μ, the standard deviation of x, σ, and the expected value of t, E(t) [22].
Automation Tests Using Heterogeneous Group Data
For heterogeneous group experiments, the fluorescence identity ground-truth for each fly was obtained by human observer evaluation of videos with ROIs superimposed on GFP images. When the algorithm did not take the known number of GFP flies into account (Figure S6), it used proportions and thresholds that generated the largest cross-section of maximum accuracy regions in the discrimination accuracy heat maps from homogeneous group experiments (Figure S5). When the algorithm took the known number of GFP flies into account (Figure 3C, D & Figure S7), for each experiment, values for each fly obtained using the mixture of metrics were sorted in descending order. The top N flies, where N is the number of GFP flies expected, were assigned the identity GFP fly, while those remaining were assigned the identity non-GFP fly. Image-number analyses (Figure 3) were performed using data from both the beginning and the end of each experiment to exploit fly movement and reduce the impact of spatial inhomogeneity in fluorescence illumination. For example, when two images were used, one image was taken from the start of the experiment and one was taken from the end. Additional image-number analyses (Figure S7) only took images from the end of the experiment. All measurements and evaluations were performed using custom-written Matlab scripts (The Mathworks, Massachusetts, USA).
FBI Courtship Experiment Analysis
For FBI male-male courtship experiments, videos were first processed using Ctrax and FBI post-processing scripts to derive the behavioral statistics and genotypic identity of each fly. Subsequently, videos with tracking/genotypic identity overlaid (a modification of Ctrax’s showtrx.m script named showtrx_GENO.m available at: lis.epfl.ch/FBI) were manually annotated for courtship chasing/orientation events. For each event, the genetic identity of the chase target and the duration of the chase were noted. Data were tested for normality using the Lilliefors test. Normally distributed courtship probability and duration data were analyzed using the Student’s t-test. Non-normal chase duration data were compared using the Wilcoxon rank sum test.
Supporting Information
Acknowledgments
We thank Pawel Lichocki, Lucia Prieto Godino, Bryan Schubert and Suliana Manley for helpful comments.
Funding Statement
This work was supported by Human Frontier Science Program Long-Term Fellowship (www.hfsp.org); SystemsX.ch Initiative (http://www.systemsx.ch/); and a European Research Council Starting Independent Researcher Grant (http://erc.europa.eu/starting-grants). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Dahanukar A, Ray A (2011) Courtship, aggression and avoidance: pheromones, receptors and neurons for social behaviors in Drosophila. Fly (Austin) 5: 58–63. [DOI] [PubMed] [Google Scholar]
- 2. Branson K, Robie AA, Bender J, Perona P, Dickinson MH (2009) High-throughput ethomics in large groups of Drosophila. Nat Methods 6: 451–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Dankert H, Wang L, Hoopfer ED, Anderson DJ, Perona P (2009) Automated monitoring and analysis of social behavior in Drosophila. Nat Methods 6: 297–303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Schaefer AT, Claridge-Chang A (2012) The surveillance state of behavioral automation. Curr Opin Neurobiol 22: 170–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Hagler JR, Jackson CG (2001) Methods for marking insects: current techniques and future prospects. Annu Rev Entomol 46: 511–43. [DOI] [PubMed] [Google Scholar]
- 6. del Valle Rodríguez A, Didiano D, Desplan C (2011) Power tools for gene expression and clonal analysis in Drosophila. Nat Methods 9: 47–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Geyer PK & Fyrberg EA (1986) 5′-flanking sequence required for regulated expression of a muscle-specific Drosophila melanogaster actin gene. Mol Cell Biol 6: 3388–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Martin JR, Ernst R, Heisenberg M (1999) Temporal pattern of locomotor activity in Drosophila melanogaster. J Comp Physiol A. 184(1): 73–84. [DOI] [PubMed] [Google Scholar]
- 9. Gailey DA, Hall JC (1989) Behavior and cytogenetics of fruitless in Drosophila melanogaster: different courtship defects caused by separate, closely linked lesions. Genetics 121: 773–85. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Ito J, Ghosh A, Moreira LA, Wimmer EA, Jacobs-Lorena M (2002) Transgenic anopheline mosquitoes impaired in transmission of a malaria parasite. Nature 417: 452–5. [DOI] [PubMed] [Google Scholar]
- 11. Swierczek NA, Giles AC, Rankin CH, Kerr RA (2011) High-throughput behavioral analysis in C. elegans. Nat Methods 8: 592–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Cachat J, Stewart A, Utterback E, Hart P, Gaikwad S, et al. (2011) Three-dimensional neurophenotyping of adult zebrafish behavior. PLoS ONE 6: e17597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. de Chaumont F, Coura RD, Serreau P, Cressant A, Chabout J, et al. (2012) Computerized video analysis of social interactions in mice. Nat Methods 9: 410–7. [DOI] [PubMed] [Google Scholar]
- 14. Shaner NC, Steinbach PA, Tsien RY (2005) A guide to choosing fluorescent proteins. Nat Methods 2: 905–9. [DOI] [PubMed] [Google Scholar]
- 15. Grover D, Yang J, Tavaré S, Tower J (2008) Simultaneous tracking of fly movement and gene expression using GFP. BMC Biotechnol 8: 93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Lippincott-Schwartz J, Patterson GH (2009) Photoactivatable fluorescent proteins for diffraction-limited and super-resolution imaging. Trends Cell Biol 19: 555–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Bischof J, Maeda RK, Hediger M, Karch F, Basler K (2007) An optimized transgenesis system for Drosophila using germ-line-specific phiC31 integrases. Proc Natl Acad Sci USA 104: 3312–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Markstein M, Pitsouli C, Villalta C, Celniker SE, Perrimon N (2008) Exploiting position effects and the gypsy retrovirus insulator to engineer precisely expressed transgenes. Nat Genet 40: 476–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Ai M, Min S, Grosjean Y, Leblanc C, Bell R, et al. (2010) Acid sensing by the Drosophila olfactory system. Nature 468: 691–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Stockinger P, Kvitsiani D, Rotkopf S, Tirián L, Dickson BJ (2005) Neural circuitry that governs Drosophila male courtship behavior. Cell 121: 795–807. [DOI] [PubMed] [Google Scholar]
- 21. Grosjean Y, Rytz R, Farine JP, Abuin L, Cortot J, Jefferis GS, Benton R (2011) An olfactory receptor for food-derived odours promotes male courtship in Drosophila. Nature 478(7368): 236–40. [DOI] [PubMed] [Google Scholar]
- 22.MATLAB version 7.13.0 (2011) Natick, Massachusetts: The MathWorks, Inc.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.