SUMMARY
Deciphering how brains generate behavior depends critically on an accurate description of behavior. If distinct behaviors are lumped together, separate modes of brain activity can be wrongly attributed to the same behavior. Alternatively, if a single behavior is split into two, the same neural activity can appear to produce different behaviors. Here, we address this issue in the context of acoustic communication in Drosophila. During courtship, males vibrate their wings to generate time-varying songs, and females evaluate songs to inform mating decisions. For 50 years, Drosophila melanogaster song was thought to consist of only two modes, sine and pulse, but using unsupervised classification methods on large datasets of song recordings, we now establish the existence of at least three song modes: two distinct pulse types, along with a single sine mode. We show how this seemingly subtle distinction affects our interpretation of the mechanisms underlying song production and perception. Specifically, we show that visual feedback influences the probability of producing each song mode and that male song mode choice affects female responses and contributes to modulating his song amplitude with distance. At the neural level, we demonstrate how the activity of four separate neuron types within the fly’s song pathway differentially affects the probability of producing each song mode. Our results highlight the importance of carefully segmenting behavior to map the underlying sensory, neural, and genetic mechanisms.
In Brief
For 50 years, Drosophila melanogaster song was thought to consist of only two modes: sine and pulse. Using unsupervised classification methods, Clemens et al. establish the existence of three distinct song modes. They show how this distinction affects the interpretation of the mechanisms underlying song production and perception.
INTRODUCTION
A central aim of systems neuroscience is to combine neural activation, silencing, and recording techniques to causally link particular neurons and circuits to behavior and to show how activity patterns within these neurons relate to behavioral activity. Computational methods (e.g., [1–3]) are facilitating the automated detection and classification of behaviors to drive these studies forward, particularly in genetic model systems, such as worms, flies, and mice [4, 5]. But solving the neural basis for behavior requires the appropriate temporal and spatial parsing of behaviors in order to generate meaningful connections with the neural and muscle activity patterns that drive them [6]. A comprehensive classification may reveal that a seemingly variable behavior is in fact a mixture of distinct and stereotyped behavioral modes, each with its own neural basis and evolutionary trajectory. Failing to discriminate distinct behavioral modes will make the underlying brain activity appear noisy or redundant, because distinct activity patterns will appear to produce a single behavior. Here, we apply unsupervised analysis techniques to examine a behavior that has been studied for more than 50 years [7]—the courtship song produced by males of the species Drosophila melanogaster.
During courtship, D. melanogaster males vibrate their wings to produce an acoustic signal to convince the female that he is a suitable mating partner. Manual inspection of song recordings has identified two distinct song modes: sine song, which consists of a sustained sinusoidal oscillation, and pulse song, which comprises trains of short impulses [7, 8]. Based on this observation, automated methods were developed to effectively detect and segment these two modes of song [9], and using this software, large datasets have been analyzed to link both genes and neural activity with song patterns [10–14]. However, the claim that song comprises only two modes has never been tested using statistical methods, leaving open the possibility that more than two modes exist (as has been indicated for other Drosophila species, including D. simulans and D. mauritiana [15]).
Whether D. melanogaster produces additional song modes has important consequences for the study of the neural basis of behavior. For instance, previously described neural circuits for song production might be incomplete if they only explain the production of a subset of song modes. In addition, the arrangement of song modes into bouts depends on dynamic sensory feedback from the female: males bias toward singing more pulse song when moving fast or further from the female [10] and also modulate the amplitude of their pulses with distance to the female [11]. This patterning strategy also has an impact on female behavior [12]. Elaborating the song modes of Drosophila melanogaster therefore has implications for studies of how sensory information is integrated into or modulates the motor circuits that generate song.
By analyzing a large dataset of song recordings with unsupervised classification methods, we here show that D. melanogaster produces two distinct types of pulses in addition to sine song — one pulse type with relatively low frequency and a more symmetrical shape and one pulse type with relatively faster frequency and asymmetrical shape, which we term Pslow and Pfast, respectively. We then show how the discovery of two distinct pulse modes affects our interpretation of (1) the evolution of pulse song, (2) the sensory cues controlling pulse song amplitude, and (3) the neuronal basis of pulse song production. By examining pulse structure in D. melanogaster’s close relatives D. simulans, D. mauritiana, and D. sechellia, we find that the distinction between Pslow and Pfast is common to three of these species, although D. sechellia only produces Pslow. We also show that Pfast is much louder than Pslow and produced more frequently when the male is far away from the female. The choice between these two pulse types is thus part of the male strategy to sing louder when the female is distant [11]. Additionally, we show that the activity of identified song pathway premotor and motor neurons [14, 16] affects the choice between different song modes. Lastly, we demonstrate that the two pulse types differentially affect female behavior, suggesting these communication signals have different meanings for the female. Overall, our results illustrate how correct segmentation of behavior is important for understanding the evolutionary, sensory, and neural drivers of behavior.
RESULTS
Drosophila melanogaster Courtship Song Comprises at Least Three, Not Two, Distinct Song Modes
To determine how many song modes Drosophila melanogaster males produce, we clustered 25-ms waveforms of previously collected raw song recordings [10]. We first classified each sample as signal or noise based on signal amplitudes that exceeded background noise (33% of the waveforms passed this criterion); no other constraints on waveform shape were imposed (Figure 1A; see STAR Methods for details). We then aligned the signal samples to their peak energy, normalized them to correct for differences in waveform amplitude (induced by variations in the intensity of male singing or his position relative to the microphone), and adjusted their sign so that the waveform was positive immediately preceding the peak (this was done because waveform inversions are most likely caused by changes in male position relative to the microphone) (Figure 1B). The resulting set of ~20,000 normalized signals from 47 wild-type males of the strain NM91 contained courtship song as well as non-song noises originating from grooming, jumping, or other behaviors.
To facilitate the classification and the visualization of the waveforms, we reduced the dimensionality of the dataset using t-distributed stochastic neighbor embedding (tSNE) [2, 17]. The tSNE method is a nonlinear dimensionality reduction method that preserves local similarity structure in a dataset and is therefore particularly suited for classification. We clustered the low-dimensional representation of our signals in two steps. First, we partitioned the signal distribution along local minima using the watershed algorithm (Figure 1C). This procedure initially yielded seven clusters, which represent an over-partitioning of the waveform space, because it cuts the signal space along relatively weak local minima and thereby assigns similar waveforms to different clusters. We therefore employed a second hierarchical clustering step to consolidate the watershed clusters based on the similarity of their centroids (Figure 1D), and we chose the number of modes based on the similarity of the waveforms in the watershed clusters (Figure 1E). This resulted in four distinct signal modes (Figure 1D): (1) A “noise” mode lumps all signals that lack common structure across exemplars. (2) A “sine” mode joins three clusters that contain waveforms with sustained oscillations that tile a continuum of carrier frequencies between 120 and 180 Hz (Figure S1A). The three sine clusters all contain similar waveforms as indicated by the strongly overlapping and high correlation values (Figure 1E, right). This justifies them being merged into a single sine song mode by the hierarchical cluster algorithm. (3) A “pulse” mode with relatively slow (200–250 Hz) and symmetrical waveforms connects two very similar clusters with either a stronger leading or lagging lobe. These two clusters form a single pulse mode, because the waveforms are highly correlated (see Figure 1E, left), and they are thus joined by the hierarchical cluster algorithm. (4) A second “pulse” mode that, in contrast to the previous one, is biphasic (asymmetric) with faster oscillations (250–400 Hz). This pulse mode is highly dissimilar with the previous pulse mode (correlation values centered around zero; Figure 1E, left), indicating that it forms a distinct pulse mode. We term the two pulse modes “Pslow” and “Pfast,” according to their frequencies. Note that, whereas there is considerable variability within each song mode, the modes are clearly distinct from each other.
To facilitate analyses of the production, perception, and evolution of these two pulse types within Drosophila, we developed a simpler and more efficient pulse type classifier. The method takes as input pulses detected from an automatic song segmenter [9, 10]—which can detect both pulse types with high speed and reliability (detection rates: 78% for all pulses; Figure S1B; 80% for Pslow; 77% for Pfast)—and then classifies returned pulses based on their similarity with templates derived from the tSNE analysis pipeline (Figures 2A and S1C; see STAR Methods for details). This classifier reliably and efficiently reproduces the classification from the full analysis pipeline into the slow, symmetrical Pslow and the fast, asymmetrical Pfast in all individual males of eight geographically diverse wild-type strains, suggesting that the existence of two pulse types is common across isolates of Drosophila melanogaster (Figures 2B, S1D, S2A, and S2B).
We next ran several control analyses to ensure that the two pulse types were not data recording or analysis artifacts. First, we were able to recover both pulse types when using hand-segmented pulses (Figure S2E), when using linear as opposed to nonlinear dimensionality reduction (principal-component analysis instead of tSNE; Figure 2C), when omitting dimensionality reduction altogether (Figure S2F, left), and also when omitting parts of the normalization procedure (Figure S2G), demonstrating that our finding does not crucially depend on the details of our analysis pipeline. In addition, the existence of two pulse types could be due to the male changing his position relative to the directional microphones used to record song. However, when we induced singing via thermogenetic activation of P1 song pathway neurons in a tethered male walking on a spherical treadmill [11]—thereby fixing his position relative to the microphone—we still observed both pulse types (Figure 2D). Lastly, we examined the fly’s pose during singing and found that the two pulse types are associated with distinct, largely unilateral, wing positions (Figures 2E and 2F): Pslow is produced with one wing more fully extended (ca. 60°) and Pfast is produced most often with one wing only weakly extended (ca. 20°). Together, these analyses demonstrate that the Pslow and Pfast are separate song modes, likely to be produced by separate motor programs.
For all three song modes, we observed that the secondary wing was minimally extended to angles <10degrees (Figure 2F). To determine whether the secondary wing contributes to song production, we recorded song from courting males that had one wing cut (either left or right). As expected, these males showed a reduction of singing to ~50% of intact males (Figure S3A). Whereas these males still produced both pulse types, the individuality of these pulses (how similar a given male’s pulses were to the population average) was strongly reduced (Figures S3B and S3C). This suggests that the existence of two wings increases the individuality of pulse shapes produced by males, and hence, the function of the secondary wing may be to make the pulse shape more idiosyncratic. We do not know whether the female is sensitive to these small changes in pulse shape. However, we observed no strong effect of cutting one wing on pulse variability (Figure S3D).
How do the two pulse types—Pfast and Pslow—in Drosophila melanogaster compare to the pulse types produced in other species [15]? We applied our analysis pipeline to song data from three closely related species: Drosophila simulans; Drosophila mauritiana; and Drosophila sechellia (Figure 2G). Although D. simulans and D. mauritiana have been shown previously to produce two pulse types [15], little is published about the pulse shape of D. sechellia [19, 20]. Because the pulses in D. simulans and D. mauritiana are about twice as fast as D. melanogaster, we adapted the automated song segmentation and pulse normalization procedure to reliably detect and classify pulses in these related species (see STAR Methods for details; Figure S1B). Similar to D. melanogaster, pulses reproducibly clustered into two major pulse types for all strains of D. simulans and D. mauritiana examined (Figure 2H). One cluster contains slow, symmetrical pulses and the other faster, asymmetrical pulses (Figures 2H–2J, S2A, S2B, and S2D). D. melanogaster pulses are distinguishable from D. simulans and D. mauritiana pulses only by frequency: Pslow and Pfast are shifted from ~220 to ~470 Hz and from ~350 Hz to 670 Hz with only small differences between D. simulans and D. mauritiana (Figures 2I and 2J). D. sechelia produced only a single, symmetrical pulse type that resembles Pslow and is of slightly lower frequency than Pslow in D. melanogaster (Figures 2H, 2I, S2A, and S2D). That our analysis does not always classify pulses into two types validates our approach. Note that our failure to find Pfast in D. sechellia is not proof of its non-existence—the species could produce Pfast under different experimental conditions. This result also shows how false classification can lead to false estimates of the magnitude of evolutionary changes in pulse shapes: when comparing the pulse frequencies in D. sechellia and D. melanogaster, disregarding the distinction between Pfast and Pslow, the difference appears >100% higher than when doing the correct comparison of D. sechellia pulses with Pslow in D. melanogaster (Figure S2H). The phylogenetic pattern of pulse shapes in the four species suggests the following hypothesis about the evolution of pulse shape in the melanogaster species group: D. simulans and D. mauritiana increased the speed with which two ancestral motor primitives—one for each pulse type—were executed. D. sechellia has lost Pfast without significant change to the frequency of Pslow. Further studies are necessary to test these hypotheses. In the sections below, we investigate how the existence of two distinct pulse modes affects our understanding of courtship song patterning and perception in D. melanogaster.
Pulse Type Choice Contributes to the Modulation of Song Amplitude with Distance
Males structure their songs into bouts composed of trains of sines and pulses [9], with the choice to sing either sine or pulse biased by sensory feedback from the female [10]. How does classifying song into three modes affect our understanding of how song is patterned by feedback cues? We found that males do not continually switch between Pfast and Pslow but compose pulse trains of one type or the other more frequently than expected by chance (Figures 3A and 3B). By employing generalized linear models [10, 21] to predict pulse type from sensory and movement features (see STAR Methods), we found that the distance between the male and female was the strongest predictor of pulse type (Figures 3C and S3E). Males from eight different wild-type strains consistently bias toward producing Pfast at larger distances from the female (Figure 3D). In addition, we observed that blind males (but not deaf or pheromone-insensitive males) produce a significantly higher fraction of Pfast pulses (Figure 3E), indicating that vision plays a role in the choice between Pslow and Pfast. This builds on previous work that demonstrated males produce sine song when close to the female [10]; we now see that males continually bias toward modes with faster—and higher intensity—wing movements, as their distance from the female increases (Figure 3F). We tested each pulse mode for evidence of amplitude modulation with distance [11] and found that, although Pfast amplitude increases with distance (males ratchet up Pfast amplitude as they get farther from the female without changing pulse shape; Figures S3F and S3G), Pslow amplitude modulation was comparatively weak (Figure 3G). Taken together, these data suggest a reinterpretation of previous work on amplitude modulation [11]: the behavior is driven by at least two processes—the binary choice to sing the softer or the louder pulse type (Pslow versus Pfast) coupled with continuous modulation of Pfast amplitude.
Individual Cell Types within the Song Pathway Modulate the Choice between Song Modes
Above, we showed that males select between pulse types based on sensory feedback (Figures 3C and 3D) and that these two pulse types correspond to different wing poses (Figure 2E). This suggests that neurons within the fly’s song pathway can bias the motor output toward one of the three song modes. To test this hypothesis, we focused on four previously characterized Fruitless+ neurons implicated in driving wing extension and/or generating pulse song (Figures 4A and S4B): P1, a cluster of ~20 central brain neurons per hemibrain [24, 25]; pIP10, a pair of neurons that descend from the brain to the ventral nerve cord (VNC) [16]; vPR6, a set of ~5 thoracic interneurons per side [16]; and ps1, a wing muscle motor neuron in the VNC [14]. Previous work indicates that P1, pIP10, and vPR6 constitute a connected pathway from the brain to the wing motor neuropil of the VNC [16].
P1, pIP10, or vPR6 neural activation results in song production, even in the absence of a female [11, 16]. P1 neurons are known to be involved not only in song production but also in the integration of pheromonal and visual signals from the female [25–28], the production of aggressive behaviors [23], and overall mating drive [29]. We reasoned that, when males are close to females, the pheromonal and visual cues provided by the female would be particularly strong and hence P1 activity should be elevated. Because males bias toward both sine song and Pslow production when close to the female [10] (Figure3D), optogenetically activating P1 neurons should preferentially drive either sine or Pslow, but not Pfast, even in the absence of a female. pIP10 and vPR6 neurons, via connections to P1 [16], might therefore exert similar effects on song patterning.
To investigate how the activation of P1 and pIP10 affects the probability of producing the three different song modes, we used csChrimson [30] to optogenetically stimulate each neuron type in solitary males. We chose a light-emitting diode (LED) stimulation protocol that allowed us to vary the duty cycle (DC) between 0.1 and 0.9 during three-second-long trials, thereby leaving either very long pauses (DC = 0.1 corresponds to a pause of 2,700 ms) or very short pauses (DC = 0.9 corresponds to a pause of 300 ms) between subsequent stimulations—in other words, short DCs correspond to weaker stimulation and long DCs to stronger stimulation. Consistent with the idea that P1 encodes female proximity, males produced mainly sine song during P1 activation, and both Pfast and Pslow were largely suppressed (Figures 4B1, 4B2, 4C1, and 4C2). We observed this effect for two genetic drivers that target P1 neurons: R71G01 [22] and a split GAL4 driver termed P1a that is the intersection of R71G01 with a second driver [23] (Figure S4B). Interestingly, for P1a, sine song was only dominant for DCs < 0.8; at higher DCs, pulse song was more dominant, suggesting that sine production requires a pause in P1a activity (Figures S4A2 and 4C2). Pfast and Pslow probabilities exhibited similar dynamics for both P1 drivers (Figures 4B1, 4B2, S4A1, and S4A2), suggesting that P1 mainly biases the choice between sine and pulse song, affecting the balance between Pfast and Pslow relatively little. Moreover, P1 activation did not affect the waveform shape of either Pfast or Pslow, only the probability of producing pulses (Figures S4C and S4D).
pIP10 is a descending neuron thought to be postsynaptic to P1 [16]. However, we found that pIP10 does not simply relay P1 activity, because Pfast, and not sine song, was the dominant mode produced during optogenetic activation (Figures 4B3 and 4C3). Sine became the dominant song mode only after stimulation ended and only if the pauses between stimulation were sufficiently long (DCs ≤ 0.9). Consistent with these results, we also found that pIP10 inactivation (using either Kir2.1 [31] or TNT [32]) during courtship with a female decreased Pfast production (Figure 4E). We found that the probability of producing Pslow was largely independent of optogenetic stimulus duration, whereas the opposite was true for Pfast: for short DCs (<0.6), we observed high peak probabilities, intermediate DCs (0.6–0.8) produced an onset transient with reduced steady-state levels, and at the longest DCs, Pfast levels were relatively constant throughout the stimulation period and resembled those of Pslow (Figures 4B3 and S4A3).
vPR6 is a thoracic local neuron thought to be postsynaptic to pIP10—constitutive thermogenetic activation of vPR6 is known to modulate the inter-pulse interval of pulse trains and so this neuron was proposed to be part of the central pattern generator for pulse song (Figure 4B4) [16]. We found that optogenetic activation of vPR6 drives only pulses (and no sine) and that Pfast and Pslow production dominate early and late during stimulation, respectively (Figure 4C4). Thus, this neuron affects the choice between Pfast and Pslow—weak stimulation drives Pfast, but stronger stimulation can drive Pslow.
Our results with P1, pIP10, and vPR6 activation reveal complex dynamics in the neural circuits driving song production. For activation of all three cell types, we observed that stimulus history (and thereby the history of song produced) has a strong impact on the song types elicited by optogenetic activation (Figures 4B and S4A). Although optogenetic activation of all three cell types affected male speed (Figure S4A), we observed no fixed relationship between male speed and song choice among the three cell types examined here, suggesting that the song dynamics induced by optogenetic activation cannot be explained as an indirect effect of changes in male speed alone.
Finally, we investigated the activity of a song pathway motor neuron implicated in regulating both the amplitude and carrier frequency of song pulses [14]. This effect is consistent with ps1 being involved in Pfast production, because Pfast is the louder and faster of the two pulse types. Because activation of ps1 alone does not generate song, we instead silenced ps1 using tetanus toxin (TNT) and paired males with females to determine the effect of ps1 silencing on song production. We found that males with ps1 neurons inactivated still produce both pulse types, with no deficit in the shape of either Pslow or Pfast (Figure 4I) and only a subtle effect on the frequency of Pslow pulses (Figures S4E and S4F). Instead, silencing ps1 strongly affects the switching between Pslow and Pfast with distance (Figure 4F) and thereby reduces the amount of Pfast pulses being produced (Figure 4G): wild-type flies bias away from Pslow with increasing distance, but ps1 inactivated flies continue to sing Pslow even when far from the female. ps1 inactivation also reduces the amplitude of both pulse types, demonstrating that it has additional functions beyond pulse choice (Figure 4H). This implies that, although ps1 is not strictly necessary for Pfast production —~10% of pulses are still of the Pfast type and indistinguishable from wild-type Pfast pulses (Figures 4I, S4E, and S4F)—this motor neuron contributes to the choice between pulse types and thereby the overall structure of male song.
Female Song Responses Depend on Pulse Type Choice in Males
Up to now, we have shown that the distinction between Pfast and Pslow is functionally significant for the male, for instance, as a strategy to efficiently produce pulse song at different amplitudes (Figures 3F and 3G). That is, it may be energetically more efficient for the male to produce Pfast at high amplitudes. Under this scenario, two pulse types could evolve without having different effects on the female, and experimentally discriminating the two pulse types would still be important for our understanding of song evolution (Figures 2G–2J) and of the role of particular neuron types in patterning song (Figures 3 and 4). However, independent of their function for the male, the two pulse types could also serve as distinct signals to the female. We hence examined whether females respond differentially to Pfast and Pslow and whether regular male song mode choice—e.g., producing more Pslow when close to the female (Figure 3D) — influences mating success.
Previous studies have shown that sine and pulse song are correlated with a reduction of locomotor speed in sexually receptive females and that this correlation depends on females being able to hear the male song [10, 12]. We examined female locomotor speed relative to the amount of each song mode the male produced. We used time windows of 30 s to investigate this correlation, because previous work demonstrated that females integrate across song bouts [12]. When examined across all distances, Pslow and sine are negatively correlated with female speed (Figures 5A and 5B), and there is no correlation between Pfast and female speed (Figure 5C). However, males choose between pulse types in a context-dependent manner (Figure 3D)—distance-dependent female responses would indicate that females discriminate between pulse types and between the context in which they are produced. Indeed, when examining correlations relative to male-female distance, we found that all three song modes have the potential to slow females (Figure 5D). Pslow and sine were correlated with reductions in female speed across a wide range of distances, whereas Pfast was correlated with a reduction in female speed only when males were far away from females (and then only weakly). All three modes were correlated with increases in female speed when produced very close to the female.
If females are sensitive to the “correct” song mode choice, then experimental disruption of the male song pattern during courtship should result in a less attractive song and consequently reduced copulation. Males with inhibited pIP10 or ps1 still sing abundantly but produce either less Pfast or do not effectively switch between pulse types based on distance (Figures 4E–4G; see figure legends for number of pulses produced); we found that these males also copulated less with wild-type females (Figures 5E and 5F). Likewise, optogenetic activation of pIP10 in males during courtship abrogates distance-dependent pulse choice (Figure S5A) and disrupts song mode choice in general (Figure S5B). The resulting song was unattractive to females: relative to controls for this genotype, both pulse types are now correlated with female acceleration (versus slowing) when pIP10 neurons are optogenetically activated in males (Figure 5G). In addition, pIP10-activated males copulated less (Figure 5H). Overall, this suggests that females discriminate between the two pulse types and that the context in which each pulse type is produced is likely to be an important determinant of a song’s attractiveness. Further elucidation of the differential roles of Pslow and Pfast on female behavior will most likely require higher resolution readouts of female postural movement changes [2, 33, 34]. The reclassification of male courtship song therefore also has implications for the study of auditory perception in the female nervous system.
DISCUSSION
Here, we show that Drosophila melanogaster courtship song comprises at least three distinct modes, and we demonstrate how this distinction affects our interpretation of the mechanisms underlying evolution (Figure 2), song patterning (Figures 3 and 4), and song perception (Figure 5). Song patterning relies on the integration of multiple sensory cues, and discriminating between Pslow and Pfast now reveals a novel layer of song control. Previous studies had shown that, similar to humans, Drosophila males increase the amplitude of their acoustic communication signals relative to a visual estimate of distance to the receiver [11]. These studies implied a single control system that upregulates pulse amplitude with perceived distance. Analyzing the effect of distance on both pulse types separately, we now find that visual information mediates this amplitude modulation using two separate control mechanisms: first, a digital control system affects the binary pulse choice and biases pulse production toward the louder Pfast when far from the female (Figure 3D), and second, an analog control system upregulates a continuous variable—the amplitude of Pfast, but not of Pslow—with distance (Figure 3G). These two modes of control are most likely implemented through distinct circuits.
Female proximity cues activate the song premotor neuron P1 [25, 26], and using optogenetics, we have shown that activation of P1 biases solitary males toward singing more sine song (Figures 4B1, 4B2, 4C1, and 4C2)—which is naturally produced when the male is closest to the female [10]. However, P1 activation had no strong effect on the choice between PSlow and Pfast, suggesting that pulse choice is implemented elsewhere in the circuit (Figures 4B1 and 4B2). By contrast, pIP10 is thought to be downstream of P1 [16], but its activation strongly biases males toward producing Pfast instead (Figures 4B3 and 4C3). Thus, rather than being a relay of P1 activation, pIP10 also actively shapes the dynamics of song mode choice (Figures 4B and S4A). Parallel descending pathways (e.g., pMP2 [25, 35]) most likely bias motor circuits toward the production of sine song or Pslow, but these pathways have not yet been functionally characterized due to a lack of sparse and specific genetic drivers to target these descending neurons. The activation of P1 or pIP10 in solitary males produces all three song modes but with different probabilities and latencies. This suggests that a static analysis of song production is insufficient for understanding how song is generated in Drosophila. Rather, the activity dynamics of different elements of the song pathway shape what is sung and when. These activity dynamics will be a product of connections between song pathway neurons, the history of activation of each neuron, and the sensory pathways that bias the output toward each of the three song modes.
ps1 is a motor neuron previously implicated in setting the frequency and amplitude of pulses [14]. By segmenting the behavior to identify multiple pulse types, we find instead that this motor neuron does not strongly affect pulse shape (Figure 4I) but rather the choice of which pulse type to produce relative to the distance to the female (Figure 4F). This suggests that this neuron represents one of the ends of the pathway that connects visual information with pulse choice. The dynamical preference of the premotor neurons P1 and pIP10 for driving the production of particular pulse types (Figures 4B and 4C) could be mediated by differential recruitment of the ps1 muscle, which in turn could push the song production system into the Pfast mode by increasing thoracic rigidity [14]. Further studies into the activity of premotor neurons, motor neurons, and muscles [14, 16, 35, 36] during singing will shed additional light on the circuits that drive the three song modes.
We also found that females slow differentially to Pslow and Pfast (Figures 5A–5D) and that this slowing depends on the context: females reduce their locomotor speed to Pfast only when it is sung at farther distances (Figure 5D)—females may accelerate in response to Pfast when it is sung at close distances because they prefer quieter songs. Further, manipulations that disrupted the natural song mode choice strongly reduced male mating success (Figures 5E–5H), suggesting that context is an important determinant of the female mating decision. These results highlight that distinguishing between pulse types matters for studying the perception of song as well. Future experiments that allow precise control over the sensory cues available to the female—e.g., through sound playback [37]—will be necessary to elucidate the neural basis of the differential slowing response.
Whereas previous studies demonstrated the existence of multiple pulse types in Drosophila species [15, 38–40], our statistical analysis of pulse shapes in four species of the melanogaster group now shows that pulse shapes are conserved in these species (Figures 2H–2J and S2): D. melanogaster, D. simulans, and D. mauritiana all produce Pfast and Pslow, and D. sechellia only produces Pslow. The most parsiminous explanation for this pattern is that the existence of these two pulse types predates the species split and that D. sechellia has lost the ancestral Pfast type. Interestingly, D. yakuba—a member of the melanogaster group outside of the branch considered here (Figure 2G)—produces two pulse types termed “thud” and “clack” that are produced at different distances to the female [40, 41], just as with Pslow and Pfast in D. melanogaster. Notably, pIP10 activation in D. yakuba [41] and in D. melanogaster (Figure 4B3) biases the song toward the louder and higher frequency pulse type. This common neural control suggests that D. melanogaster Pfast and D. yakuba clack are functionally homologous, but further studies are required. It is also not yet clear whether distance to the female is the main driver of pulse choice in D. simulans and D. mauritiana.
In summary, by now segmenting Drosophila melanogaster song into three (not two, as previously thought) modes, we have revealed two major pitfalls when lumping distinct behavioral modes. First, the changes in the prevalence or the complete loss of one mode upon experimental manipulation will appear as changes in the shape of the lumped mode. For instance, the motor neuron ps1 was reported to change the frequency of pulses [14]. By discriminating two pulse types, we show that pulse frequency is relatively stable (Figure S4E) and that what changes is the prevalence of each pulse type with distance to the female (Figure 4G). The same danger looms for studies on the genetics and evolution of behavior—for instance, the loss of one pulse type can appear as a large change in overall pulse shape, even when the homologous pulse types change only little (Figure S2H). Second, lumping distinct behavioral modes may lead one to completely miss levels of neural control. For instance, previous studies showed that males produce louder pulses with increasing distance to the female [11]. Assuming the existence of a single pulse type, this implied a single, analog control mode. The discovery of two pulse modes now implies two control circuits for producing louder pulses: one for biasing toward the louder Pfast pulse type (Figure 3D) and one for upregulating the amplitude Pfast, but not of Pslow (Figure 3G). The existence of two pulse types also implies two functional modes of the song-pattern-generating network, potentially occupying non-overlapping sets of neurons. Premotor and motor neurons bias not only between sine and pulse but also between the two pulse types (Figure 4), suggesting that they are able to engage these different network modes as neural substrates for the distance-dependent pulse choice. The multiple hypothesized—and previously seemingly redundant—song-triggering descending neurons pIP10 [16] and pMP2 [24, 42] most likely play specific roles in biasing the downstream circuits toward producing one of the three song modes based on sensory cues. Likewise, although females respond to both pulse types, they do so in a context-dependent manner, and the effectiveness of courtship song depends on the male singing the right song mode in the right context (Figure 5). The female auditory system must therefore be able to discriminate between pulse types and to integrate acoustic information with context information to inform her behavior. All of the above examples hint at the existence of novel but still unknown circuits. This highlights the benefits of a detailed characterization of behavior for interpreting data from neural activation and silencing experiments, recordings of neural activity, and comparisons of behavior across species.
STAR*METHODS
CONTACT FOR REAGENTS AND RESOURCE SHARING
Further information and requests for resources and reagents should be directed to and will be fulfilled by the Lead Contact, Mala Murthy (mmurthy@princeton.edu).
EXPERIMENTAL MODEL AND SUBJECT DETAILS
Animals
If not stated otherwise, virgin male and female flies were isolated within 6 hr of eclosion and aged for 3-7 days prior to experiments. Flies were raised at low density on a 12:12 dark:light cycle, at 25°C and 60% humidity.
METHOD DETAILS
Behavioral chamber
Behavioral chambers were constructed as previously described [10, 11]. For optogenetic activation experiments we used a modified chamber whose floor was lined with white plastic mesh and equipped with 16 recording microphones. To prevent the LED light from interfering with the video recording and tracking, we used a short-pass filter (Thorlabs FESH0550, cut-off wavelength: 550 nm). When recording with a female, males were painted on the thorax with a white dot 20 hr prior to experiment under cold anesthesia for identification of sex during tracking. Flies were introduced gently into the chamber using an aspirator. Recordings were timed to be within 150 min of the behavioral incubator lights switching on to catch the morning activity peak. Recordings were stopped after 30 min or earlier if copulation occurred. If males did not sing in the first 5 min of the recording, the experiment was discarded. For the analysis of wing angles during the production of all three song modes, we recorded video at 100 frames per second (instead of 60 frames per second as in all other recordings) and used a chamber that was lined with fine nylon mesh (Ted Pella Nylon 300 Mesh) instead of the coarse mesh used for the other recordings. This created a cleaner background for video recordings and facilitated resolving the wings.
Optogenetic activation
Flies were kept for at least 3 days prior to the experiment on fly food supplanted with retinal (1 mL all-trans retinal solution (100mM in 95% ethanol) per 100 mL food).
CsChrimson [30] was activated using a 627 nm LED (Luxeon Star) at an intensity of 0.46 mW/mm2 (driving voltage 3.2V). Stimulus period 3 s, 9 duty cycles (DCs) ranging between 0.1 and 0.9, filling between 300 and 2700ms of the period with constant LED illumination. Each DC was presented in randomized blocks of 90 s (30 trials each). For the experiments of males paired with females (Figures 5G and 5H), we only used DCs 0.3 and 0.9. When optogenetic activation experiments were performed using solitary males (Figure 4), recordings were not timed to peaks in circadian activity. When paired with wild-type females (Figure 5), recordings were timed to be within 150 min of the behavioral incubator lights switching on to catch the morning activity peak. For these experiments, we used control flies that had the same genotype but were kept on regular food without added retinal. Sound recording and video were synchronized by positioning a green LED that blinked with a predetermined temporal pattern in the field-of-view of the camera and whose driving voltage was recorded alongside the song.
QUANTIFICATION AND STATISTICAL ANALYSIS
Song segmentation and tracking
Both tracking of flies and segmentation of song recordings were performed as previously described [10, 11, 48]. Pulses were detected with high detection rates (true positive rates 78%, 74%, 85% for D. melanogaster, simulans and mauritiana) and few false detections (false positive rate ~2% for D. melanogaster and D. simulans, 7% for D. mauritiana because some pairs produced substantial movement noise during courtship) (Figure S1B). Detections of the same pulse on different microphones of the chamber were consolidated and the pulse waveform was taken from the loudest channel. The song segmenter (with the parameters used in [10], not those of [9] - see [49] for a comparison of segmenter performance with these different parameters) for D. melanogaster detected both Pfast and Pslow at high rates without requiring any modifications (77% and 80%, respectively). We now extended the song segmenter to classify pulses into Pfast and Pslow (available at https://github.com/murthylab/songSegmenter, see below). For segmenting the pulses produced by D. simulans, D. mauritiana we modified the segmenter by building strain specific pulse models from sets of manually annotated data (~500 pulses per strain).
The song segmenter performed poorly for recordings from D. sechellia with a low true positive rate of only 53% and a high false positive rate 22%. Classification of these automatically segmented pulses yielded two clusters: One that contained virtually all the true positive pulses and resembled Pslow, and one that was dominated by the false positives. The “pulses” from this second cluster did not occur in regular trains of fixed IPI. By contrast, the waveforms in the Pslow cluster almost always occurred in regular trains with clearly peaked IPI distributions. For all other species, pulses of both types occurred in regular trains with near identical IPI distributions (Figure S2C) and with similar and small false positive rates. We therefore used hand-segmented waveforms for classification of D. sechellia pulses. Both pulse types were detectable in hand-segmented data for all other strains used (e.g., Figure S2E).
Analysis of wing angles during song production
To compute wing angles during bouts of song, we estimated the positions of wing tips and the center line of the thorax and abdomen in each song frame. Each video frame was cropped and oriented such that the fly was always facing right, irrespective of its position in the behavioral chamber. We manually annotated single frames and then trained a convolutional neural network to automate the annotation of wings and the center line of the thorax and abdomen [34]. Wing angles were normalized to fall in the range [−180°, 180°] with 0° denoting the wing retracted state and >0° denoting wing positions away from the center line. To plot the fly body position density (Figure 2E) for each song mode, we flipped the foreground mask of each fly in each video frame such that the maximally extended wing was always on the left side of the body. For these experiments, song was segmented using custom software optimized for low signal to noise scenarios.
Waveform selection for segmented pulses
For D. melanogaster and D. sechellia pulse waveforms were extracted from the recordings by taking 25.1 ms (251 samples at 10 kHz) around the pulse center detected by the song segmenter from the channel on which the pulse was recorded with the highest energy. For the much shorter pulses produced by D. simulans and D. mauritiana, 25.1 ms around the pulse peak contained mostly noise. We therefore upsampled the waveforms to 20 kHz and extracted the same 251 samples - now corresponding to 12.55 ms - around the pulse center.
Waveform selection for unsupervised analysis of song waveforms
For the unsupervised analysis of song structure (Figure 1), we selected waveforms from the raw recordings without using the song segmenter. As a first step, all 9 recording channels were merged by choosing the signal from the channel with the highest absolute value in 5 ms windows. To avoid our analysis being drowned out by background noise (up to 90% of a song recording can contain no song), we chose only signals that exceeded the energy of the recording noise. Sound energy was calculated by estimating the signal envelope using the Hilbert transform, transforming it to a logarithmic scale and then smoothing it using a sliding window lasting 100ms. The logarithmic transformation produced an energy distribution that resembled a mixture of two Gaussians, one corresponding to baseline noise and one corresponding to song signals. We fitted a Gaussian Mixture Model with two components to the distribution energy values and identified as “signal” the component with the higher mean energy. From all time points classified as signal we then extracted 25 ms long, non-overlapping waveforms. Note that the resulting data set contained many snippets that contained only noise if they were at the very beginning of or between song bouts.
Waveform normalization
To remove variability in the waveforms arising from the position and distance of the singing male from the microphone, we
Divide the raw waveforms, x(t), by their norm: .
Center to their peak energy obtained by smoothing the square pulse waveform with a rectangular window spanning 15 samples (1.50 ms for D. melanogaster and D. sechellia, 0.75 ms for D. simulans and D. mauritiana).
Flip sign such that the average of the 10 samples (1.00 ms for D. melanogaster and D. sechellia, 0.5 ms for D. simulans and D. mauritiana) preceding the pulse center is positive.
Timescales were shortened for D. simulans and D. mauritiana to enable robust alignment of the much shorter pulses produced by these species (see e.g., Figure 2H). The last step – adjusting waveform sign – does not artifactually create clustering into two pulse types but rather reduces the degeneracy of the waveform space since the same pulse can occur with different signs on different microphones, indicating that the position of the male wing relative the microphone (in front of or behind) determines waveform sign. Clustering waveforms with randomized sign yields clustering in which each pulse type appears in two clusters – one in its original shape and one with the sign inverted (Figure S2G).
Dimensionality reduction
The dimensionality of the normalized waveforms was reduced from 251 data points per waveform to two dimensions using two different methods:
T-distributed stochastic neighbor embedding (tSNE) preserves local neighborhood structure while ignoring global similarity between data points. It is therefore ideally suited for cluster analysis, which relies on local similarity. To embed large datasets (more than 100000 pulses) we chose the computationally and memory-efficient Barnes-Hut implementation [44] (parameters: initial PCA dimensions = 30, perplexity = 50, accuracy = 0.1). The shape of the embedded distribution was highly reproducible across runs and did not depend critically on the choice of parameter values. Since large perplexity values increased the separation between clusters - while preserving the overall shape of the embedded distribution - we chose a perplexity value of 50.
Principal component analysis (PCA) is a linear method for dimensionality reduction and works by finding a small set of orthogonal basis functions in which the global data variance is maximized. Clustering using a PCA-based representation of the waveforms yields qualitatively similar results (Figure 2C), indicating that our findings do not critically depend on the nonlinear tSNE embedding. However, the cluster separation corresponding to Pslow and Pfast was not as strong and hence clustering was not as reproducible as with tSNE. Consequently, we chose to use tSNE embedding for all analyses.
Waveform clustering
To identify groups of similar pulses, we clustered the waveforms in two steps:
The 2D distribution of data was partitioned using the watershed algorithm, which cuts a distribution along local minima. To that end we estimated the distribution of the tSNE-embedded pulses using kernel density estimation (MATLAB function mvksdensity), with the bandwidth chosen automatically by the algorithm (Figure 1C). This resulted in seven clusters (thin lines in Figure 1C and cluster waveforms in Figure 1D), some of which exhibited highly similar mean waveforms (Figure 1E) indicating that this first step overpartitioned the waveform space.
We therefore consolidated these watershed clusters using hierarchical clustering of the cluster centroids (MATLAB function clusterdata). For the unsupervised analysis of song waveforms (Figure 1D), the cluster tree revealed four principal waveform shapes corresponding to noise, sine song and the two pulse types. Cluster consolidation (a) joins three sine clusters, which is supported by them containing all similar waveforms (Figure 1E, right), or (b) joins two Pslow clusters but leaves the Pfast separate, which is again supported by the waveforms in the Pslow clusters being highly similar and those in the Pfast cluster being highly dissimilar to those of Pslow.
When clustering only the pulses detected by the song segmenter (Figure 2A), we chose a clustering cutoff that produced two modes.
Note that there exists a myriad of approaches for clustering data. Watershed plus consolidation using hierarchical clustering on the centroids produced the most robust clustering. Direct agglomerative clustering of the tSNE embedded data – without first using the watershed cluster step – also results in four clusters (not shown), but these clusters are not as “clean” – for instance, the Pfast mode extends into the noise cluster. Direct clustering on all waveforms yields unreliable results, likely because of the large variability of waveforms in this dataset (it includes noise). However, Gaussian Mixture Modeling of the full waveforms without dimensionality reduction also reliably clusters pulses into Pfast and Pslow (for details see below and Figure S2F).
The full analysis pipeline yielded near-identical results across different runs - individual pulses were almost always assigned to the same cluster with very high probability (adjusted Rand index ~0.97). Code illustrating all steps of the pipeline is available athttps://murthylab.github.io/pulseTypePipeline/.
Template-based pulse type classifier
Normalized pulses from the D. melanogaster wild-type strain NM91 were projected onto the centroids of the Pslow and Pfast clusters and a quadratic boundary separating both pulse types was obtained using quadratic discriminant analysis cross-validated using an 80:20 partition into training and test data (Figure S1C). The classifier performs well on the test data and generalizes well to different D. melanogaster strains (Figure S1D). It is therefore a fast and robust method for classifying pulses that does not require the computationally extensive pipeline outlined above. Code for building and evaluating the classifier is available at: https://github.com/murthylab/pulseTypeClassifier. The classifier has also been integrated into the song segmenter software (available at https://github.com/murthylab/songSegmenter).
Gaussian Mixture Modeling
As an alternative to the above methods, we clustered pulses using a Gaussian Mixture Model (GMM). GMMs fit the data as a sum of Gaussian distributions with one Gaussian per cluster in the data. For fitting, we used MATLAB’s “fitgmdist” function, with regularization set to 0.01 to ensure convergence of the algorithm. To speed up fitting, we used random subsets of 3551 out of the 71029 pulses (1/20th) from the wild-type strain NM91. To determine the optimal number of clusters, we fit GMMs to 20 different random subsets each with 1-6 clusters. The model likelihood is prone to overfitting, since adding more parameters to a model will almost always improve the model fit for instance if the true data distribution is not perfectly Gaussian. We therefore used Akaike’s Information Criterion (AIC) [50] as a measure of error adjusted for the number of parameters (Figure S2F, right panel). When fitted using the full waveforms (all 251 points per pulse), a fit with 2 clusters reproduced the partition into Pfast and Pslow obtained with our standard procedure with >97% accuracy for all subsets (centroids shown in Figure S2F, left panel). Model likelihood and AIC were low for the GMMs fit to the full waveforms since it included many data points at the beginning and end of the pulse waveforms that only contributed noise to the data. We therefore used only time points with sufficient signal by selecting those whose variance across the full dataset exceeded 2−8 roughly corresponding to the 40 central time points of each pulse. Note that the results obtained were robust to the specific threshold chosen for inclusion of time points. Using this method, the AIC exhibited a minimum at 2 components for all 20 random subsets of the data indicating that the data is most parsimoniously represented using two pulse types.
Female song responses
For each pair of flies, we split the annotated song and the tracking data into 30 s time windows with 15 s overlap and computed the number of Pfast and Pslow pulses, the fraction of the time window occupied by sine song, the average female speed and the average male-female distance. Time windows that were occupied by song for <5% or >95% of the time were excluded from the analysis. Song features and speed values were z-scored for each pair to correct for individual differences in overall speed or singing. We then computed the rank correlation between each song feature and female speed either across all distances or binned by distance (2-10 mm, bin width 1 mm).
Effect of disrupted pulse choice on mating success
To test the effect of disrupting song mode choice on the male’s mating success, we paired genetically manipulated males with wild-type females of the strain NM91 and scored the times to copulation up to 30 min. UAS-Kir controls did not copulate and we hence did not use this effector to assess male copulation success. For statistics, we performed a two-sided binomial test on the fraction of pairs that had copulated at the end of the scoring window.
Characterization of pulse shapes
Song pulses are transient signals and their raw magnitude spectra are relatively broad (Figures 2B and S4E). Accordingly, the peak frequency of the spectrum is an unreliable measure of pulse carrier frequency. We hence used the center of mass of the magnitude spectra thresholded at e−1 = 0.37 as a measure of frequency. The values obtained tended to closely resemble 1) the spectral peak frequencies in cases of sufficiently peaked spectra (e.g., for most instances of Pslow) and 2) the carrier frequency values extracted from fits of Gabor functions to the pulse waveforms (not shown), demonstrating that the center of mass faithfully extracts the pulse carrier frequency.
A pulse symmetry index was calculated as the dot product of the first half of the pulse and the flipped second half of the pulse. Positive indices correspond to even symmetry and negative indices to odd symmetry. Pulse amplitudes were normalized for the gain of individual microphones as described in [11].
Identification of sensory cues driving pulse choice
The Generalized Linear Model analysis for identifying the song features driving the choice between Pslow and Pfast was performed as in [11].
Amplitude modulation plots
We evaluated the relation between pulse amplitude and distance to the pulse at a delay of 470ms since the distance at this delay is most predictive of pulse amplitude [11].
Pulse amplitude was taken as the range (peak to trough) of the recorded waveform, corrected for position in the chamber since actual and recorded pulse amplitude can differ due to differences in the gain between the microphones or due to the male being on or off the microphone. For correction, a position-dependent gain map of the recording chamber was estimated from all recorded pulses based on the assumption that the average amplitude of the pulse produced by the male should be the same throughout the chamber (see [11] for details). Distance was binned between 1.75 and 10.25 mm in 0.5mm steps and the mean pulse amplitude (and its standard error) were calculated for each bin. The first three pulses of each train were excluded from the analysis since flies do not modulate pulse amplitude at pulse train start [11]. Distance bins with <50 pulses across all flies were excluded before calculating r2 values.
Analysis of optogenetic activation data
The song probability traces (Figures 4B and S4A) were constructed by calculating the fraction of trials (for each time bin, pooled across all flies) during which a male produced sine, Pslow or Pfast. Traces do not add up to 1.0 since there are trials during which males did not sing any song. Song probability traces and male speed were down sampled to 400Hz and smoothed with a sliding Gaussian kernel lasting 150ms with standard deviation σ = 64ms.
DATA AND SOFTWARE AVAILABILITY
The song segmenter with pulse classifier is available at https://github.com/murthylab/songSegmenter. The stand-alone pulse classifier is available at https://github.com/murthylab/pulseTypeClassifier. The script illustrating the analysis pipeline is available at https://murthylab.github.io/pulseTypePipeline/. Raw data are available upon reasonable request from the Lead Contact, Mala Murthy (mmurthy@princeton.edu). Pictograms of flies were modified from Benjamin de Bivort’s lab web page (http://debivort.org/docs/flyclipart.pdf)
Supplementary Material
KEY RESOURCES TABLE.
REAGENT or RESOURCE | SOURCE | IDENTIFIER |
---|---|---|
Chemicals, Peptides, and Recombinant Proteins | ||
all-trans retinal | Sigma-Aldrich | R2500 |
Sigmacote | Sigma-Aldrich | SL2 |
Deposited Data | ||
Manually annotated D. melanogaster pulses | [43] | http://www.pnas.org/content/114/8/1970/tab-figures-data#fig-data-additional-files |
Experimental Models: Organisms/Strains | ||
All fly strains including their complete genotypes, sources, and the figures they appear in are listed in a table in Experimental Model and Subject Details. | N/A | N/A |
Software and Algorithms | ||
MATLAB R2017a | MathWorks | https://mathworks.com/products/matlab.html |
tSNE code | [17, 44] | https://github.com/lvdmaaten/bhtsne |
Fly song segmenter with pulse classifier | [9], this paper | https://github.com/murthylab/songSegmenter |
Stand-alone pulse classifier | This paper | https://github.com/murthylab/pulseTypeClassifier |
Pulse classification pipeline | This paper | https://murthylab.github.io/pulseTypePipeline/ |
Figure | Genotypes |
---|---|
1, 2A, 2B S1A, S1C, S2E, S3A–S3D, and 5A–5C | D. melanogaster NM91 males (provided by Peter Andolfatto) courting pheromone insensitive and blind (PIBL) females (GMR-hid/GMR-hid; orco-/orco- [45, 46]). *, ** |
2E, 2F, and S3A–S3D | D. melanogaster NM91 males (provided by Peter Andolfatto) courting NM91 wild type females. |
S1B, S1D, 2C, 2H–2J, S2, 3A–3D, 3F, 3G, S3E–S3G, and 5A–5C | 8 D. melanogaster strains CM07, CarM03, N30, NM91, TZ58, ZH23, ZW109 (provided by Peter Andolfatto or the Drosophila species stock center), and Canton S (lab stock). Males courting PIBL females *,**. |
2D | P1-TrpA1: UAS>stop>TrpA1/+; fruFLP/NP2631 (fruFLP [42] provided by Barry Dickson;UAS>stop>TrpA1 was generated in [16]; NP2631 [16, 42] was obtained from the Kyoto stock center) |
3E | Canton S(WT1), GMR-hid/GMR-hid;+ (blind [46]), +; orco-/orco- (Pheromone Insensitive or PI [45]) in a Canton S background, NM91 (WT2), arista cut 20 hr prior to experiment (deaf) in an NM91 background, *,**. Males court PIBL females. |
2H–2J, S1B, S2A–S2D, and S2H | D. simulans (sim194, sim195, simNS05), D. mauritiana strains (mau317, mauR12, mauR61), and D. sechellia (sec137, sec138, secAQTNF25) (provided by P. Andolfatto). Males court females of the same strain. |
4B–4D and S4A–S4D | P1 (R71G01): w/+; GMR71G01-LexA/+; LexAop-CsChrimson/+ (GMR71G01-LexA [22] and LexAop-CsChrimson [30] were obtained from the Bloomington stock center, GMR71G01-LexA [22] was contributed to the Bloomington stock center by Gerry Rubin, and LexAop-csChrimson [30] by Vivek Jayaraman). P1a: UAS-CsChrimson/+; GMR15A01-AD (attp40)/BRP>stop>-V5-2A-LexA-VP16, LexAop-myr-tdtomato; GMR71G01-DBD (attp2)/LexAop-GCaMP6s (GMR15A01-AD (attp40); GMR71G01-DBD (attp2) [23] provided by David Anderson; UAS-CsChrimson [30] and LexAop-GCaMP6s [47] were obtained from the Bloomington stock center (UAS-CsChrimson [30] was contributed to the stock center by Vivek Jayaraman and LexAop-GCaMP6s [47] by Douglas Kim)). pIP10: w/+; UAS>stop>CsChrimson/+; VT40556, FruFLP/+ (VT40556 [16] and FruFLP [42] were provided by Barry Dickson; UAS>stop>CsChrimson [30] was provided by Vivek Jayaraman). vPR6: w/+; UAS>stop>CsChrimson/+; VT57239, FruFLP/+ (VT57239 [16, 42] was provided by Barry Dickson). |
4E and 5E | pIP10-control: +/+; fruFLP/VT40556. control-Kir: R53G02AD attP40/UAS-Kir2.1. control-TNT: R53G02AD attP40/UAS-TNT. pIP10-Kir: UAS>stop>Kir2.1/+; fruFLP/VT40556**. pIP10-TNT: UAS>stop>TNT/+; fruFLP/VT40556**. Males court PIBL females. (R53G02AD attP40 was obtained from the Bloomington stock center and contributed by Gerry Rubin; UAS>stop>Kir2.1 was provided by Troy Shirangi; Kir2.1 and TNT where described in [31] and [32], respectively). |
4F–4I, S4E, S4F, and 5F | ps1-TNT: R48F07-LexA, p65/LexAop2-FlpL; R73C03-GAL4/UAS>stop>TNT [14]. ps1-control: R48F07-LexA, p65/LexAop2-FlpL; R73C03-GAL4/UAS>stop>TNTinactive (R48F07-LexA, p65/LexAop2-FlpL; R73C03-GAL4 [14] was provided by Troy Shirangi; UAS>stop>TNT and UAS>stop>TNTinactive [16] were obtained from the Bloomington stock center and contributed by Barry Dickson). Males court NM91 wild type females. |
5G, 5H, S5A, and S5B | pIP10: w/+; UAS>stop>CsChrimson/+; VT40556, FruFLP/+. Males court NM91 wild type females. |
Highlights.
D. melanogaster males produce three song modes: one sine and two pulse modes
Males select between pulse modes based on distance to the female
Activity levels in song pathway neurons affect song mode choice
The three song modes differentially affect female behavior
ACKNOWLEDGMENTS
We thank Asif Ghazanfar, Joshua Shaevitz, and David Stern for comments on the manuscript and Georgia Guan for technical assistance. J.C. was supported by a postdoctoral fellowship through the Princeton Sloan-Swartz Center, F.A.R. was supported by the German Research Foundation (DFG Forschungsstipendium RO 5787/1-1), and M.M. was supported by an NSF BRAIN EAGER award, an NIH New Innovator Award (DP2 NS092378), a McKnight Scholar Award, and a Faculty Scholar Award from the Howard Hughes Medical Institute.
Footnotes
SUPPLEMENTAL INFORMATION
Supplemental Information includes five figures and can be found with this article online at https://doi.org/10.1016/j.cub.2018.06.011.
DECLARATION OF INTERESTS
The authors declare no competing interests.
REFERENCES
- 1.Wiltschko AB, Johnson MJ, Iurilli G, Peterson RE, Katon JM, Pashkovski SL, Abraira VE, Adams RP, and Datta SR (2015). Mapping sub-second structure in mouse behavior. Neuron 88, 1121–1135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Berman GJ, Choi DM, Bialek W, and Shaevitz JW (2014). Mapping the stereotyped behaviour of freely moving fruit flies. J. R. Soc. Interface 11,20140672. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Robie AA, Hirokawa J, Edwards AW, Umayam LA, Lee A, Phillips ML, Card GM, Korff W, Rubin GM, Simpson JH, et al. (2017). Mapping the neural substrates of behavior. Cell 170, 393–406. e28. [DOI] [PubMed] [Google Scholar]
- 4.Calhoun AJ, and Murthy M (2017). Quantifying behavior to solve sensorimotor transformations: advances from worms and flies. Curr. Opin. Neurobiol 46, 90–98. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Anderson DJ, and Perona P (2014). Toward a science of computational ethology. Neuron 84, 18–31. [DOI] [PubMed] [Google Scholar]
- 6.Krakauer JW, Ghazanfar AA, Gomez-Marin A, MacIver MA, and Poeppel D (2017). Neuroscience needs behavior: correcting a reductionist bias. Neuron 93, 480–490. [DOI] [PubMed] [Google Scholar]
- 7.Bennet-Clark HC, and Ewing AW (1967). Stimuli provided by courtship of male Drosophila melanogaster. Nature 215, 669–671. [Google Scholar]
- 8.Bennet-Clark HC, Dow M, Manning A, and von Schilcher F (1976). Courtship stimuli in Drosophila melanogaster. Behav. Genet 6, 93, 95. [DOI] [PubMed] [Google Scholar]
- 9.Arthur BJ, Sunayama-Morita T, Coen P, Murthy M, and Stern DL (2013). Multi-channel acoustic recording and automated analysis of Drosophila courtship songs. BMC Biol 11, 11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Coen P, Clemens J, Weinstein AJ, Pacheco DA, Deng Y, and Murthy M (2014). Dynamic sensory cues shape song structure in Drosophila. Nature 507, 233–237. [DOI] [PubMed] [Google Scholar]
- 11.Coen P, Xie M, Clemens J, and Murthy M (2016). Sensorimotor transformations underlying variability in song intensity during Drosophila courtship. Neuron 89, 629–644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Clemens J, Girardin CC, Coen P, Guan XJ, Dickson BJ, and Murthy M (2015). Connecting neural codes with behavior in the auditory system of Drosophila. Neuron 87, 1332–1343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ding Y, Berrocal A, Morita T, Longden KD, and Stern DL (2016). Natural courtship song variation caused by an intronic retroelement in an ion channel gene. Nature 536, 329–332. [DOI] [PubMed] [Google Scholar]
- 14.Shirangi TR, Stern DL, and Truman JW (2013). Motor control of Drosophila courtship song. Cell Rep 5, 678–686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Riabinina O, Dai M, Duke T, and Albert JT (2011). Active process mediates species-specific tuning of Drosophila ears. Curr. Biol 21,658–664. [DOI] [PubMed] [Google Scholar]
- 16.von Philipsborn AC, Liu T, Yu JY, Masser C, Bidaye SS, and Dickson BJ (2011). Neuronal control of Drosophila courtship song. Neuron 69, 509–522. [DOI] [PubMed] [Google Scholar]
- 17.van der Maaten L, and Hinton G (2008). Visualizing data using t-SNE. J. Mach. Learn. Res 9, 2579–2605. [Google Scholar]
- 18.David JR, Lemeunier F, Tsacas L, and Yassin A (2007). The historical discovery of the nine species in the Drosophila melanogaster species subgroup. Genetics 177, 1969–1973. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Cobb M, Burnet B, Blizard R, and Jallon J-M (1989). Courtship in Drosophila sechellia: its structure, functional aspects, and relationship to those of other members of the Drosophila melanogaster species subgroup. J. Insect Behav 2, 63–89. [Google Scholar]
- 20.Tomaru M, and Oguma Y (2000). Mate choice in Drosophila melanogaster and D. sechellia: criteria and their variation depending on courtship song. Anim. Behav 60, 797–804. [DOI] [PubMed] [Google Scholar]
- 21.Mineault PJ, Barthelmé S, and Pack CC (2009). Improved classification images with sparse priors in a smooth basis. J. Vis 9, 1–24. [DOI] [PubMed] [Google Scholar]
- 22.Pan Y, Meissner GW, and Baker BS (2012). Joint control of Drosophila male courtship behavior by motion cues and activation of male-specific P1 neurons. Proc. Natl. Acad. Sci. USA 109, 10065–10070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Hoopfer ED, Jung Y, Inagaki HK, Rubin GM, and Anderson DJ (2015). P1 interneurons promote a persistent internal state that enhances inter-male aggression in Drosophila. eLife 4, e11346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kimura K, Hachiya T, Koganezawa M, Tazawa T, and Yamamoto D (2008). Fruitless and doublesex coordinate to generate male-specific neurons that can initiate courtship. Neuron 59, 759–769. [DOI] [PubMed] [Google Scholar]
- 25.Kohatsu S, Koganezawa M, and Yamamoto D (2011). Female contact activates male-specific interneurons that trigger stereotypic courtship behavior in Drosophila. Neuron 69,498–508. [DOI] [PubMed] [Google Scholar]
- 26.Clowney EJ, Iguchi S, Bussell JJ, Scheer E, and Ruta V (2015). Multimodal chemosensory circuits controlling male courtship in Drosophila. Neuron 87, 1036–1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Kohatsu S, and Yamamoto D (2015). Visually induced initiation of Drosophila innate courtship-like following pursuit is mediated by central excitatory state. Nat. Commun 6, 6457. [DOI] [PubMed] [Google Scholar]
- 28.Kallman BR, Kim H, and Scott K (2015). Excitation and inhibition onto central courtship neurons biases Drosophila mate choice. eLife 4, e11188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Zhang SX, Rogulja D, and Crickmore MA (2016). Dopaminergic circuitry underlying mating drive. Neuron 91, 168–181. [DOI] [PubMed] [Google Scholar]
- 30.Klapoetke NC, Murata Y, Kim SS, Pulver SR, Birdsey-Benson A, cho YK, Morimoto TK, Chuong AS, Carpenter EJ, Tian Z, et al. (2014). Independent optical excitation of distinct neural populations. Nat. Methods 11, 338–346 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Baines RA, Uhler JP, Thompson A, Sweeney ST, and Bate M (2001) Altered electrical properties in drosophila neurons developing without synaptic transmission. J. Neurosci 21, 1523–1531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Sweeney ST, Broadie K, Keane J, Niemann H, and O’Kane CJ (1995). Targeted expression of tetanus toxin light chain in Drosophila specifically eliminates synaptic transmission and causes behavioral defects. Neuron 14,341–351. [DOI] [PubMed] [Google Scholar]
- 33.Klibaite U, Berman GJ, Cande J, Stern DL, and Shaevitz JW (2017). An unsupervised method for quantifying the behavior of paired animals. Phys. Biol 14, 015006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Pereira T, Aldarondo D, Willmore L, Kislin M, Wang SS, Murthy M, and Shaevitz JW (2018). Fast animal pose estimation using deep neural networks. bioRxiv. 10.1101/331181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Shirangi TR, Wong AM, Truman JW, and Stern DL (2016). Doublesex regulates the connectivity of a neural circuit controlling Drosophila male courtship song. Dev. Cell 37, 533–544. [DOI] [PubMed] [Google Scholar]
- 36.Lindsay T, Sustar A, and Dickinson M (2017). The function and organization of the motor system controlling flight maneuvers in flies. Curr. Biol 27, 345–358. [DOI] [PubMed] [Google Scholar]
- 37.Clemens J, Deutsch D, Thiberge SY, and Murthy M (2018). Shared song object detector neurons in Drosophila male and female brains drive divergent, sex-specific behaviors. bioRxiv 10.1101/366765. [DOI] [Google Scholar]
- 38.Tomaru M, and Yamada H (2011). Courtship of Drosophila, with a special interest in courtship songs. Low Temperature Science 69, 61–85. [Google Scholar]
- 39.Cowling DE, and Burnet B (1981). Courtship songs and genetic control of their acoustic characteristics in sibling species of the Drosophila melanogaster subgroup. Anim. Behav 29, 924–935. [Google Scholar]
- 40.Demetriades MC, Thackeray JR, and Kyriacou CP (1999). Courtship song rhythms in Drosophila yakuba. Anim. Behav 57, 379–386. [DOI] [PubMed] [Google Scholar]
- 41.Ding Y, Lillvis JL, Cande J, Berman GJ, Arthur BJ, Xu M, Dickson BJ, and Stern DL (2017). Neural changes underlying rapid fly song evolution. bioRxiv 10.1101/238147. [DOI] [PubMed] [Google Scholar]
- 42.Yu JY, Kanai MI, Demir E, Jefferis GSXE, and Dickson BJ (2010). Cellular organization of the neural circuit that drives Drosophila courtship behavior. Curr. Biol 20, 1602–1614. [DOI] [PubMed] [Google Scholar]
- 43.Kyriacou CP, Green EW, Piffer A, and Dowse HB (2017). Failure to reproduce period-dependent song cycles in Drosophila is due to poor automated pulse-detection and low-intensity courtship. Proc. Natl. Acad. Sci. USA 114, 1970–1975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.van der Maaten L (2014). Accelerating t-SNE using tree-based algorithms. J. Mach. Learn. Res 15, 3221–3245. [Google Scholar]
- 45.Larsson MC, Domingos AI, Jones WD, Chiappe ME, Amrein H, and Vosshall LB (2004). Or83b encodes a broadly expressed odorant receptor essential for Drosophila olfaction. Neuron 43, 703–714. [DOI] [PubMed] [Google Scholar]
- 46.Grether ME, Abrams JM, Agapite J, White K, and Steller H (1995). The head involution defective gene of Drosophila melanogaster functions in programmed cell death. Genes Dev 9, 1694–1708. [DOI] [PubMed] [Google Scholar]
- 47.Chen TW, Wardill TJ, Sun Y, Pulver SR, Renninger SL, Baohan A, Schreiter ER, Kerr RA, Orger MB, Jayaraman V, et al. (2013). Ultrasensitive fluorescent proteins for imaging neuronal activity. Nature 499, 295–300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Deng Y, Coen P, Sun M, and Shaevitz JW (2013). Efficient multiple object tracking using mutually repulsive active membranes. PLoS ONE 8, e65769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Stern DL, Clemens J, Coen P, Calhoun AJ, Hogenesch JB, Arthur BJ, and Murthy M (2017). Experimental and statistical reevaluation provides no evidence for Drosophila courtship song rhythms. Proc. Natl. Acad. Sci. USA 114, 9978–9983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Akaike H (1974). A new look at the statistical model identification. IEEE Trans. Automat. Contr 19, 716–723 [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The song segmenter with pulse classifier is available at https://github.com/murthylab/songSegmenter. The stand-alone pulse classifier is available at https://github.com/murthylab/pulseTypeClassifier. The script illustrating the analysis pipeline is available at https://murthylab.github.io/pulseTypePipeline/. Raw data are available upon reasonable request from the Lead Contact, Mala Murthy (mmurthy@princeton.edu). Pictograms of flies were modified from Benjamin de Bivort’s lab web page (http://debivort.org/docs/flyclipart.pdf)