Abstract
Some combinations of musical notes sound pleasing and are termed “consonant,” but others sound unpleasant and are termed “dissonant.” The distinction between consonance and dissonance plays a central role in Western music, and its origins have posed one of the oldest and most debated problems in perception. In modern times, dissonance has been widely believed to be the product of “beating”: interference between frequency components in the cochlea that has been believed to be more pronounced in dissonant than consonant sounds. However, harmonic frequency relations, a higher-order sound attribute closely related to pitch perception, has also been proposed to account for consonance. To tease apart theories of musical consonance, we tested sound preferences in individuals with congenital amusia, a neurogenetic disorder characterized by abnormal pitch perception. We assessed amusics’ preferences for musical chords as well as for the isolated acoustic properties of beating and harmonicity. In contrast to control subjects, amusic listeners showed no preference for consonance, rating the pleasantness of consonant chords no higher than that of dissonant chords. Amusics also failed to exhibit the normally observed preference for harmonic over inharmonic tones, nor could they discriminate such tones from each other. Despite these abnormalities, amusics exhibited normal preferences and discrimination for stimuli with and without beating. This dissociation indicates that, contrary to classic theories, beating is unlikely to underlie consonance. Our results instead suggest the need to integrate harmonicity as a foundation of music preferences, and illustrate how amusia may be used to investigate normal auditory function.
Keywords: music perception, aesthetic preferences, roughness
Music is made by combining notes, but not all combinations are comparable aesthetically. When notes are produced simultaneously, some combinations sound pleasing and are termed “consonant,” and others sound unpleasant or out of place, and are termed “dissonant” (1–3). This contrast forms one of the key ingredients of music composition, in which dissonant chords are used to create feelings of tension that are later released by consonant chords. Although their perception is in some cases dependent on context (4), marked differences between consonance and dissonance are typically apparent even in isolated chords (5–8). Preferences for consonance appear to be present in human infants (9–12) and perhaps also in other species with little exposure to music (13, 14, although see also ref. 15), consistent with a biological basis. However, the possibility also remains that consonance preferences are learned—such preferences are correlated with musical experience in Western listeners (8), and their existence in non-Western cultures is controversial (16–18). Consonance has fascinated scholars since the time of the Greeks, in part because consonant and dissonant chords can be viewed as atomic aesthetic elements, the investigation of which provides a point of entry to understanding the roles of biology and culture in music perception.
Greek scholars believed that consonance was related to the ratio between the length of the strings on which notes were sounded, inspired by ratio-based theories of aesthetics that applied to domains ranging from music to architecture. However, contemporary thinking on consonance is instead rooted in acoustics, beginning with the fact that musical instrument and voice sounds are composed of multiple discrete frequencies. These frequencies are termed “harmonics” because they are typically integer multiples of the fundamental frequency (F0, inverse of the period) of the sound. Harmonics are combined in a single waveform when traveling in the air but are partly segregated by the cochlea, because different auditory nerve fibers respond to different frequencies (19).
When several notes are combined to form a chord, the resulting sound waveform that enters the ear contains all of the individual frequencies of each note. Auditory scientists have long noted that aspects of the pattern of component frequencies differ between consonant and dissonant chords. Prevailing theories ascribe consonance to the fact that dissonant chords contain frequency components that are too closely spaced to be resolved by the cochlea (5, 6, 20–22). Two such components shift in and out of phase over time, producing an interaction that oscillates between constructive and destructive interference. The amplitude of the combined physical waveform thus alternately waxes and wanes. If the components are close enough to excite the same set of auditory fibers, amplitude modulations are directly observable in the response of the auditory nerve (23). These amplitude modulations are called “beats,” and result in an unpleasant sensation known as “roughness,” analogous to the tactile roughness felt when touching a corrugated surface [in practice, the perception of roughness is dependent on the depth and rate of amplitude modulation, as well as the center frequency of the tones involved (24)]. Beats are mostly inaudible when two pure tones are presented independently to separate ears, presumably because they do not interact at the level of the cochlea [although “binaural beats” can be audible for very small frequency differences (25)]. Helmholtz (20) is usually credited with the idea that dissonant chords are unpleasant because they contain the sensation of roughness, a notion that was fleshed out by ensuing generations of psychoacousticians (5, 6, 21, 22, 26). Theories of dissonance based on beating have been dominant in the last century and are now a regular presence in textbooks (1–3).
However, a second acoustic property also differentiates consonance and dissonance: the component frequencies of the notes of consonant chords combine to produce an aggregate spectrum that is typically harmonic, resembling the spectrum of a single sound with a lower pitch (Fig. 1B). In contrast, dissonant chords produce an inharmonic spectrum. Such observations led to a series of analyses and models of consonance based on harmonicity (23, 27–29). Although beating-based theories are widely accepted as the standard account of consonance, harmonicity has remained a plausible alternative.
Fig. 1.
Example chords: the perfect fifth (Left; conventionally consonant), and the minor second (Right; conventionally dissonant) composed of saxophone notes. (A) Musical notation. (B) Amplitude spectra. Spectrum of lower (root) note is shown in blue, that of higher note in red. Blue and red dots denote the frequencies belonging to each note. In the consonant perfect fifth, the two notes share common frequencies and produce an aggregate spectrum that is harmonic (corresponding to a harmonic series with a lower F0, indicated by the vertical bars at the top). In the dissonant minor second, the notes share no harmonics and the overall spectrum is inharmonic. (C) Predicted activation of auditory nerve fibers (55) for each chord. The beating produced by the closely spaced frequencies of the minor second is visible in the low frequency channels of the right-hand plot.
One reason these candidate explanations of consonance have proven difficult to test is that they share qualitative predictions, because dissonant chords both contain inharmonic frequencies and produce beats. However, it is possible to produce synthetic sounds that isolate the two acoustic properties. A recent study (8) used such stimuli to examine individual differences in acoustic preferences within large cohorts of subjects. It was found that the degree of preference for consonance in individual subjects correlated with the degree of preference for harmonicity but not with the degree of aversion to beating. These results support a theory of consonance based on harmonicity but are limited by the restricted variance in behavioral responses obtained with populations of normal listeners. In particular, nearly all normal listeners possess some degree of preference for consonance and aversion to beating, leaving open the possibility that beating contributes in an important way to their perception of consonance.
Here, we adopted a neuropsychological approach to investigate the basis of consonance by testing individuals with congenital amusia. Congenital amusia is a neurogenetic disorder that is characterized by a deficit in melody processing that cannot be explained by hearing loss or intellectual deficiencies (30). Congenital amusia is hereditary (31, 32) and is associated with reduced connectivity between the right auditory and inferior frontal cortices (33–35). The root functional cause of the musical impairments appears to lie in the processing of pitch. Amusic individuals are impaired at detecting the direction of pitch changes between notes (36, 37) as well as pitch deviations (irrespective of direction) that are small (e.g., less than two semitones) (38, 39). As pitch changes in typical music are often below their abnormally high thresholds (40), amusics are bound to miss an essential part of musical structure.
The aim of the present study was to exploit the perceptual deficits observed in congenital amusia to dissect the basis of consonance. There is one previous demonstration that amusics do not show the normally observed preference for consonant over dissonant music (30). In that study, amusics were presented with classical musical excerpts containing melodies and accompaniment on the piano. The amusics were asked to rate the pleasantness of the original excerpts as well as manipulated versions in which the accompaniment was pitch-shifted to introduce dissonance. Amusics gave similar ratings to the two versions, suggesting deafness to dissonance. We sought to understand the basis of this phenomenon. We first assessed the perception of consonant and dissonant chords in 10 amusic and 10 matched control subjects. Synthetic stimuli were then used to investigate the perception of harmonicity and beating.
Results
Consonant and Dissonant Chords.
In the first experiment, participants were asked to rate the pleasantness of chords (intervals or triads composed of notes sung by a vocalist or played on a saxophone) on a nine-step scale. The chords included the unison, the intervals spanning the minor second (1 semitone) to the major seventh (11 semitones), and the four most common triads in Western music. Fig. 2 shows the pleasantness ratings of the two subject groups. Control subjects’ ratings were consistent with previous results (8), with conventionally dissonant chords such as the minor second or major seventh rated as unpleasant, and conventionally consonant chords such as the major third or major triad rated as pleasant. In contrast, amusics’ ratings varied across subjects, generally with little resemblance to the pattern of results in normal listeners. On average, amusics gave similar ratings to all chords, with consonant chords rated no more highly than dissonant chords.
Fig. 2.
Mean (black) and individual (gray) pleasantness ratings of chords by the control (A) and amusic (B) groups. Ratings of each subject were z-scored before analysis to remove coarse differences in the use of the rating scale, as in ref. 8. Intervals contained two notes separated by an integer number of semitones; triads (major, minor, diminished, and augmented) contained three notes. Here and elsewhere, error bars denote one SE.
To assess agreement among raters in the two populations, we estimated the null distribution of Kendall’s W for ratings of 16 conditions by 10 subjects. Ninety-five percent of 10,000 W values obtained from randomly drawn ratings were situated below 0.164. Control subjects’ ratings yielded a W of 0.68, showing far more consistency than would be expected by chance. In contrast, amusics’ ratings were inconsistent (W = 0.08), no different from what would be expected by chance.
We note that the ratings obtained from controls, despite generally resembling normal ratings, show less-pronounced rating variation for the medium-sized intervals, particularly the tritone, than what was reported in ref. 8, for example, with identical stimuli. This result might reflect two characteristics of our control population (matched to the demographics of the amusic group as closely as possible) that are known to weaken the strength of consonance preferences: some subjects showed mild to moderate hearing loss (41), and all subjects had very little, if any musical training (8). Overall, however, the results of the control group are largely as expected given prior work.
To test whether any amusic showed a pattern of results close to that of the controls, we computed correlations between each individual participant’s ratings of the 16 chord types and the mean ratings of the control group. As shown in Fig. 3A, the correlations between the amusics’ data and the average ratings of the control group were small, never exceeding that expected by chance, whereas those of each control subject (correlated with the mean of the ratings obtained from the nine other controls) were statistically significant.
Fig. 3.
(A) Correlations between individual ratings of chords and mean chord ratings of the control group. Null distribution is drawn from correlations between random permutations of individual control subject’s data and the mean of the control group. Vertical dashed lines here and in B denote 95% confidence intervals around zero. (B) Test-retest correlations between ratings averaged over first and second halves of the experiment for each subject. Null distribution is drawn from correlations between the first half and a randomly permuted second half of randomly selected individual control subject’s data.
To test whether the participants exhibited internally consistent rating patterns, we measured test-retest reliability for individual subjects, using correlations between ratings averaged over the first and second half of the trials for each chord type. As shown in Fig. 3B, all control subjects gave consistent ratings, whereas only two of the amusics did. These two subjects’ ratings seem to be based on idiosyncratic pitch height preferences, rather than on consonance (one subject preferred high-pitched tones, and the other preferred low-pitched tones, yielding preferences that covaried with interval size). The amusics thus never resembled control subjects with respect to consonance judgments, and were inconsistent in their ratings, indicating that consonance and dissonance as heard by normal listeners played little role in their perception of chords.
Pleasantness of Emotional Vocal Stimuli.
To ensure that the group differences in chord ratings were not a result of general differences in the ability to ascribe pleasantness ratings to sounds, we measured pleasantness ratings of emotional vocalizations [Montreal Affective Voices (42)]. The stimuli were recordings of nonverbal vocal sounds made by humans to express sadness, fear, surprise, or happiness (cries, gasps, laughing, etc.). In contrast to the results with chords, the two groups showed nearly identical rating patterns for vocalizations (Fig. 4), rating happy vocalizations the highest and sad and fearful vocalizations the lowest. This result indicates that the groups made similar use of the rating scale, and that amusics do not differ generally from controls in the way they assign pleasantness ratings to sounds.
Fig. 4.
Mean (black) and individual (gray) pleasantness ratings of emotional vocalizations by control (A) and amusic (B) groups. Ratings of each subject were z-scored. Error bars are not visible as they were smaller than the symbol size.
Harmonicity and Beating.
In a third experiment, we investigated preferences for harmonicity and beating using stimuli taken from a previous study in normal listeners (8). Preference measures were derived from differences of ratings between stimuli that either did or did not contain these acoustic factors (harmonic and inharmonic, nonbeating and beating), but that were matched in other respects (number of frequencies, amplitude, etc.).
Harmonicity was isolated by comparing ratings of harmonic complex tones (“harmonic-subset” and “octave-spaced”) with those of inharmonic complex tones generated by subtly perturbing the frequencies of the harmonic tones (Fig. 5A: Jittered, Shifted, and 13st-spaced). All tones contained widely spaced frequency components to avoid beating. The difference between the average ratings of harmonic and inharmonic tones provided “measure 1” of harmonicity preference (Fig. 5C). We also computed a second measure of harmonicity preference from the difference of the ratings for pure tones (the simplest harmonic stimulus) and dichotically presented tone pairs (which were inharmonic by virtue of being closely spaced, but produced little perceptible beating because of the absence of interactions at the level of the cochlea) (8); this was termed “measure 2” of harmonicity preference (Fig. 5C). Beating was isolated using the fact that dichotic presentation of two tones (one tone to the left ear and the other to the right ear, via headphones) prevents their interaction at the level of the cochlea, and thus mostly eliminates the perception of beating (25). We contrasted ratings of pairs of closely spaced pure tones presented either diotically (both tones to both ears, producing substantial beating) or dichotically (minimizing beating) (Fig. 5B). For comparison of these harmonicity and beating preferences with consonance preferences, we computed analogous measures for the preference for consonant dyads (e.g., perfect fifth) over dissonant dyads (e.g., minor second) and for consonant triads (major triad) over dissonant triads (augmented triad) using the ratings from Exp. 1.
Fig. 5.
(A) Schematic spectra of harmonicity test stimuli (Exps. 3 and 4): Thick and thin lines depict spectra of harmonic and inharmonic tones, respectively. See Materials and Methods for details. (B) Beating stimuli (Exps. 3 and 5) were made of two pure tones close in frequency (0.75 or 1.5 semitones apart). They were presented diotically (both tones to both ears), resulting in the perception of beats, or dichotically (one tone to each ear), producing a substantial decrease in perceived beating. (C) Mean and individual preferences for the two groups of listeners. Preference measures, as in ref. 8, were differences in the average ratings for two groups of sounds. Consonance (dyads): mean rating (MR) for the four most consonant dyads minus MR for the four most dissonant dyads; Consonance (triads): MR for the major triad minus MR for the augmented triad; Harmonicity1: MR for the harmonic complexes minus MR for the inharmonic complexes; Harmonicity2: MR for the pure tones minus MR for the dichotic pairs of pure tones; Beating: MR for the dichotic pairs of pure tones minus MR for the diotic pairs of pure tones.
Control subjects showed preferences for consonance, harmonicity, and the absence of beats, as expected (Fig. 5C). In contrast, amusic listeners failed to show preferences for consonance or harmonicity but exhibited an aversion to beating comparable to that of controls. The two populations’ preferences were significantly different for both consonance and both harmonicity measures (P < 0.001 in all cases; t tests) but not for beating (P = 0.71).
Harmonicity and Beating Discrimination.
In the fourth experiment, we assessed whether the lack of preference for harmonicity in amusics was because of an inability to discriminate harmonic from inharmonic frequency relations. On each trial, subjects were presented with two harmonic tones and one of its inharmonic counterparts (generated by jittering or shifting the spectrum of the harmonic tones). The three stimuli were generated with distinct F0s, such that the discrimination had to be made on the basis of the frequency relations within the tones rather than any individual frequency. Subjects were asked to report which of the three stimuli sounded as if played by a different instrument. Whereas control subjects scored high on this task, amusics performed poorly (P < 0.00001; t test), suggesting that harmonic and inharmonic spectra are poorly distinguished by their auditory system (Fig. 6).
Fig. 6.
Mean and individual performance for harmonicity (Exp. 4) and beating (Exp. 5) discrimination tasks. Horizontal dashed line represents chance level in this 3-AFC task: 33.33%.
For comparison, we also tested discrimination between our beating and nonbeating stimuli (Exp. 5). On each trial, subjects were presented with two dichotic (nonbeating) and one diotic (beating) sounds, again with distinct F0s. Subjects were again asked to report which of the three stimuli sounded as if played by a different instrument. Performance was high in both groups (Fig. 6) and did not differ significantly between groups (P = 0.53; t test), suggesting normal perception of beating in amusics despite abnormal harmonicity perception.
Discussion
We tested the perception of dissonance, harmonicity, and beating in amusic listeners. The pleasantness ratings for chords showed that amusics have highly abnormal perception of consonance and dissonance. Whereas control participants preferred consonance to dissonance, amusic participants were indifferent to the distinction. Amusic listeners’ perception of harmonicity was also impaired. Unlike controls, amusics did not prefer harmonic to inharmonic tones and could not discriminate them from each other. In contrast to these marked deviations from normal perception, amusics’ judgments of beating were indistinguishable from those of normal listeners; every amusic participant preferred stimuli without beating over those with beats, to a degree comparable to that of control subjects, and every amusic could distinguish sounds with and without beating.
Implications for the Basis of Consonance and Dissonance.
The unique contribution of this study to the understanding of consonance was to use congenital amusia to demonstrate a dissociation between the perception of beating and the perception of dissonance. Amusics exhibit the normal aversion to beating, but lack the normal preference for consonance over dissonance, suggesting that consonance preferences do not derive from an aversion to beating. Rather, our results suggest that the quality of roughness (induced by beating) constitutes an aesthetic dimension that is distinct from that of dissonance. This conclusion is consistent with earlier work in normal listeners, suggesting that harmonicity is more closely related to consonance than is beating (8), but the present study provides the unique demonstration of a clear-cut dissociation, something not possible in normal listeners. Amusia can thus provide a tool for studying the underpinnings of music perception.
Given the prevalence of beating-based accounts of dissonance (5, 6, 21), it is natural to ask why amusics’ aversion to beating is not sufficient to produce preferences for consonance over dissonance. We believe the explanation is that for naturally occurring musical chords, beating is not in fact closely associated with dissonance. Naturally occurring tones produced by instruments or voices generally have spectra whose component amplitudes decrease with increasing frequency. As a result, roughness in naturally occurring chords originates primarily from the interaction of the first few frequency components of the notes. In contrast, classic studies of dissonance (e.g., ref. 5) relied on artificial synthetic tones with equal amplitude components, in which beating can be produced by all components in the spectrum, yielding artificially strong and consistent beating. Moreover, roughness is influenced by factors such as the amplitude (43) or the phase (44) of frequency components, both of which can vary greatly from one instrument to another, or with the way keys or strings are sounded on a given instrument. Consistent with these ideas, we have found that for real-world musical notes, as were used in our experiments, beating is not consistently more pronounced in dissonant chords than in consonant chords, that it often varies more with timbre than with chord type, and that overall amounts of beating in real-world chords are often far below that in the synthetic chords used in classic psychoacoustic studies (See Figs. S1–S3 and SI Text). It may be that beating is in practice sufficiently low or variable that it has little effect on the average aesthetic response to a chord. Alternatively, the inconsistency of beating across instances of a chord may cause listeners to implicitly learn to ignore it in musical contexts, where it might otherwise cause consonance judgments to fluctuate across instruments.
Our results are consistent with the alternative account of consonance in terms of harmonicity (8, 23, 27–29), in that amusics were consistently abnormal in their perception of both consonance and harmonicity. Unlike the perception of beating, harmonicity is not strongly dependent on frequency amplitudes or phases (45). This relative invariance might explain the similarity in consonance judgments for different instrument timbres (8). Models measuring harmonicity with periodicity detection mechanisms (29) can predict the degree of consonance of musical intervals, and harmonicity preferences in normal listeners have been shown to be correlated with their consonance preferences (8), suggesting a link (see also SI Text and Fig. S4 for correlational analyses of the present data). Differential auditory brainstem responses to consonant and dissonant intervals (23, 46) are also consistent with a role for harmonicity in consonance perception.
The origins of the aesthetic response to harmonicity remain to be determined. One might envision that humans are born with preferences for harmonicity, or that they derive from exposure to natural sounds because of the prevalence of harmonic spectra in mammalian vocalizations (47), or that they derive from exposure to music (8). Our results leave each of these possibilities open, but indicate that the nature/nurture debate surrounding consonance should be oriented toward harmonicity rather than beating/roughness.
Dissonance, Pitch, and Amusia.
The pitch- and rhythm-related musical abilities of congenital amusic individuals have been extensively studied over the last decade (see ref. 48 for a review). Whereas only some amusics show impaired perception of rhythm, deficits in pitch perception appear to be universally present and have thus been considered the hallmark of amusia (36, 38). Recently, however, it was reported that amusic listeners also have a deficit in memory for pitch (49–51). This memory impairment could potentially play a role in all previously reported amusic deficits, because threshold measurements or melody comparisons require participants to compare sounds presented sequentially, and thus to store pitch-related information over time. The deficits evident in the present study are less vulnerable to this possibility because pleasantness ratings do not require sounds to be held in memory. Our results thus suggest that amusia cannot be solely caused by memory impairments and indeed reflects defective encoding of pitch information.
The deficit we observed in the perception of harmonicity in amusic listeners further suggests that the root of congenital amusia may lie in the inadequate representation of this basic acoustic property of pitched sounds. A harmonicity-based deficit would likely extend outside of music. Amusic deficits in prosody perception remain controversial (30, 37), but our results suggest other domains where amusics and normal listeners may differ. Voice attractiveness, for instance, is positively correlated with the degree of harmonicity (52). Harmonicity is also an important cue for concurrent sound segregation (53). Assessing amusic individuals with tasks involving such nonmusical harmonicity-dependent abilities is an intriguing direction for future research. In particular, our results raise the possibility that amusia could be used to test the role of pitch in these and other aspects of normal auditory function.
Materials and Methods
Subjects and Set-Up.
Ten amusic and 10 matched control participants were tested. The two groups did not differ significantly in age, years of education, years of musical training, and audiometric thresholds (Table S1). Each amusic scored 2 SD below the mean of the general population when tested with the Montreal Battery of Evaluation of Amusia, six tests that assess musical pitch perception, rhythm perception, and memory (54). Stimuli were delivered by an external soundcard (RME Fireface 800) with a 48-kHz sampling rate and 16-bit resolution. Sounds were played out with closed headphones (Beyerdynamic DT 770 Pro) at 65 dB SPL. Participants were tested in a sound-insulated booth. All listeners were fully informed about the goal of the present study and provided written consent before their participation.
Stimuli.
Chords.
Natural recordings of single notes from the equal-tempered scale (from a saxophone or a trained female vocalist) were summed to create intervals or triads. Eight different versions of each type of chord were used, each with a different root note, ranging from G#3 to D#4.
Harmonicity stimuli.
Harmonic and inharmonic complexes were generated by summing pure tones with aligned starting phases. The complexes amplitudes were decreased by 14 dB per octave to resemble natural sound spectra. Harmonic stimuli comprised: (i) pure tone, (ii) harmonic-subset (F0, 2F0, 3F0, 5F0, and 9F0), and (iii) octave-spaced (F0, 2F0, 4F0, 16F0). Inharmonic stimuli comprised: (i) jittered (even harmonics of the harmonic-subset tones were shifted up and odd harmonics were shifted down, by half a semitone), (ii) shifted (all harmonics of the harmonic-subset tones were shifted up by 30 Hz), and (iii) 13st-spaced (four frequency components, each 13 semitones apart). We also tested dichotic versions of these stimuli (see SI Text and Figs. S5 and S6 for motivation and results).
Beating stimuli.
Stimuli consisted of pairs of pure tones, either 0.75 or 1.5 semitones apart. These frequency separations are small enough that both tones fall within the same critical band when presented diotically, and thus produce a fair amount of beating. Nonbeating stimuli consisted of the same pure tones presented dichotically, largely eliminating perceived beating (8, 25).
The stimuli of Exps. 1, 3, 4, and 5 (chord, harmonicity, and beating test stimuli) were taken from a previous study (8). See Audios S1, S2, S3, S4, S5, S6, S7, S8, S9, and S10 for audio examples of stimuli.
Procedure.
Responses were collected using a keyboard. Sounds were presented automatically 500 ms after the subject’s previous response.
Pleasantness ratings.
Chords of Exp. 1 were presented in two blocks of 128 trials each, in which all 16 chord types and eight root pitches were randomly interleaved. Saxophone and voice chords were presented in separate blocks. Vocalizations of Exp. 2 were presented in a single block of 80 trials in which all 10 speakers and four types of emotions were repeated once and randomly interleaved. Exp. 3 was divided into two blocks of 69 trials that featured all types of harmonicity and beating stimuli randomly interleaved. The F0s of the stimuli were randomly assigned to C#4, D#4, or F4 in one of the blocks and to D4, E4, or F#4 in the other block.
Discrimination experiments.
Exp. 4 was carried out in one block of 90 trials. On each trial, one randomly chosen interval contained the inharmonic tone; the two others contained its harmonic counterpart (harmonic-subset for shifted and jittered, and octave-spaced for 13st-spaced). Trial types were randomly intermixed within the block. The three tones had F0s two semitones apart (C#4, D#4, F4) or (D4, E4, F#4) in a random order. Exp. 5 was carried out in one block of 30 trials in which the 0.75 and 1.25 semitones pairs of pure tones were randomly interleaved. On each trial, one randomly chosen interval contained a pair of tones presented diotically; the other two contained pairs of tones with the same frequency separation presented dichotically. The most salient difference between the two types of stimuli was thus the presence or absence of beating. The lower tones of the stimuli within a trial were either (C#4, D#4, F4) or (D4, E4, F#4), randomly assigned within and across trials. Feedback was provided in both discrimination experiments.
Supplementary Material
Acknowledgments
This work was supported by a postdoctoral grant from the Fyssen Foundation (to M.C.); the Howard Hughes Medical Institute (to J.H.M.); and grants from Natural Sciences and Engineering Research Council of Canada, the Canadian Institutes of Health Research, and from the Canada Research Chairs program (to I.P.).
Footnotes
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1207989109/-/DCSupplemental.
References
- 1.Dowling WJ, Harwood DL. Music Cognition. Orlando, FL: Academic; 1986. [Google Scholar]
- 2.Krumhansl CL. Cognitive Foundations of Musical Pitch. New York: Oxford Univ Press; 1990. [Google Scholar]
- 3.Deutsch D. The Psychology of Music. 2nd Ed. San Diego: Academic; 2009. [Google Scholar]
- 4.Johnson-Laird PN, Kang OE, Leong YC. On musical dissonance. Music Percept. 2012;30(1):19–35. [Google Scholar]
- 5.Plomp R, Levelt WJ. Tonal consonance and critical bandwidth. J Acoust Soc Am. 1965;38(4):548–560. doi: 10.1121/1.1909741. [DOI] [PubMed] [Google Scholar]
- 6.Kameoka A, Kuriyagawa M. Consonance theory part I: Consonance of dyads. J Acoust Soc Am. 1969;45(6):1451–1459. doi: 10.1121/1.1911623. [DOI] [PubMed] [Google Scholar]
- 7.Kameoka A, Kuriyagawa M. Consonance theory part II: Consonance of complex tones and its calculation method. J Acoust Soc Am. 1969;45(6):1460–1469. doi: 10.1121/1.1911624. [DOI] [PubMed] [Google Scholar]
- 8.McDermott JH, Lehr AJ, Oxenham AJ. Individual differences reveal the basis of consonance. Curr Biol. 2010;20(11):1035–1041. doi: 10.1016/j.cub.2010.04.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Trainor LJ, Tsang CD, Cheung VHW. Preference for sensory consonance in 2- and 4-month-old infants. Music Percept. 2002;20(2):187–194. [Google Scholar]
- 10.Zentner MR, Kagan J. Perception of music by infants. Nature. 1996;383(6595):29. doi: 10.1038/383029a0. [DOI] [PubMed] [Google Scholar]
- 11.Masataka N. Preference for consonance over dissonance by hearing newborns of deaf parents and of hearing parents. Dev Sci. 2006;9(1):46–50. doi: 10.1111/j.1467-7687.2005.00462.x. [DOI] [PubMed] [Google Scholar]
- 12.Perani D, et al. Functional specializations for music processing in the human newborn brain. Proc Natl Acad Sci USA. 2010;107(10):4758–4763. doi: 10.1073/pnas.0909074107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sugimoto T, et al. Preference for consonant music over dissonant music by an infant chimpanzee. Primates. 2010;51(1):7–12. doi: 10.1007/s10329-009-0160-3. [DOI] [PubMed] [Google Scholar]
- 14.Chiandetti C, Vallortigara G. Chicks like consonant music. Psychol Sci. 2011;22(10):1270–1273. doi: 10.1177/0956797611418244. [DOI] [PubMed] [Google Scholar]
- 15.McDermott J, Hauser M. Are consonant intervals music to their ears? Spontaneous acoustic preferences in a nonhuman primate. Cognition. 2004;94(2):B11–B21. doi: 10.1016/j.cognition.2004.04.004. [DOI] [PubMed] [Google Scholar]
- 16.Butler JW, Daston PG. Musical consonance as musical preference: A cross-cultural study. J Gen Psychol. 1968;79(1st Half):129–142. doi: 10.1080/00221309.1968.9710460. [DOI] [PubMed] [Google Scholar]
- 17.Maher TF. “Need for resolution” ratings for harmonic musical intervals: A comparison between Indians and Canadians. J Cross Cult Psychol. 1976;7(3):259–276. [Google Scholar]
- 18.Fritz T, et al. Universal recognition of three basic emotions in music. Curr Biol. 2009;19(7):573–576. doi: 10.1016/j.cub.2009.02.058. [DOI] [PubMed] [Google Scholar]
- 19.Plomp R. The ear as a frequency analyzer. J Acoust Soc Am. 1964;36:1628–1636. doi: 10.1121/1.1910894. [DOI] [PubMed] [Google Scholar]
- 20.Helmholtz H. On the Sensation of Tone as a Physiological Basis for the Theory of Music. New York: Dover; 1954. [Google Scholar]
- 21.Sethares WA. Local consonance and the relationship between timbre and scale. J Acoust Soc Am. 1993;94(3):1218–1228. [Google Scholar]
- 22.Hutchinson W, Knopoff L. The acoustic component of western consonance. J New Music Res. 1978;7(1):1–29. [Google Scholar]
- 23.Tramo MJ, Cariani PA, Delgutte B, Braida LD. Neurobiological foundations for the theory of harmony in western tonal music. Ann N Y Acad Sci. 2001;930:92–116. doi: 10.1111/j.1749-6632.2001.tb05727.x. [DOI] [PubMed] [Google Scholar]
- 24.Terhardt E. On the perception of periodic sound fluctuations (roughness) Acustica. 1974;30(4):202–213. [Google Scholar]
- 25.Rutschmann J, Rubinstein L. Binaural beats and binaural amplitude-modulated tones: Successive comparison of loudness fluctuations. J Acoust Soc Am. 1965;38(5):759–768. doi: 10.1121/1.1909802. [DOI] [PubMed] [Google Scholar]
- 26.Vassilakis P. Auditory roughness as a means of musical expression. Selected Reports in Ethnomusicology. 2005;12:119–144. [Google Scholar]
- 27.DeWitt LA, Crowder RG. Tonal fusion of consonant musical intervals: The oomph in Stumpf. Percept Psychophys. 1987;41(1):73–84. doi: 10.3758/bf03208216. [DOI] [PubMed] [Google Scholar]
- 28.Huron D. Tonal consonance versus tonal fusion in polyphonic sonorities. Music Percept. 1991;9(2):135–154. [Google Scholar]
- 29.Ebeling M. Neuronal periodicity detection as a basis for the perception of consonance: A mathematical model of tonal fusion. J Acoust Soc Am. 2008;124(4):2320–2329. doi: 10.1121/1.2968688. [DOI] [PubMed] [Google Scholar]
- 30.Ayotte J, Peretz I, Hyde K. Congenital amusia: A group study of adults afflicted with a music-specific disorder. Brain. 2002;125(Pt 2):238–251. doi: 10.1093/brain/awf028. [DOI] [PubMed] [Google Scholar]
- 31.Drayna D, Manichaikul A, de Lange M, Snieder H, Spector T. Genetic correlates of musical pitch recognition in humans. Science. 2001;291(5510):1969–1972. doi: 10.1126/science.291.5510.1969. [DOI] [PubMed] [Google Scholar]
- 32.Peretz I, Cummings S, Dubé MP. The genetics of congenital amusia (tone deafness): A family-aggregation study. Am J Hum Genet. 2007;81(3):582–588. doi: 10.1086/521337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Hyde KL, et al. Cortical thickness in congenital amusia: When less is better than more. J Neurosci. 2007;27(47):13028–13032. doi: 10.1523/JNEUROSCI.3039-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Loui P, Alsop D, Schlaug G. Tone deafness: A new disconnection syndrome? J Neurosci. 2009;29(33):10215–10220. doi: 10.1523/JNEUROSCI.1701-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Hyde KL, Zatorre RJ, Peretz I. Functional MRI evidence of an abnormal neural network for pitch processing in congenital amusia. Cereb Cortex. 2011;21(2):292–299. doi: 10.1093/cercor/bhq094. [DOI] [PubMed] [Google Scholar]
- 36.Foxton JM, Dean JL, Gee R, Peretz I, Griffiths TD. Characterization of deficits in pitch perception underlying ‘tone deafness’. Brain. 2004;127(Pt 4):801–810. doi: 10.1093/brain/awh105. [DOI] [PubMed] [Google Scholar]
- 37.Liu F, Patel AD, Fourcin A, Stewart L. Intonation processing in congenital amusia: Discrimination, identification and imitation. Brain. 2010;133(Pt 6):1682–1693. doi: 10.1093/brain/awq089. [DOI] [PubMed] [Google Scholar]
- 38.Peretz I, et al. Congenital amusia: A disorder of fine-grained pitch discrimination. Neuron. 2002;33(2):185–191. doi: 10.1016/s0896-6273(01)00580-3. [DOI] [PubMed] [Google Scholar]
- 39.Hyde KL, Peretz I. Brains that are out of tune but in time. Psychol Sci. 2004;15(5):356–360. doi: 10.1111/j.0956-7976.2004.00683.x. [DOI] [PubMed] [Google Scholar]
- 40.Vos PG, Troost JM. Ascending and descending melodic intervals: Statistical findings and their perceptual relevance. Music Percept. 1989;6:383–396. [Google Scholar]
- 41.Tufts JB, Molis MR, Leek MR. Perception of dissonance by people with normal hearing and sensorineural hearing loss. J Acoust Soc Am. 2005;118(2):955–967. doi: 10.1121/1.1942347. [DOI] [PubMed] [Google Scholar]
- 42.Belin P, Fillion-Bilodeau S, Gosselin F. The Montreal Affective Voices: A validated set of nonverbal affect bursts for research on auditory affective processing. Behav Res Methods. 2008;40(2):531–539. doi: 10.3758/brm.40.2.531. [DOI] [PubMed] [Google Scholar]
- 43.Kohlrausch A, Fassel R, Dau T. The influence of carrier level and frequency on modulation and beat-detection thresholds for sinusoidal carriers. J Acoust Soc Am. 2000;108(2):723–734. doi: 10.1121/1.429605. [DOI] [PubMed] [Google Scholar]
- 44.Pressnitzer D, McAdams S. Two phase effects in roughness perception. J Acoust Soc Am. 1999;105(5):2773–2782. doi: 10.1121/1.426894. [DOI] [PubMed] [Google Scholar]
- 45.Ritsma RJ. Frequencies dominant in the perception of the pitch of complex sounds. J Acoust Soc Am. 1967;42(1):191–198. doi: 10.1121/1.1910550. [DOI] [PubMed] [Google Scholar]
- 46.Bidelman GM, Krishnan A. Neural correlates of consonance, dissonance, and the hierarchy of musical pitch in the human brainstem. J Neurosci. 2009;29(42):13165–13171. doi: 10.1523/JNEUROSCI.3900-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Schwartz DA, Howe CQ, Purves D. The statistical structure of human speech sounds predicts musical universals. J Neurosci. 2003;23(18):7160–7168. doi: 10.1523/JNEUROSCI.23-18-07160.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Stewart L. Characterizing congenital amusia. Q J Exp Psychol (Hove) 2011;64(4):625–638. doi: 10.1080/17470218.2011.552730. [DOI] [PubMed] [Google Scholar]
- 49.Tillmann B, Schulze K, Foxton JM. Congenital amusia: A short-term memory deficit for non-verbal, but not verbal sounds. Brain Cogn. 2009;71(3):259–264. doi: 10.1016/j.bandc.2009.08.003. [DOI] [PubMed] [Google Scholar]
- 50.Williamson VJ, McDonald C, Deutsch D, Griffiths TD, Stewart L. Faster decline of pitch memory over time in congenital amusia. Adv Cogn Psychol. 2010;6:15–22. doi: 10.2478/v10053-008-0073-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Gosselin N, Jolicoeur P, Peretz I. Impaired memory for pitch in congenital amusia. Ann N Y Acad Sci. 2009;1169:270–272. doi: 10.1111/j.1749-6632.2009.04762.x. [DOI] [PubMed] [Google Scholar]
- 52.Bruckert L, et al. Vocal attractiveness increases by averaging. Curr Biol. 2010;20(2):116–120. doi: 10.1016/j.cub.2009.11.034. [DOI] [PubMed] [Google Scholar]
- 53.Bregman AS. Auditory Scene Analysis. Cambridge, MA: MIT Press; 1990. [Google Scholar]
- 54.Peretz I, Champod AS, Hyde K. Varieties of musical disorders. The Montreal Battery of Evaluation of Amusia. Ann N Y Acad Sci. 2003;999:58–75. doi: 10.1196/annals.1284.006. [DOI] [PubMed] [Google Scholar]
- 55.Patterson RD, Allerhand MH, Giguère C. Time-domain modeling of peripheral auditory processing: A modular architecture and a software platform. J Acoust Soc Am. 1995;98(4):1890–1894. doi: 10.1121/1.414456. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.






