Skip to main content
PLOS One logoLink to PLOS One
. 2025 Jul 3;20(7):e0316963. doi: 10.1371/journal.pone.0316963

An album is a story: Feature arcs in sequences of tracks

Pedro Neto 1,*,#, Martin Hartmann 1,#, Geoff Luck 1,#, Petri Toiviainen 1,#
Editor: Seung-Goo Kim2
PMCID: PMC12225790  PMID: 40608693

Abstract

When releasing an album, musicians often think about the order in which their songs will be presented to the listener. In a previous study, we analyzed sequencing patterns within a set of 50,000 published works and found that louder, more energetic tracks are usually placed before quieter ones. Here we asked 130 music professionals to deliberately arrange the order of songs for a hypothetical album release. Our findings reveal consistent statistical patterns of track sequencing, with musicians showing broad agreement on the placement of songs within an album. Overall, tempo follows an inverted U-shape, while valence, arousal and loudness exhibit the opposite pattern. Results are interpreted in light of auditory science, and compared with sequencing trends commonly found in song lyrics and literary works.

1 Introduction

Human activities are often performed according to a set of well established sequencing principles. These principles can dictate, for instance, the order in which ingredients should be added to a dish [1], the progression of exercises in a gym routine [2], and the types of quests best suited for the early, middle and late stages of a game [3]. In every case, these principles are intended to facilitate, improve, or even enable the success of the activity that is being performed.

As a time-based activity, music is often seen as the art of organizing sounds into a sequence. French composer André Hodeir notes that “a musical phrase, no matter how beautiful it is, reaches its expressive summit only when it is in perfect harmony with preceding and following phrases” (as cited in [4, p. 811]). Contemporary artists often agree, and state that the order in which tracks are presented in an album can affect the musical experience of the listener [58].

Anecdotally, musicians hold varying beliefs about how songs should be sequenced. It has been said, for instance, that pairs of consecutive tracks should not be too similar in terms of tempo and key [68], but that similar tonalities are acceptable if the paces of the songs are dramatically different [5]. According to ref. [6], musicians should not put two slow songs next to each other, and singer David Brewis adds that “if you go straight from a quite fast song [...] to just a little bit of a slower song, it can make the slower song seem like it’s dragging. We have to avoid that” [5].

Musicians also seem to pay special attention to the first song of an album [7,911]. According to [11], “if you don’t catch people right off the bat, they might not hear the hits at the end.” In an interview to Garvey [5], producers Peter Hammill and Ashley Abram corroborate this idea, suggesting that “hits” should be at the front of the album.

It is also believed that album sequences should establish a well-defined trajectory from the first to the last track. According to [10], “arranging songs by key from lowest to highest creates a positive feeling, and a gradual increase in tempo through an album might evoke a rising sensation.” Hahn [9] suggests, in addition, that if sequences of tracks “build and release tension over the whole release [...] tracks will hit harder individually, and the overall effect [of the album] will be enhanced.”

Inspired by these anecdotal beliefs, as well as by a lack of systematic investigations on this issue, we conducted a study [12] in which track features were analyzed for 51010 published albums. It was found that songs with high values of valence, energy and loudness are more likely to be positioned at the beginning of each album, and that consecutive tracks tend to alternate between increases and decreases of valence and energy. Overall, our findings expose the algorithmic nature of album sequencing, and suggest that some rationales are followed by a wide range of artists and producers, somewhat independent from musical genre (Fig 1).

Fig 1. Feature values along album segments.

Fig 1

Standardized feature distribution of 51000 albums found by [12]. Given a varying number of tracks, albums were discretized into 3 segments (i.e., beginning, middle, end). Feature values represent the average feature for all tracks pertaining to a given album segment.

Albums were chosen for our previous investigation because these are prime examples of a musical context in which professionals make deliberate decisions about the order in which their tracks will be presented to the listener. While playlists available on major streaming platforms may also be curated with attention to track order, these are essentially different from albums in a few key aspects: they are not necessarily created by music professionals, they can be shuffled and modified by any user, and they may serve a diverse range of purposes, such as exercise, mood regulation and background listening [1315].

Even though we chose albums as a test case for our track sequencing investigation, findings in this area have the potential to fuel the development of applications in other musical contexts, such as automated playlist generation [16], DJ set creation [17], and music recommendation algorithms [18], which can incorporate principles used by professional musicians to determine the order in which their songs will be presented to the listener.

In our previous study, our analyses were limited to a kind of between-subjects exploration of album sequencing, where each musician/music producer sequenced a specific set of tracks for their own album. We could not investigate, for instance, the extent to which musicians agree about the ordering of a common set of songs. Here, we focus on expanding upon our previous results by asking professional musicians to sequence a set of tracks, as if they were to be released on one of their albums. We also improve on the reproducibility and interpretability of our last study by computing audio features with the open-source MIRtoolbox [19], rather than using proprietary features computed by Spotify. Finally, we administer an exploratory survey to investigate participants’ opinions about different aspects of album sequencing.

2 Materials and methods

2.1 Participants

Professional musicians and music producers were recruited through social media channels, and no compensation was offered for their participation. Our final sample comprised 130 individuals who had played an average of 263.9 concerts (SD = 663.8), recorded 6.12 albums (SD = 15.11), and had 14.1 years of musical experience (SD = 10.9), for detailed participant statistics, see Supporting Information (S2. Participants’ musical experience). A total of 25 participants answered the survey in Portuguese, whereas the other 105 in English. We did not exclude any participant based on their musical experience, and considered their self-report of professional musicianship both necessary and sufficient for participation in our study.

2.2 Stimuli

For the sequencing task, we devised 3 sets of 5 tracks within jazz and classical genres. These genres were chosen because, in comparison with Electronic Dance Music (EDM) and Funk, for instance, jazz and classical compositions are notorious for offering wider variation of rhythm, harmony, and dynamics from one piece to another. This is important because, if music-related features influence how albums are sequenced, as suggested by anecdotal evidence [58], as well as by our previous study [12], maximizing the variability of song styles within each set would make this effect clearer. Finally, Jazz and Classical music are often instrumental, which allows us to control for the possible effect of lyrics.

A relatively small number of pieces was chosen in order to minimize the necessary time to complete the online survey. Also, the choice of working with only 2 genres is supported by our previous study, which suggested that sequencing patterns are, at least to some extent, generalizable between different genres (Fig 1; [12]). Still, we chose to work with 3 distinct sets of songs, as a wider spectrum of features values may increase the generalizability of our results.

Set 1 was composed of pieces from a Jazz album with relatively low popularity (i.e., less than 1000 monthly listeners on Spotify), which controls for possible effects of familiarity [20]. This album also offered high recording quality, a wide spectrum of track styles, and a single instrumental formation throughout all tracks (i.e., piano and electric guitar), which would also control for the effect that instrumentation may have on track sequencing.

Tracks within sets 2 and 3 were manually selected from the Dataset on Emotional Annotations [21], which is composed of 400 pieces from the recording company Magnatune, equally divided between classical, rock, pop and electronic genres. As reported by [21], this dataset provides high quality audio recordings, and pieces that are generally not known by a wide audience. Only classical tracks were chosen for both sets because of their abundance of instrumental compositions. Individual tracks were selected irrespective of authorship or album.

For set 1, we manually selected 60-second segments that we judged to be representative of their respective tracks. We did not compare the features of these segments with those of the full tracks; rather, selection was based on perceptual judgment. While systematic audio thumbnailing approaches [22] could have been employed, we argue that the representativeness of specific segments is not particularly relevant to the research question posed in this study. Even if a segment is not fully representative of the entire track, it still allows us to analyze how its features influence sequencing decisions. Tracks in Sets 2 and 3 were pre-segmented into 60-second clips within the dataset provided by [21]. All clips included 1-second fade-ins and fade-outs. A list of song names and corresponding clips is available as Supporting Information, S1. Track information.

2.3 Procedures

Each participant was assigned to one of the 3 sets of songs. Their task was to organize the tracks into a sequence that would be released in a musical album. The computer interface presented to participants is displayed in Fig 2. The initial disposition of tracks was randomly determined to each participant. Songs could be heard any number of times, and there was no time limit for the completion of the task. After sequencing, participants were presented with a survey comprising the following propositions:

Fig 2. Computer interface.

Fig 2

Computer interface used to sequence tracks and rate survey propositions.

  • (a) Determining a good song sequence is important for the overall quality of an album

  • (b) In an album, there should not be 3 or more consecutive songs that are too similar to each other

  • (c) Usually, it is better for an album to start with the most striking and energetic songs

  • (d) Out of all the time I spend thinking about song ordering, I spend most of this time trying to decide which one will be the first song of my albums

  • (e) Usually, it is better for an album to end with the most striking and energetic songs

  • (f) In an album, there should not be 3 or more consecutive songs in the same tempo

  • (g) Out of all the time I spend thinking about song ordering, I spend most of this time trying to decide which one will be the last song of my albums

  • (h) In an album, there should not be 3 or more consecutive songs in the same tonality

  • (i) In an album, consecutive songs should be as different as possible from each other

Propositions were rated on a visual analog scale (Fig 2) from 0 to 100, with 0 representing “Completely disagree” and 100 representing “Completely agree.”

We ran our survey and sequencing tasks in 3 separate stages, one for each set of songs. Recruitment occurred between 09.11.2021, and 05.02.2022. Responses for set 1 were collected over a period of 2 months, and responses for sets 2 and 3 were collected over a period of one month each, concomitantly. Set 1 was sequenced by 62 participants, while sets 2 and 3 were sequenced by 37 and 31 participants, respectively.

All surveys were performed in accordance with the guidelines and regulations of the National Advisory Board on Research Ethics in Finland (TENK, see https://www.tenk.fi/sites/tenk.fi/files/ethicalprinciples.pdf). Individuals were informed about their rights as participants, the content of the research, and gave their informed consent by clicking an Agree button that appeared on the computer screen. Furthermore, all data was collected anonymously, and no sensitive information was obtained.

2.4 Sequencing analysis

Two aspects of track sequencing were analyzed: positional regularity (PR), and feature distribution. The first set of analysis indicated the extent to which participants agree about how to position specific tracks in albums, whereas the later indicated how different music-related features are distributed in a sequence of songs.

PR is defined as the normalized Shannon entropy η of track frequencies within each album position. An album is composed by a set of 5 tracks L={A,B,C,D,E}, and a list of positions P=(P1,P2,P3,P4,P5). For each position pP , we define the frequency vector fp=(fpA,fpB,fpC,fpD,fpE), where fpl represents the frequency with which track lL was chosen for position pP. The vector fP1 = (0.6, 0.0, 0.0, 0.4, 0.0) indicates, for instance, that tracks A and D were chosen for P1 60% and 40% of the times, respectively. PR is thus calculated as:

η(fp)=1H(fp)Hmax=1lLfpllog2(fpl)log2(fp) (1)

High PR indicates that one or more tracks are chosen for a given position with a higher probability than the others (e.g., 90% of participants choose track A as the opening of an album). Conversely, if PR is low, all tracks have approximately the same probability of being placed in a given album position (i.e., 20%).

For hypothesis testing, we created a control dataset consisting of 1,300,000 randomly generated track sequences. This number was obtained by shuffling each participant-generated album 10,000 times. Statistical significance was then calculated by means of bootstrapping, where we iteratively computed PRsim from a subsample of 62, 37 and 31 albums within sets 1, 2 and 3 of the control dataset, respectively. An empirical cumulative distribution (ECDF) was obtained by repeating this process 10,000 times, allowing for replacement. Finally, p-values were calculated as the probability of obtaining a given PRtrue or higher in the ECDF of PRsim, which represents the null hypothesis.

2.5 Feature analysis

For feature analyses, we computed tempo, loudness, valence, and arousal of each track, and evaluated how each feature was organized sequentially. Instead of focusing on lower-level features, such as spectral flux and zero-crossing rate, we chose to work with features that are easily interpretable in terms of their perceptual value.

Valence represents how positive or negative a given stimulus is perceived to be, while arousal measures the perceived intensity of the stimulus on a scale from low to high. In music, valence and arousal have been shown to have predictive and explanatory power with regards to memory [23,24], neural activity [25], and even decision making [26]. Furthermore, these dimensions of perceived emotion have been shown to be robust across cultures, leading authors to hypothesize that valence and arousal represent universal aspects of music perception [27].

In this study, valence and arousal are computed with a Multiple Linear Regression model proposed by [28]. This method is implemented as MIRtoolbox’s miremotion function [19], and it estimates emotional ratings from lower level acoustic features such as spectral centroid, Root Mean Square (RMS), spectral spread, entropy, and spectral novelty. As reported by [28], this model was trained on human ratings of 360 soundtrack excerpts, given by 116 participants, and was capable of explaining 70% of the variance in their data. Predictions from miremotion have been used, for instance, in the context of emotion visualization systems [29], movie-scene emotion recognition [30], and modeling of bodily sensations [31].

Loudness and tempo, in turn, are also important musical features, which are not only used to calculate valence and arousal [19,32], but have themselves been shown to affect cognitive processing speed [33], music-induced chills [34] and music-induced movement [35,36], for instance. Loudness and tempo were also computed with MIRtoolbox version 1.3 [19].

In order to test the hypothesis that feature values would differ between positions of an album, we conducted a mixed-design Analysis of Variance (ANOVA) with position (i.e., positions 1 to 5) and feature (i.e., valence, arousal, loudness and tempo) as within-subjects factors, and set (i.e., 1 to 3) as between subjects factor. Feature values were converted to z-scores for each participant and feature.

In order to test if, in accordance with annecdotal beliefs (see Introduction), musicians follow some patterns of feature change between consecutive tracks, we propose the metric of overall feature dissimilarity D, which computes the Euclidean distance between audio features of tracks i and i + 1. Let ti = (vi,ai,li,τi) represent the feature vector of track i where vi, ai, li and τi are valence, arousal, loudness and tempo values, respectively. The index j refers to the feature dimension, with j = 1 corresponding to valence, j = 2 to arousal, and so on. The average feature difference between a given pair of tracks is thus given by:

Di=14j=14ti+1(j)ti(j) (2)

Similarly, tempo dissimilarity (survey proposition (f)) was computed as the Euclidean distance between consecutive tracks, but with ti composed only of tempo values.

Key dissimilarity (survey proposition (h)) between consecutive tracks was assessed with the function mirkeystrength from MIRtoolbox [19], which computes correlations between the chromagram of a given audio segment and all key-profiles from [37]. The result is a vector of 24 key strength values, which corresponds to each one of the 12 tonalities within both major and minor modes. Key dissimilarity was then calculated by computing the Pearson correlation between key strength profiles of consecutive tracks within each album. We did not use Euclidean distance to calculate this metric because it would be sensitive to the magnitude of key-strength profiles, rather than only to its overall shape. For consistency with the other measures, we show results as 1ρ, which converts key dissimilarity into key dissimilarity, in a range between 0 and 2. Statistical significance of key, tempo and overall feature dissimilarity was obtained with the same bootstrap approach presented in Section 3.5.

2.6 Survey analysis

Upon visual inspection of survey responses, we identified potential polarization of opinions between agree, disagree and neutral. This trend was confirmed through Hartigan’s dip test (detailed statistics available as Supporting Information, S3. Detailed survey statistics), which measures how much a given distribution deviates from unimodality. In the context of our survey, departures from uni-modality cause central-tendency measures to render misleading interpretations.

For instance, take the hypothetical scenario where half of the participants completely agreed with proposition (a), while the other half completely disagreed. The resulting response distribution would be bimodal, and central tendency measures would suggest that participants are neutral regarding A, while, in fact, responses are strongly polarized. To address this issue, we discretized participant responses according to the following piecewise function f:

f(x)={disagreeif x33neutralif 33<x<66agreeif x66 (3)

The frequency of each categorical response was computed and then subjected to a χ2 test, which does not test for the significance of a central tendency of opinions, but rather indicates if participants leaned towards one or more segments of the scale (i.e., disagree, neutral, agree). Finally, the extent to which responses are distributed throughout these segments is visualized with a density plot.

3 Results

3.1 Sequencing regularities

For most positions and sets, the frequency of tracks is visibly different from baseline levels (i.e., 20%), indicating that track sequencing was not performed randomly, as shown in Fig 3 (left-side). For instance, in set 2, track B was chosen for position 1 (P1) by 37% of the musicians, while track C, was chosen 0% of the times. Overall, large differences between the frequency of a given track and the baseline of 20% will lead to high PRs (Fig 3, right side).

Fig 3. Track frequency by position and set.

Fig 3

PR by position and set, as well as PR across all sets (overall). Shaded area represents the standard deviation for each position.

The distribution of PRsimulated across all sets had a mean and standard deviation of .0096 and .006, respectively (Fig 4, left-side). Fig 4 shows the PRtrue for each position in relation to the distribution of PRsimulated. As visible in Fig 3 and 4, highest PR was found for position 1 (PR = .16, p<.001), followed by position 3 (PR = .09, p<.001), position 4 (PR = .05, p<.001), position 5 (PR = .05, p<.001), and position 2 (PR = 0.02, p<.001).

Fig 4. Distributions of PR.

Fig 4

Left: density plot of PRsimulated with respect to PRtrue obtained from participant-sequenced albums. Right: PRtrue (dashed lines) in relation to the Empirical Cumulative Distribution obtained from the control condition (solid line), where the y axis indicates the probability of obtaining a value equal to, or greater than a given PR.

3.2 Feature analysis

Results of the ANOVA indicated that there was a significant main effect of position, F(3.81,483.63)=6.76,p<.001,η2=.018, suggesting that feature values differed across positions of the album. There was also a significant interaction between position and feature F(7.87,999.39)=2.80,p<.01,η2=.014. Effect sizes were small for both effects. There were no main effects or interactions involving set as a factor.

Feature values show a clear U-shape pattern for loudness, valence and arousal, whereas tempo exhibits an inverted U-shape (Fig 5). Songs in the first and last positions of the albums exhibited the two highest valence (z-scores of 0.459 and –0.006, respectively) and loudness levels (0.36 and 0.006). Tempo was lowest in the first and last positions (–0.07 and –0.08) whereas arousal was stronger in the first position (0.26), but below average in the last one (–0.07).

Fig 5. Feature distributions.

Fig 5

Feature distribution per set and averaged across all sets. Shaded area represents the standard error of the mean for each position and feature.

To assess the statistical significance of the observed U-shaped relationships, a post-hoc ANOVA was performed to compare the residuals of a linear model with those of a quadratic model. Both models were constructed to explain the effect of position (ranging from 1 to 5) and feature type (i.e., valence, arousal, loudness, and tempo) on feature values. Results indicated that the model including a quadratic term for position [F(5,2594)=9.172,p<.001] had a significantly better fit than the linear model [F(5,2594)=6.05p<.001], with lower residual sum of squares, and a higher adjusted explained variance (RSSlinear = 2558.2; RSSquadratic = 2534.2; Rlinear2=.009;Rquadratic2=.018.

Overall dissimilarities (Fig 6) between album positions were not different between control and experimental condition (P1/P2 = 3.15, P2/P3 = 3.05, P3/P4 = 3.1, P4/P5 = 3.06, with all p>.05). Similarly, tempo dissimilarities (P1/P2 = 69.71, P2/P3 = 71.94, P3/P4 = 70.77, P4/P5 = 68.55) did not differ from the random control distribution (M = 69.05, SD = 3.27, p>.05). Finally, p-values for key dissimilarity between consecutive album positions were statistically significant only for pair of tracks P4/P5 = 1.086 (p<.05), but not for the other ones (P1/P2 = 1.008, P2/P3 = 1.001, P3/P4 = 1.033; p>.05) when compared to the control condition (M = 1.02, SD = .06).

Fig 6. Empirical cumulative distributions of track similarities.

Fig 6

Key dissimilarity, tempo dissimilarity and overall feature dissimilarity between pairs of consecutive tracks in the experimental condition (dashed lines), in relation to the Empirical Cumulative Distribution in the control condition. Values in the y-axis represent the probability of obtaining a given value or higher under the current ECDF.

3.3 Survey

Results of the χ2 test were statistically significant for all but proposition D, indicating that responses for that proposition are relatively balanced between disagree, neutral and agree. If compared to the continuous responses for the same propositions (Fig 7) we see that, in fact, responses are shaped in 3 strong peaks, roughly centered at 25 (disagree), 50 (neutral) and 75 (agree).

Fig 7. Distribution of survey responses.

Fig 7

Density plots show the distribution of opinions throughout a continuous scale between completely agree and completely disagree. Bar plots show the proportion of respondents who agreed, disagreed, or were neutral about a given proposition.

All other χ2 statistics were significant at p<.05 (detailed statistics available as Supporting Information S3. Detailed survey statistics), which indicates that participants are biased towards one or two particular sides of the scale. The importance of track sequence in album production was the least controversial item of the survey, with most participants (84%) indicating their agreement with proposition (a) (Fig 7). Proposition B was also relatively uncontroversial, with participants agreeing that albums should not present very similar songs consecutively (63%).

Propositions (c), (d), (f), (g) and (h), were the most controversial ones, with many responses centered around 3 levels of the scale. With the exception of proposition (d), the distribution of responses for these propositions was still statistically significant, which indicates that, despite polarization, participants were biased towards at least one particular end of the scale.

Propositions (e) and (i) (Fig 7) received the highest frequencies of responses in the left end of the scale (i.e., disagree), indicating that musicians mostly disagree with the proposition that albums should end with the most striking and energetic songs, and that consecutive songs are to be as different as possible from each other, respectively.

Due to its exploratory nature, we refrain from conducting further hypothesis tests on individual propositions, and we consider that visualizations presented in Fig 7 provide enough information for us to discuss the results. All data and code utilized in this study are available from https://osf.io/69n3x (DOI 10.17605/OSF.IO/69N3X).

4 Discussion

Our findings demonstrate significant levels of agreement between music professionals regarding how tracks should be sequenced in an album. As measured by PR, particularly high levels of agreement were found for the first position of the albums. Results of the ANOVA suggest that the position within the album significantly influences the feature values, and this effect varies depending on the feature type. Furthermore, there was no effect of set, indicating that patterns were consistent irrespective of the songs and genres chosen for each set of tracks.

4.1 Album sequencing, emotional arcs and storytelling

4.1.1 Literary works.

As shown in Fig 5, opening and closing tracks show high levels of valence and arousal. Following the circumplex model ([38]), this combination of features represents emotions such as excitement, delightfulness, and happiness. Middle tracks, on the other hand, show lower levels of valence and arousal, which corresponds to core emotions such as depression, boredom, and tiredness (for visual representations of the circumplex core emotions, see [39]).

With respect to valence, emotional arcs found in our albums bear a striking resemblance to those found in literary works. In their groundbreaking research, [40] performed computational sentiment analysis on a body of 1327 books from the Gutenberg Project (A), and found that 6 types of emotional arc generally prevail. These arcs correspond to the change in positivity (valence) of the narrative from start to finish, and have been categorized as follows:

  1. Rags to riches—a steady rise from bad to good fortune

  2. Riches to rags—a fall from good to bad, a tragedy

  3. Icarus—a rise then a fall in fortune

  4. Oedipus—a fall, a rise then a fall again

  5. Cinderella—rise, fall, rise

  6. Man in a hole—fall, rise

The Man in a hole arc, characterized by fall and a rise of positivity, was one of the most common arcs found by [40], accounting for 30% of all arcs in the literary works analyzed. Similar to literature, our findings suggest that album production favors the Man in a hole arc. This is true for overall trajectories of valence, but also for each individual set of tracks (Fig 5, left side).

It should be noted, however, that the textual features analyzed by [40] are essentially different from the audio features from our current study. It is still unclear, therefore, the extent to which similarity of emotional arcs are comparable from a perceptual standpoint. Furthermore, here we analyzed valence for aggregates of track sequences, whereas [40] analyzed emotional arcs followed by individual books. Still, even if other types of emotional arcs were also present in our sample, there is a strong prevalence of Man in a hole valence arcs within our album sequences, as revealed by our analysis of feature sequencing. Future work could still address the existence of alternative arcs within professional music albums.

4.1.2 Song lyrics.

In a recent study, [41] analyzed the progression of storytelling elements in a large database of song lyrics. Specifically, the authors analyzed the distribution of staging, plot progression and cognitive tension across different segments of songs. Their findings revealed that lyrics tend to start with higher rates of words related to staging and cognitive tension, which gradually declines as the song progresses. Notably, cognitive tension peaks midway through the songs.

Again, there are major differences between our study and that of [41]. While here we focus on audio features and music albums, Alberhasky [41] investigate lyrics of individual songs. In fact, valence, arousal, loudness, and tempo can hardly be compared to staging, plot progression and cognitive tension. Still, our studies reveal strong similarities in the overall shapes of feature progressions. For instance, cognitive tension decreases throughout the song, similar to the way that valence and arousal tend to decrease throughout an album.

Whether or not these similarities are mere coincidences remains to be investigated. It is likely, however, as suggested by [12,41,41], that these patterns reflect some perceptual aspects that generally play a role when stimuli are presented sequentially, be it in audio features, lyrics, or literary plots. Future research may also expand on the work of [41], and analyze how lyric-related features evolve throughout segments of an album or concert. In the next section, we raise hypotheses about the perceptual relevance of track sequences.

4.2 Why do musicians care about track sequencing?

So far we have provided empirical support for the idea that professional musicians see track sequencing as an important aspect of album production. The perceptual effect that different sequences of tracks may or may not have from the perspective of the listener, however, remains to be explored. It is worth noticing, in this respect, that while we found a significant effect of position in the album on feature values, small effect sizes raise questions about the practical relevance of these results. One could hypothesize, for instance, that track sequencing is an idiosyncrasy of music professionals, and that it is completely irrelevant to those who actually listen to the album.

In literary works [40], the choice of emotional arc has significant correlations with how successful the story is, as measured by the number of downloads that a particular book accumulates. For music, however, research suggests that randomly shuffling movements from Beethoven’s compositions affected neither the pleasantness nor the emotional impact of these pieces [42]. These findings have been replicated with different methods, composers and music genres [4347].

On the other hand, a significant body of research has shown that the perception of a given sound is dependent on the quality of the musical features that precede it. The renowned probe-tone paradigm demonstrates, for instance, that a single pitch can be perceived differently depending on the notes that come before it [48]. This is supported by other research paradigms, which generally show that musical context influences how individuals react to music both behaviorally and physiologically [4952].

In light of auditory science, it seems unlikely that track sequencing has no impact on any facet of auditory perception. The studies [5356] show, for instance, that variations in sound intensity, such as sudden and abrupt increases in Sound Pressure Level (SPL), can affect listener’s attention, chills, and levels of arousal. The concept of loudness adaptation [57,58] reveals that the perception of intensity diminishes over time, even if the stimulus remains at a constant SPL. This shows that not only the immediate physical properties of the stimulus are capable of influencing how a sound is perceived, but also the context in which the stimulus is inserted. Future work on album sequencing carries the burden of unraveling what–if any–aspect of music perception is shaped by different sequences of tracks, whether inside or outside the context of an album.

4.3 Comparison with our previous study

Neto et al. [12] found a negative linear trend of valence, energy, loudness and tempo across album positions (Fig 1). This is partially in contrast with our current findings, which show that albums tend to start and end with lower levels of tempo (Fig 5). Similar to our large-scale Spotify analysis, however, valence, arousal and loudness were higher for opening tracks, although here these features followed a U-shaped curve (Fig 5). Such discrepancies could be attributed to at least two factors: (1) difference between MIRtoolbox and Spotify’s feature extraction methods; (2) difference in number of albums analyzed.

We could hypothesize, for instance, that audio features computed by Spotify do not reflect the same dimensions of the features that are computed by MIRtoobox. Whereas the latter is an open-source and peer reviewed feature extraction tool, the former is proprietary method, which does not specify details of how high level features such as valence and energy are computed.

4.4 Comparison between survey responses and track sequences

Considering that this study is mainly inspired by anecdotal opinions about album sequencing, it is interesting to evaluate the extent to which explicit statements about song sequencing actually relate to the way that tracks are positioned on an album. However, as we have only checked for statistical significance of frequency distributions across different scale segments (i.e., agree, disagree and neutral), this section of our study should be interpreted as exploratory.

There was notable agreement between some propositions and sequencing data: participants believed that track sequencing is an important aspect of album production (proposition (a), Fig 7), a sentiment supported by the high levels of PR (Fig 4). With regards to audio features, most participants believed that albums should start with high-energy tracks (proposition (c), Fig 7), and our analysis showed that, in fact, albums often began with tracks exhibiting higher levels of arousal (Fig 5), but this effect did not appear to be very pronounced within individual sets. Moreover, participants mostly disagree that albums need to end with high energy levels (proposition (e), Fig 7), and this is in line with the arousal levels found in our feature analyses (Fig 5). Lastly, participants showed contrasting opinions about tonal similarity of consecutive tracks (proposition (h), Fig 7), and our key-similarity analysis showed that this feature was randomly distributed throughout all albums and sets, with the exception of one pair of tracks (Fig 7).

However, some opinions expressed in the survey diverged from what was observed in our sequencing data. While many participants felt that consecutive tracks should not be too similar, but also not maximally different from each other (propositions (b) and (i) respectively, 7), overall feature dissimilarity values were not statistically different between control and experimental condition (Fig 6). Even though the majority of participants said that tracks should be somewhat different with regards to tempo (proposition (f), Fig 7), tempo dissimilarities were also not statistically distinct from the random distribution (Fig 6).

In an overview, dissimilarity ratings between consecutive songs seemed to be secondary to overall arches of energy, valence and loudness. Even though participants had some opinions about the optimal contrast between pairs of tracks, these opinions generally did not reflect on the way that albums were sequenced. The comparisons provided here should, however, be considered in light of the fact that terms used in our propositions do not necessarily reflect the same musical feature computed in our analyses. It is likely, for instance, that the expression energetic song is understood as some musical quality that is not well captured by the way that MIRtoolbox computes arousal.

In Eerola et al.’s model [28], arousal is positively correlated with loudness, as well as with loudness variations–computed as the mean and standard deviation of the signal’s root mean square, respectively–and negatively correlated with key clarity, which indicates how well the song conforms to one of the 12 keys in both major and minor modes. Valence, on the other hand, is negatively correlated with variations of loudness, and positively correlated with key clarity. In this model, both valence and arousal are also influenced by features that do not have direct interpretability, such as the entropy of the smoothed and collapsed spectrogram [28].

We do not know the extent to which these features, which have been shown to be relevant for the perception of emotion in music, can be linked to the terms that we employed in our survey. For a stronger analysis between survey opinions and sequencing regularities, future work in this area could apply more rigorous psychometric measurements, with satisfactory values of test-retest reliability and construct validity.

4.5 Limitations

4.5.1 Positional Regularity as a Conservative measure.

Our proposed measure of PR is conservative in the sense that it reflects the agreement of musicians towards a specific track. Therefore, high levels of PR are only obtained if participants agree about the specific track which should be positioned in the beginning of an album, for instance.

It is the case, however, that each set of tracks in our samples provided individuals with more than one option of songs with a high value of valence, arousal, loudness and tempo. If tracks were not categorized by their names, but rather by their feature ranges (e.g., high, mid, low), we would be likely to see higher levels of PR.

In fact, the clear U-shape of valence in set 1 (Fig 5) suggests that participants were choosing higher values of valence for the final positions. Despite this agreement, PR was relatively low for P5 of the same set. Still, we consider that the feature analysis is enough to convey the levels of agreement that music professionals have towards the positioning of tracks according to their feature characteristics, and PR continues to be a coherent and feature-agnostic method to measure agreement about specific tracks.

4.5.2 Number of tracks.

We chose to present participants only with albums that are composed of 5 tracks. Even though the sample of [12] comprised a significant number of albums with this exact length (6680 albums out of the 50000), and sequencing regularities were consistent throughout albums of different lengths, we do not know the extent to which our results would be generalizable to albums with distinct numbers of tracks. Future studies should address the effect that this variable may have in sequencing patterns.

4.5.3 Sequencing and music taste: an alternative hypothesis.

A limitation of our work is the fact that we did not ask how much individuals liked each song. It could be hypothesized, for instance, that the most liked tracks in our sample were those positioned in the first and last positions of the album. Individuals could, in fact, be biased towards their music preferences, and therefore our sequencing regularities would no longer be attributable to the features of the track, but rather to an agreement about the quality of the songs. If that was the case, however, we would expect to see a wider variability in feature statistics throughout sequence positions. Considering previous literature on music taste, personal preferences span a wide range of music features between individuals [59].

Furthermore, this study partially replicates our previous results [12], which also suggests that published albums tend to have higher levels of energy in their opening sections (here reflected on valence and loudness values). Again, it is possible that these results also reflect a link between likable songs and high levels of loudness and valence.

While we did not explore the influence of musical genre, background, and preferences on album sequencing, our results suggest that sequencing regularities may have some level of independence from these factors, as PRs were significantly higher than chance across an heterogeneous sample of participants and musical genres, and position was found to be a statistically significant factor in our mixed-design ANOVA.

In face of these limitations and alternative hypotheses, we still consider that our results offer strong empirical evidence for the general hypotheses that (1) musicians consider track sequencing an important aspect of album production, and (2) that music-related features influence the way that professionals choose to organize their albums.

4.5.4 Demographics and levels of musicianship.

In this study, we did not formulate a specific hypothesis regarding the potential influence of musicianship (e.g., years of experience, popularity, musical genre) or demographics (e.g., age, gender, nationality) on track sequencing regularities. However, it is possible that different musicians employ distinct sequencing strategies. Future research could explore this further, either from our available dataset, or by conducting new studies that target specific musician and demographic populations.

5 Conclusion

Professional musicians consider track sequencing an important aspect of album productions. When asked to determine the order of tracks, participants show high levels of agreement about which position particular songs should occupy. Overall, tempo follows an inverted U-shape, whith faster songs being positioned in the middle of the album, whereas loudness, valence and arousal follow an inverted pattern. Our findings may be of relevance to music producers, streaming platforms, and listeners engaged in curating and/or generating music albums and playlists.

Supporting information

S1. Track information.

Details of the musical tracks used in the study, including album names, artist names, dataset sources, and track correspondence. Includes participant musical experience data summarized in tables

(PDF)

pone.0316963.s001.pdf (51.6KB, pdf)
S2. Participants’ musical experience.

Mean and standard deviation (in parentheses) of albums recorded, number of concerts performed, and years of experience as professional musicians across the three track sets.

(PDF)

pone.0316963.s002.pdf (40.9KB, pdf)
S3. Detailed survey statistics.

Chi-square and Dip-test statistics and their corresponding p-values for each survey proposition.

(PDF)

pone.0316963.s003.pdf (38.9KB, pdf)

Data Availability

All relevant data are within the paper and its Supporting Information files or from https://osf.io/69n3x/.

Funding Statement

This research was funded by the Research Council of Finland, under the Center of Excellence in Music, Mind, Body and Brain (Project number: 346210).

References

  • 1.Salvador A, Drozdzal M, Giró-i Nieto X, Romero A. Inverse cooking: recipe generation from food images. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019, pp. 10453–62. [Google Scholar]
  • 2.Nunes JP, Grgic J, Cunha PM, Ribeiro AS, Schoenfeld BJ, de Salles BF, et al. What influence does resistance exercise order have on muscular strength gains and muscle hypertrophy? A systematic review and meta-analysis. Eur J Sport Sci. 2021;21(2):149–57. doi: 10.1080/17461391.2020.1733672 [DOI] [PubMed] [Google Scholar]
  • 3.Schell J. The art of game design: a book of lenses. Boca Raton: CRC Press; 2008. [Google Scholar]
  • 4.Lalitte P, Bigand E. Music in the moment? Revisiting the effect of large scale structures. Percept Mot Skills. 2006;103(3):811–28. doi: 10.2466/pms.103.3.811-828 [DOI] [PubMed] [Google Scholar]
  • 5.Garvey G. The art of sequencing. 2013. Available from: https://www.bbc.co.uk/sounds/play/b01q95y6
  • 6.Nills. The art of album sequencing. 2015. Available from: https://www.nilsguitar.com/the-art-of-album-sequencing/
  • 7.Ruoff M. How to make great playlists for your radio station. 2018. https://live365.com/blog/how-to-make-great-playlists-for-your-radio-station/ [Google Scholar]
  • 8.Scammell R. How to make a good playlist on Spotify for someone. 2019. Available from: https://www.openmicuk.co.uk/advice/how-to-make-a-good-spotify-playlist/ [Google Scholar]
  • 9.Hahn M. Album sequencing: How to give your releases perfect flow. 2018. Available from: https://blog.landr.com/album-sequencing/ [Google Scholar]
  • 10.Wensem C. How to approach album track order in the digital age. 2016. Available from: https://blog.sonicbids.com/how-to-approach-album-track-order-in-the-digital-age/ [Google Scholar]
  • 11.Sawyer J. How you can create the perfect party playlist. 2021. Available from: https://www.handofftech.com/post/perfect-party-playlist [Google Scholar]
  • 12.Neto P, Hartmann M, Luck G, Toiviainen P. The algorithmic nature of song-sequencing: statistical regularities in music albums. J New Music Res. 2024;1:1–15. [Google Scholar]
  • 13.Oliver N, Kreger-Stickles L. PAPA: physiology and purpose-aware automatic playlist generation. In: Proceedings of ISMIR. 2006, pp. 393–6. [Google Scholar]
  • 14.Garrido S, Schubert E, Bangert D. Musical prescriptions for mood improvement: an experimental study. Arts Psychother. 2016;51:46–53. doi: 10.1016/j.aip.2016.09.002 [DOI] [Google Scholar]
  • 15.Que Y, Zheng Y, Hsiao JH, Hu X. Exploring the effect of personalized background music on reading comprehension. In: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020, pp. 57–66. 10.1145/3383583.3398543 [DOI] [Google Scholar]
  • 16.Ikeda S, Oku K, Kawagoe K. Music playlist recommendation using acoustic-feature transitions. In: Proceedings of the Ninth International C* Conference on Computer Science & Software Engineering - C3S2E ’16, 2016, pp. 115–8. doi: 10.1145/2948992.2949005 [DOI] [Google Scholar]
  • 17.Cliff D. Hang the DJ: Automatic sequencing and seamless mixing of dance-music tracks. HP Laboratories; 2000, p. 104. [Google Scholar]
  • 18.Mahadik A, Milgir S, Patel J, Jagan VB, Kavathekar V. Mood based music recommendation system. Int J Eng Res Technol. 2021;10(6). [Google Scholar]
  • 19.Lartillot O, Toiviainen P, Eerola T. A MATLAB toolbox for music information retrieval. In: Data Analysis, Machine Learning, and Applications: Proceedings of the 31st Annual Conference of the Gesellschaft für Klassifikation EV, 2008, pp. 261–8. [Google Scholar]
  • 20.Senn O, Bechtold TA, Hoesl F, Kilchenmann L. Taste and familiarity affect the experience of groove in popular music. Musicae Sci. 2019;25(1):45–66. doi: 10.1177/1029864919839172 [DOI] [Google Scholar]
  • 21.Aljanaki A, Wiering F, Veltkamp RC. Studying emotion induced by music through a crowdsourcing game. Inf Process Manage. 2016;52(1):115–28. doi: 10.1016/j.ipm.2015.03.004 [DOI] [Google Scholar]
  • 22.Bartsch MA, Wakefield GH. Audio thumbnailing of popular music using chroma-based representations. IEEE Trans Multimedia. 2005;7(1):96–104. doi: 10.1109/tmm.2004.840597 [DOI] [Google Scholar]
  • 23.Justel N, Diaz Abrahan V, Moltrasio J, Rubinstein W. Differential effect of music on memory depends on emotional valence: an experimental study about listening to music and music training. Cogent Psychol. 2023;10(1). doi: 10.1080/23311908.2023.2234692 [DOI] [Google Scholar]
  • 24.Carr SM, Rickard NS. The use of emotionally arousing music to enhance memory for subsequently presented images. Psychol Music. 2016;44(5):1145–57. doi: 10.1177/0305735615613846 [DOI] [Google Scholar]
  • 25.Putkinen V, Nazari-Farsani S, Seppälä K, Karjalainen T, Sun L, Karlsson HK, et al. Decoding music-evoked emotions in the auditory and motor cortex. Cereb Cortex. 2021;31(5):2549–60. doi: 10.1093/cercor/bhaa373 [DOI] [PubMed] [Google Scholar]
  • 26.Palazzi A, Wagner Fritzen B, Gauer G. Music-induced emotion effects on decision-making. Psychol Music. 2018;47(5):621–43. doi: 10.1177/0305735618779224 [DOI] [Google Scholar]
  • 27.Fritz T, Jentschke S, Gosselin N, Sammler D, Peretz I, Turner R, et al. Universal recognition of three basic emotions in music. Curr Biol. 2009;19(7):573–6. doi: 10.1016/j.cub.2009.02.058 [DOI] [PubMed] [Google Scholar]
  • 28.Eerola T, Lartillot O, Toiviainen P. Prediction of multidimensional emotional ratings in music from audio using multivariate regression models. In: Proceedings of the International Conference on Music Information Retrieval, Kobe, 2009. [Google Scholar]
  • 29.Lekamge S, Marasinghe A, Kalansooriya P, Nomura S. A visual interface for emotion based music navigation using subjective and objective measures of emotion perception. Int J Affective Eng. 2016;15(2):205–11. doi: 10.5057/ijae.ijae-d-15-00039 [DOI] [Google Scholar]
  • 30.Tarvainen J, Laaksonen J, Takala T. Film mood and its quantitative determinants in different types of scenes. IEEE Trans Affective Comput. 2020;11(2):313–26. doi: 10.1109/taffc.2018.2791529 [DOI] [Google Scholar]
  • 31.Putkinen V, Zhou X, Gan X, Yang L, Becker B, Sams M, et al. Bodily maps of musical sensations across cultures. Proc Natl Acad Sci U S A. 2024;121(5):e2308859121. doi: 10.1073/pnas.2308859121 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Husain G, Thompson WF, Schellenberg EG. Effects of musical tempo and mode on arousal, mood, and spatial abilities. Music Percept. 2002;20(2):151–71. doi: 10.1525/mp.2002.20.2.151 [DOI] [Google Scholar]
  • 33.Lin H-M, Kuo S-H, Mai TP. Slower tempo makes worse performance? The effect of musical tempo on cognitive processing speed. Front Psychol. 2023;14:998460. doi: 10.3389/fpsyg.2023.998460 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Bannister S. A vigilance explanation of musical chills? Effects of loudness and brightness manipulations. Music Sci. 2020;3. doi: 10.1177/2059204320915654 [DOI] [Google Scholar]
  • 35.Kohn D, Eitan Z. Moving music: correspondences of musical parameters and movement dimensions in children’s motion and verbal responses. Music Percept. 2016;34(1):40–55. doi: 10.1525/mp.2016.34.1.40 [DOI] [Google Scholar]
  • 36.Toiviainen P, Luck G, Thompson MR. Embodied meter: hierarchical eigenmodes in music-induced movement. Music Percept. 2010;28(1):59–70. doi: 10.1525/mp.2010.28.1.59 [DOI] [Google Scholar]
  • 37.Krumhansl CL. Cognitive foundations of musical pitch. Oxford University Press; 1990. [Google Scholar]
  • 38.Russell JA. A circumplex model of affect. J Pers Soc Psychol. 1980;39(6):1161–78. doi: 10.1037/h0077714 [DOI] [Google Scholar]
  • 39.Liu Z, Xu A, Guo Y, Mahmud JU, Liu H, Akkiraju R. Seemo: a computational approach to see emotions. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. New York: ACM Press; 2018, pp. 1–12. [Google Scholar]
  • 40.Reagan AJ, Mitchell L, Kiley D, Danforth CM, Dodds PS. The emotional arcs of stories are dominated by six basic shapes. EPJ Data Sci. 2016;5(1). doi: 10.1140/epjds/s13688-016-0093-1 [DOI] [Google Scholar]
  • 41.Alberhasky M, Durkee PK. Songs tell a story: the arc of narrative for music. PLoS One. 2024;19(5):e0303188. doi: 10.1371/journal.pone.0303188 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Koněcni VJ. Elusive effects of artists’ messages. Adv Psychol. 1984;19:71–93. [Google Scholar]
  • 43.Karno M, Konečni VJ. The effects of structural interventions in the first movement of Mozart’s symphony in G minor K. 550 on aesthetic preference. Music Percept. 1992;10(1):63–72. doi: 10.2307/40285538 [DOI] [Google Scholar]
  • 44.Gotlieb H, Konečni VJ. The effects of instrumentation, playing style, and structure in the Goldberg variations by Johann Sebastian Bach. Music Percept. 1985;3(1):87–101. doi: 10.2307/40285323 [DOI] [Google Scholar]
  • 45.Cook N. The perception of large-scale tonal closure. Music Percept. 1987;5(2):197–205. doi: 10.2307/40285392 [DOI] [Google Scholar]
  • 46.Eitan Z, Granot RY. Growing oranges on Mozart’s apple tree: “Inner Form” and aesthetic judgment. Music Percept. 2008;25(5):397–418. doi: 10.1525/mp.2008.25.5.397 [DOI] [Google Scholar]
  • 47.Rolison JJ, Edworthy J. The role of formal structure in liking for popular music. Music Percept. 2011;29(3):269–84. doi: 10.1525/mp.2012.29.3.269 [DOI] [Google Scholar]
  • 48.Krumhansl CL, Cuddy LL. Music perception. Berlin: Springer; 2010, pp. 51–87. [Google Scholar]
  • 49.Bharucha JJ, Stoeckig K. Priming of chords: spreading activation or overlapping frequency spectra? Percept Psychophys. 1987;41(6):519–24. doi: 10.3758/bf03210486 [DOI] [PubMed] [Google Scholar]
  • 50.Vuvan DT, Prince JB, Schmuckler MA. Probing the minor tonal hierarchy. Music Percept. 2011;28(5):461–72. doi: 10.1525/mp.2011.28.5.461 [DOI] [Google Scholar]
  • 51.Neto PASO, Cui A-X, Rojas P, Vanzella P, Cuddy LL. Not just cents: physical and psychological influences on interval perception. Psychomusicol Music Mind Brain. 2021;31(1):49–58. doi: 10.1037/pmu0000272 [DOI] [Google Scholar]
  • 52.Koelsch S, Jentschke S, Sammler D, Mietchen D. Untangling syntactic and sensory processing: an ERP study of music perception. Psychophysiology. 2007;44(3):476–90. doi: 10.1111/j.1469-8986.2007.00517.x [DOI] [PubMed] [Google Scholar]
  • 53.Patterson B. Musical dynamics. Sci Am. 1974;231(5):78–95. [Google Scholar]
  • 54.Geringer JM. Continuous loudness judgments of dynamics in recorded music excerpts. J Res Music Educ. 1995;43(1):22–35. doi: 10.2307/3345789 [DOI] [Google Scholar]
  • 55.Geringer JM, Madsen CK. Gradual tempo change and aesthetic responses of music majors. Int J Music Educ. 2003;os-40(1):3–15. doi: 10.1177/025576140304000102 [DOI] [Google Scholar]
  • 56.Olsen KN. Intensity dynamics and loudness change: a review of methods and perceptual processes. Acoust Aust. 2014;42(3):45–66. [Google Scholar]
  • 57.Kimura Y. Effects of incrementally increasing and decreasing music intensity on the loudness preference and behaviors of college students performing either gross or fine motor activity. Psychophysiology. 2004;44(3):476–90. [Google Scholar]
  • 58.Dange A, Warm JS, Weiler EM, Dember WN. Loudness adaptation: resolution of a psychophysical enigma. J Gen Psychol. 1993;120(3):217–43. doi: 10.1080/00221309.1993.9711145 [DOI] [PubMed] [Google Scholar]
  • 59.Rentfrow PJ, Gosling SD. The do re mi’s of everyday life: the structure and personality correlates of music preferences. J Pers Soc Psychol. 2003;84(6):1236–56. doi: 10.1037/0022-3514.84.6.1236 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Seung-Goo Kim

9 Jan 2025

PONE-D-24-54145An album is a story: feature arcs in sequences of tracksPLOS ONE

Dear Dr. Neto,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Feb 23 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Seung-Goo Kim, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1.Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, we expect all author-generated code to be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

3. Thank you for stating the following financial disclosure:

“This research was funded by the Research Council of Finland, under the Center of Excellence in Music, Mind, Body and Brain (Project number: 346210).”

Please state what role the funders took in the study.  If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

If this statement is not correct you must amend it as needed.

Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf.

4. Please note that your Data Availability Statement is currently missing the repository name and/or the DOI/accession number of each dataset OR a direct link to access each database. If your manuscript is accepted for publication, you will be asked to provide these details on a very short timeline. We therefore suggest that you provide this information now, though we will not hold up the peer review process if you are unable.

5. When completing the data availability statement of the submission form, you indicated that you will make your data available on acceptance. We strongly recommend all authors decide on a data sharing plan before acceptance, as the process can be lengthy and hold up publication timelines. Please note that, though access restrictions are acceptable now, your entire data will need to be made freely accessible if your manuscript is accepted for publication. This policy applies to all data except where public deposition would breach compliance with the protocol approved by your research ethics board. If you are unable to adhere to our open data policy, please kindly revise your statement to explain your reasoning and we will seek the editor's input on an exemption. Please be assured that, once you have provided your new statement, the assessment of your exemption will not hold up the peer review process.

Additional Editor Comments:

One of the PLOS's core values is transparency. As an academic editor, it is my responsibility to ensure that all authors of the articles published in PLOS ONE have made the data underlying their findings fully available. Please take a look at our criteria for publication (https://journals.plos.org/plosone/s/criteria-for-publication).

The current submission does not include where the data can be retrieved. The authors' input is simply a copy-and-paste of the sample text: 'The data underlying the results presented in the study are available from XXX'. Please see our resource (https://journals.plos.org/plosone/s/data-availability) to understand what constitutes the 'minimal data set' and ideas about sharing data.

Once all requirements for submission are met, then I will consider sending it out to reviewers for further assessment.

[Note: HTML markup is below. Please do not edit.]

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2025 Jul 3;20(7):e0316963. doi: 10.1371/journal.pone.0316963.r002

Author response to Decision Letter 1


28 Jan 2025

We have made all the requested changes and additions to our submission. Detailed responses to each point raised by the editor are available in the attached file "Response to reviewers".

Attachment

Submitted filename: rebuttal_letter.pdf

pone.0316963.s004.pdf (195.6KB, pdf)

Decision Letter 1

Seung-Goo Kim

20 Feb 2025

PONE-D-24-54145R1An album is a story: feature arcs in sequences of tracksPLOS ONE

Dear Dr. Neto,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please note that all reviewers evaluated the study highy positive while raising constructive points that would substantiate the current manusscript. In addition, the Academic Editor also added methodological points. All raised points must be addressed in a point-by-point manner in the rebuttal.

Please submit your revised manuscript by Apr 06 2025 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Seung-Goo Kim, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments:

In addition to the reviewers' comments, I would like to add a few methodological points.

1. [line 169] ANOVA only tests for the inequality of group means and does not support the findings of U or inverted-U patterns. Given that this is one of the main conclusions, it needs to be explicitly tested. To test the U or inverted U patterns, a quadratic fit can be compared with a linear fit using an F-test (e.g., general linear model).

2. [line 157] It needs to be clarified that "valence" and "arousal" are only estimates (or predictions) from the MIR toolbox and not human-rated measures. The cited studies are not the validation of the MIR toolbox estimates (which should have been presented in this context), but the investigation of the psychological constructs ("valence" and "arousal") that were self-reported by human participants. However, readers may not know how good (or poor) the MIR toolbox estimates are in predicting human ratings. It must be clarified that the "valence" and "arousal" are not human ratings but models' predictions. Also, if possible, provide validation studies that support the validity of the MIR toolbox in terms of those features.

3. [Equation 2] The averaged l2-norm of the first-order difference of the feature vectors (or scalars) was defined as an "overall dissimilarity" measure and tested against the control condition (i.e., H0: random changes) [Figure 6]. But, it seems that the changes between particular positions were reported in the text (e.g., "P1/P2 = 3.15" [line 233]), not the "overall dissimilarity" measure, which should have been averaged across all transitions. This needs to be consistently reported.

4. [line 183] When defining a distance metric based on Pearson correlation, 1-r is more conventional for its range will be positive [0, 2].

5. [line 193] It is unclear why the authors want to test "whether or not participants leaned toward one or more segments of the scale" [line 203]. As shown in [line 97], the survey was carried out to find what are the factors that the musicians would say. For example, "Do they think a good sequence is important or not". Now the authors are answering a different question: "Do they think sequence is uniformly important and unimportant?" But why do you want to know this instead of the first question? Is this a sign of p-hacking?

- Generally binning needs to be better justified. The non-Gaussianity of the ordinal measure such as the Likert scale can be still dealt with by other non-parametric alternative approaches such as the rank-sum test.

- Even if the binning can be justified, why unequal segments as [45%, 10%, 45%] instead of not [33%, 33%, 33%]?

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #2: (No Response)

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

Reviewer #3: (No Response)

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Dear Authors,

I had the opportunity to review your manuscript, “An album is a story: Feature arcs in sequences of tracks”. The manuscript empirically investigates whether there is truth to the claim that musicians implement consistent sequencing principles when arranging tracks in a musical album. The manuscript reveals statistically relevant patterns of track sequencing, suggesting that this process is based on both audio features and the position of a track within an album.

Overall, my impression of the manuscript is positive. However, I have noted down a few comments which I invite you to further consider. I have explained these comments in a point-by-point format, below:

Introduction

---

[1] I find the topic of the manuscript quite interesting, and while it was easy for me to understand why this topic is relevant from a theoretical standpoint (the lack of systematic investigations into the matter; p. 2, lines 32-33), I struggled to see the practical relevance of it. Would understanding specific sequencing principles help an album’s total sales or streams? A similar practical point is mentioned in the Discussion with regards to story arcs in literary works (p. 9, lines 328-329), but not in the Introduction. I feel that the manuscript would benefit from a short elaboration of why understanding album sequencing principles would be practically relevant.

Methods

---

[2] The standard deviations for participants’ average concerts and recorded albums suggest that the variance within these two demographics is relatively high (p. 2, lines 54-56). Were there any differences in sequencing principles between participants who were more of less experienced in album releases or concerts?

[3] I felt that the Methods section missed a few small, but relevant, details. For instance, while I can understand the reasoning behind using jazz and classical music to control for potential instrumentation effects in sequencing practices, I missed why it was necessary to come up with 3 sets of 5 songs in the first place. What was the purpose of comparing the 3 sets?

[4] Next, in the selection of stimuli for set 1, what made a 60 second segment “representative” of a particular track (p. 3, lines 87-88)? Compositionally, e.g., in terms of musical features, were these 60 second segments similar to those in sets 2 and 3?

[5] Several measures of dissimilarity between consecutive tracks were calculated (p. 5, lines 169-191) based on different (combinations) of audio features. However, their purpose was not immediately clear to me. Could you please elaborate what information these dissimilarity metrics add beyond the initial ANOVA of feature values?

Furthermore, with regards to the computation of these dissimilarity measures, why was overall audio feature and tempo dissimilarity calculated via Euclidian distance, while key strength profile (dis)similarity was calculated via Pearson correlations? I feel that you must have had a clear reason for doing this, and it would be helpful if this reasoning were further explained.

Results

---

[6] The manuscript reports a statistically significant interaction effect between position and feature (p. 7, lines 229-230). Were any post-hoc tests conducted to further examine this statistically significant interaction?

Discussion

---

[7] Perhaps this is a bit of a layman’s perspective on the matter, but I feel that songs are rarely experienced as individual features, rather a combination multiple. Thus, while the results and subsequent discussion points highlight the role of individual audio features in track sequencing principles, I wonder whether musicians deliberately consider individual features as opposed to a combination of several in their sequencing choices. Is this something that can be inferred from the data obtained for the manuscript, and, if so, can the Discussion be expanded to address this?

Minor suggestions

---

[8] This is a slightly nit-picky comment, but if recruitment was done in three separate stages (p. 4, lines 115-116), one per set of songs, is it still accurate to say that participants were randomly assigned to each set of songs (p. 3, line 92)? Here too, were there any pronounced differences in demographics between participants at each stage?

[9] For consistency, I suggest changing the “Survey” header (p. 7, line 240) to “Questionnaire” (p. 3, line 97; or the other way around).

Reviewer #2: This experiment looks at whether there is a particular sequence to how tracks unfold on a musical album. The authors had 130 participants sequence various album sets consisting of 5 tracks. The participants also answered a survey about their views on issues related to sequencing. Overall, the participants showed broad agreement on the placement/sequence of the songs. When analyzing the musical features of the songs, the researchers noticed that tempo follows an inverted U Shape while valence, arousal, and loudness exhibited the opposite pattern. The authors note the general trajectory of album songs is like what is seen in the literary domain, and I found this connection to other art forms quite intriguing.

In general, the paper is well-written, and the experiment seems well-designed. I am unable to speak much to the specific statistical analyses used, but based on the explanation, these approaches seem valid. The paper was an interesting read, and I only have a couple of minor comments, although these do not detract from the overall paper.

Participants: Could more details be given about these professional musicians and music producers who were recruited? For example, what type of musical training did they have? I like that the track sequencing focused on little-known instrumental jazz and classical music, however, I would like to know more about these participants. Do they have a lot of experience with jazz or classical music? Do we think this may alter how they sequence the tracks?

I understand using 5 tracks for this experiment as that seems like a reasonable amount for participants to handle sequencing. However, from an ecological validity standpoint, most albums have more than that, say 10-20 tracks. How did the authors settle on 5 tracks per set?

I really liked using the survey to investigate agreement with statements about perspectives on sequencing. I think future studies might look at WHY participants feel this way. Do we think there is something “universal” or general about auditory sequencing? In other words, is there a certain “story” that is told when sequencing tracks? Also, I imagine results may vary a lot if these were songs with lyrics. I think some of this could be mentioned, even if it’s exploratory, in the discussion to ground why it’s important to think about album sequencing (i.e. the significance).

Reviewer #3: Dear Authors,

Thank you for submitting your article to PLOS ONE. I particularly enjoyed reading about a topic I found very interesting.

The manuscript presents a well-structured and methodologically sound study, addressing an interesting intersection between music perception and auditory science. The use of professional musicians and the inclusion of different musical genres enhance the ecological validity of the findings. The authors provide clear statistical analyses, supported by ANOVA and bootstrapping approaches, ensuring robust results. However, some aspects could benefit from further clarification, particularly regarding the interpretation of feature dissimilarity and the potential impact of individual track preferences. Additionally, while the authors acknowledge limitations, the discussion of how sequencing preferences might vary across musical genres could be expanded.

Please find some comments below:

1. Abstract:

Clarify how the “inverted U-shape” of tempo and the opposite pattern of valence, arousal, and loudness were measured. It might be helpful to mention the use of MIRtoolbox here to strengthen the methodological transparency.

2. Introduction (Page 2, Line 40-44):

The introduction mentions a previous study involving 51,010 published albums. It would be useful to briefly explain whether the current study expands on those findings or addresses a different aspect of track sequencing.

3. Materials and Methods (Page 3, Line 62-72):

The rationale for selecting only jazz and classical genres is well-explained, but the authors might consider addressing whether findings could generalize to more popular or rhythmically consistent genres, such as pop, rock, or EDM.

4. Sequencing Analysis (Page 4, Line 130-140):

The calculation of Positional Regularity (PR) is detailed, but it might be clearer to provide an example illustrating how PR reflects consensus among participants.

5. Results (Page 7, Line 221-230):

While the ANOVA results are statistically significant, the η² values suggest small effect sizes. It would be helpful to discuss whether these effects, while significant, are practically meaningful in real-world album production.

6. Discussion (Page 8, Line 272-283):

The comparison between emotional arcs in album sequencing and literary works is compelling but somewhat speculative. Consider adding a disclaimer emphasizing that while the patterns are similar, the underlying cognitive mechanisms might differ.

7. Conclusion (Page 12, Line 464-469):

The conclusion effectively summarizes the findings but could benefit from emphasizing the study’s implications for music producers and platforms like Spotify or Apple Music, where track sequencing might influence user experience.

Finally: Please carefully check the spacing within parentheses and remove unnecessary spaces (e.g., p.1, lines 33 and 49, etc.).

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Karen Chan Barrett

Reviewer #3: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2025 Jul 3;20(7):e0316963. doi: 10.1371/journal.pone.0316963.r004

Author response to Decision Letter 2


24 Mar 2025

Dear Editor and Reviewers,

We were very pleased with your comments, and I believe they helped us make significant improvements to our paper. In the document "Response to reviewers" you will find our answers to each one of the issues that you raised.

A practical note: The page and line numbers in your references to our manuscript did not align with our local document, likely due to differences in LaTeX rendering. To ensure clarity, we refer to Section. Subsection. Paragraph number in our responses, all highlighted in yellow.

Best regards,

P.N.

Attachment

Submitted filename: response_reviewers.pdf

pone.0316963.s005.pdf (307.5KB, pdf)

Decision Letter 2

Seung-Goo Kim

15 Apr 2025

An album is a story: feature arcs in sequences of tracks

PONE-D-24-54145R2

Dear Dr. Neto,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Seung-Goo Kim, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #2: I have read the revised manuscript and believe that the critiques of all three reviewers were aimed at ensuring we understood the significance/application of this study, as well as situating this study among existing information and previous research. The authors responded effectively to each of my critiques.

I also like the newly-added text in the manuscript about 1) the previous study in the introduction so that we now know why certain decisions were made in this study since it builds upon that one, 2) explanation of High vs. Low PR scores as well details about the feature analysis, 3) why they analyzed the survey data the way they did and 4) addressing reviewer critiques through the discussion and limitations sections. I think the manuscript is much improved with these additions, and I am satisfied with the resubmission.

Reviewer #3: Dear Authors,

Thank you for your thorough and thoughtful revisions. After reviewing the revised manuscript, I am pleased to note that you have addressed all the concerns raised. The improvements have enhanced the clarity and quality of the paper. I have no further comments and recommend the manuscript for publication.

Best regards,

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Karen Chan Barrett

Reviewer #3: No

**********

Acceptance letter

Seung-Goo Kim

PONE-D-24-54145R2

PLOS ONE

Dear Dr. Neto,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

You will receive further instructions from the production team, including instructions on how to review your proof when it is ready. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few days to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Seung-Goo Kim

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1. Track information.

    Details of the musical tracks used in the study, including album names, artist names, dataset sources, and track correspondence. Includes participant musical experience data summarized in tables

    (PDF)

    pone.0316963.s001.pdf (51.6KB, pdf)
    S2. Participants’ musical experience.

    Mean and standard deviation (in parentheses) of albums recorded, number of concerts performed, and years of experience as professional musicians across the three track sets.

    (PDF)

    pone.0316963.s002.pdf (40.9KB, pdf)
    S3. Detailed survey statistics.

    Chi-square and Dip-test statistics and their corresponding p-values for each survey proposition.

    (PDF)

    pone.0316963.s003.pdf (38.9KB, pdf)
    Attachment

    Submitted filename: rebuttal_letter.pdf

    pone.0316963.s004.pdf (195.6KB, pdf)
    Attachment

    Submitted filename: response_reviewers.pdf

    pone.0316963.s005.pdf (307.5KB, pdf)

    Data Availability Statement

    All relevant data are within the paper and its Supporting Information files or from https://osf.io/69n3x/.


    Articles from PLOS One are provided here courtesy of PLOS

    RESOURCES