
2022 Jan 21;119(4):e2110406119. doi: 10.1073/pnas.2110406119

Copyright © 2022 the Author(s). Published by PNAS.

This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND).

Fig. 3. Comparison of stories prompted by different music excerpts at different geographic locations. For each of the 32 excerpts, we calculated the pairwise cosine similarities between the TF-IDF vectors for narrative documents collected at each geographic location. We excluded same-excerpt comparisons and instead examined different-excerpt similarity based on the music tradition an excerpt belongs to. The nine box-and-whisker plots depict the median value and quantiles of the distribution of different-excerpt similarity values in each comparison between geographic locations. Individual data points (diamonds) correspond to document similarity values that exceed 1.5× the interquartile range (IQR). For each location comparison, we used Welch’s t test to compare the different-excerpt similarity distributions within and between music traditions. Black lines spanning two distributions at the top of the figure represent significant t tests relative to the permuted difference thresholds. The long solid and dotted lines depict the 95th percentile and median value of the control narrative distributions and represent estimates of the maximum and average similarity expected between unprompted stories by US college undergraduates, respectively. These values serve as an additional reference point and not as a threshold for significance.
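
The comparison described in the caption can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The whitespace tokenizer, the smoothed IDF formula, the function names, and the toy documents are all assumptions made for illustration, and the permutation-based significance thresholds are not reproduced. The sketch builds TF-IDF vectors, computes cosine similarities between documents from two locations while skipping same-excerpt pairs, splits the values by whether the two excerpts share a music tradition, and computes Welch's t statistic for the two groups.

```python
import math
from collections import Counter
from statistics import mean, variance


def tfidf_vectors(docs):
    """Return one sparse TF-IDF dict per document (assumed smoothed IDF)."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: count * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def different_excerpt_similarities(loc_a, loc_b):
    """Split cross-location similarities into within/between tradition.

    Each document is a tuple (excerpt_id, tradition, text).
    Pairs that share an excerpt_id are excluded.
    """
    vectors = tfidf_vectors([doc[2] for doc in loc_a + loc_b])
    vecs_a, vecs_b = vectors[:len(loc_a)], vectors[len(loc_a):]
    within, between = [], []
    for (ex_a, trad_a, _), vec_a in zip(loc_a, vecs_a):
        for (ex_b, trad_b, _), vec_b in zip(loc_b, vecs_b):
            if ex_a == ex_b:
                continue  # same-excerpt comparison: excluded
            target = within if trad_a == trad_b else between
            target.append(cosine_similarity(vec_a, vec_b))
    return within, between


def welch_t(a, b):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom."""
    se_a, se_b = variance(a) / len(a), variance(b) / len(b)
    t_stat = (mean(a) - mean(b)) / math.sqrt(se_a + se_b)
    df = (se_a + se_b) ** 2 / (
        se_a ** 2 / (len(a) - 1) + se_b ** 2 / (len(b) - 1)
    )
    return t_stat, df


# Hypothetical toy data: two traditions, two excerpts each, two locations.
LOCATION_A = [
    (1, "tradition_x", "a hero rides across the plains at dawn"),
    (2, "tradition_x", "a rider chases bandits across the plains"),
    (3, "tradition_y", "a quiet garden with birds and a flowing river"),
    (4, "tradition_y", "birds sing near a river in the morning"),
]
LOCATION_B = [
    (1, "tradition_x", "a cowboy rides toward the sunset on the plains"),
    (2, "tradition_x", "bandits flee and a hero follows them"),
    (3, "tradition_y", "a peaceful river flows through a garden"),
    (4, "tradition_y", "morning light and birds above the water"),
]

within, between = different_excerpt_similarities(LOCATION_A, LOCATION_B)
t_stat, df = welch_t(within, between)
print(f"within n={len(within)}, between n={len(between)}")
print(f"Welch t={t_stat:.3f}, df={df:.2f}")
```

Welch's test is used here, as in the caption, because it does not assume that the within-tradition and between-tradition distributions have equal variances or equal sample sizes.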