Abstract
Here we present an annotation of locations and temporal progression depicted in the movie “Forrest Gump”, as an addition to a large public functional brain imaging dataset ( http://studyforrest.org). The annotation provides information about the exact timing of each of the 870 shots, and the depicted location after every cut with a high, medium, and low level of abstraction. Additionally, four classes are used to distinguish the differences of the depicted time between shots. Each shot is also annotated regarding the type of location (interior/exterior) and time of day. This annotation enables further studies of visual perception, memory of locations, and the perception of time under conditions of real-life complexity using the studyforrest dataset.
Keywords: studyforrest, annotation, natural stimulation, scene perception, time perception, spatial cognition
Introduction
Cognitive neuroimaging research is moving towards studying brain behavior under conditions of real-life-like complexity, and motion pictures are being utilized with increasing frequency as stimuli in “neurocinematics” studies 1. What sets motion pictures apart from other dynamic naturalistic stimuli is that they are more likely to evoke time-locked response patterns in a larger portion of the brain while retaining synchrony across multiple individuals who are experiencing the same movie 2, 3. One likely reason for this is the structure of movies. They are typically not prolonged, contiguous captures of an environment from a first person perspective, but rather they are carefully assembled, using “cuts”, from hundreds of short sequences shot from a variety of perspectives 4. These cuts are sharp discontinuities in the sensory input that require all viewers to re-assess the depicted environment in order to perform a cognitive re-orientation in fictional space and time. This re-orientation can be complex and involve a large bandwidth of cognitive processes: interpretation of contextual cues for detection of familiar settings, retrieval of prior knowledge from memory, discovery of change in locales and depicted characters. Consequently movies, and their cuts in particular, offer an excellent instrument to study complex, concurrent, real-life cognition.
In this study, we focus on spatial and temporal viewer re-orientation, and, to this end, describe changes in depicted location and time for all cuts in the motion picture “Forrest Gump”. This movie is the core stimulus of the studyforrest project ( http://studyforrest.org). Two fMRI datasets are publicly available: 1) participants listening to an audio-movie version 5 and 2) a subset of the original participants watching the audio-visual movie with simultaneous eye tracking 6. Additional imaging data and movie annotations are available 7, 8, including an individual localization of the parahippocampal place area 9 that has been implicated in spatial perception and scene processing 10.
This new annotation extends the available knowledge about the structure of this complex natural stimulus and enriches the overall studyforrest dataset. These data can be used to investigate the formation of a representation of viewer location and the perception of (speeded or negative) temporal progression in the movie stimulus. For any study focusing on other aspects of real-life cognition, these new data can serve as additional confound measures describing key properties of major building blocks of this movie stimulus.
Materials and methods
Stimulus
The annotated stimulus was a slightly shortened ( ≈2 h) version of the movie Forrest Gump (R. Zemeckis, Paramount Pictures, 1994) with dubbed German soundtrack that is identical to the audio-visual movie annotated in 8. Further details on this particular movie cut, and how to reproduce it from commercially available sources, are available in 6.
Table 1. Example annotations for four shots at the beginning of the movie.
time | major
location |
setting | locale | int/ext | flow of
time |
ToD |
---|---|---|---|---|---|---|
300.56 | Greenbow
Alabama |
doctor’s
office |
doctor’s
office |
ext | 0 | day |
311.96 | Greenbow
Alabama |
main street | crossroads | ext | + | day |
318.28 | United
States |
flashback
countryside |
flashback
countryside |
ext | - | day |
343.04 | Greenbow
Alabama |
main street | in front of
barbershop |
ext | ++ | day |
Annotation procedure
First, the movie was explored by two people, one of whom has an academic background in documentary film making, in order to generate a consistent list of labels for depicted and recurring locations.
Subsequently, the actual annotation was performed by the first author using a multi-pass strategy. The movie was manually inspected frame-by-frame to determine the location of cuts (using the video editor Shotcut v16.02.01). For each new shot (sequence between two cuts), a number of properties (described below) were discerned and entered into a table. A total of four passes were performed by the same observer in order to validate the annotation.
Data legend
The annotation table contains one line per shot and seven columns: 1) a shot’s start time, 2) a label for the shot’s major location, 3) a label for the setting within the location, 4) a label for the locale within the setting, 5) a flag indicating an interior or exterior setting, 6) a label for the type of temporal progression with respect to the previous shot, and 7) a label for the time of day. Further details are provided in the following sections. The respective column header labels are given in parenthesis.
Shot start time ( time)
A shot’s start time is defined as the onset time of the first video frame of a shot after a cut. Time stamps a provided in seconds of movie onset.
Location
Location was coded with three labels, each describing the depicted scenery with an increasing level of detail.
Major location ( major_location) provides a coarse identification at the level of a town, county, or region where the respective story is taking place. Examples are: “Greenbow” or “Vietnam”.
Setting ( setting) further details the location by distinguishing places at the same major location, but are not in direct sight of each other. For example, Forrest Gump’s elementary school and the high school’s football field are both in Greenbow, Alabama but are not part of the same setting. A switch from one setting to another is typically synonymous with a transition to a new scene in a cinematographic sense. If the camera switched settings within a scene, the annotation deviates from the screenplay to make explicit the switch to another setting.
Locale ( locale) subdivides settings into distinguishable locales. Indoors, a locale is congruent with a particular room enclosed by walls. For example, Forrest Gump’s bedroom, the corridor downstairs, and the corridor upstairs are three different rooms inside the Gumps’ house (setting) on the Gumps’ property (major location). Outdoors, locales were distinguished when they were separated by a logical boundary, substantial distance, or shared no discernible landmarks. For example, the glade at the river and the location of the wounded Bubba are two different locales in the embattled jungle (setting) in Vietnam (major location). A locale’s label is identical to its setting label when only one locale is depicted for that setting.
Interior or exterior ( int_or_ext)
This flag indicates whether a particular location is an open (“ ext”) or enclosed space (“ int”), such as a building or a vehicle.
Temporal progression ( flow_of_time)
This label indicates the depicted progression of time between the previous and the current shot. Four categories were distinguished: “ -” labels a flashback, or jump into the past, independent of the temporal distance; “ 0” indicates no noticeable break in the ongoing stream of time, for example a sole change of viewing perspective; “ +” represents noticeable jumps in time, ranging from several seconds to about one or two hours; and lastly “ ++” marks major time jumps from several hours (e.g. night vs. day) to several years.
Time of day ( time_of_day)
This flag indicates whether a scene is at least partially illuminated by sunlight. Consequently, daytime and twilight (early sunrises and late sun settings) are labeled as “ day”. If sunlight is entirely missing, the time of day is coded as “ night”.
Dataset content
The released annotation is a single, text-based, comma-separated-value (CSV) formatted table ( Dataset 1 ).
The source code for all descriptive statistics included in this paper is available in Dataset 2 (Python script).
Dataset validation
To check for human error in the cut time annotation, timings were compared to the results of an automatic detection algorithm and any deviation was manually verified.
In summary, the shortened version of the movie comprises 870 shots (duration: min=0.48 s, max=151.08 s, median=4.92 s, SD=10.86 s). There are 612 shots depicting outdoor locations and 256 interior shots. Most shots take place during daytime (706 day vs. 162 night). The majority of cuts involve no noticeable discontinuities of depicted time (640), but there are 61 small and 135 large time jumps, as well as 32 flashbacks.
Table 2 provides information on the portrayal of unique locations in the movie.
Table 2. Descriptive statistics for all three levels of location annotation.
major locations | settings | locales | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
number of unique | 21 | 95 | 147 | |||||||||
min | med. | mean | max | min | med. | mean | max | min | med. | mean | max | |
number of shots | 1 | 14 | 41.3 | 195 | 1 | 2 | 9.1 | 104 | 1 | 2 | 5.9 | 58 |
number of
consecutive shots |
1 | 2 | 7.3 | 70 | 1 | 2 | 4.7 | 69 | 1 | 1 | 3.2 | 32 |
times revisited | 0 | 1 | 4.7 | 21 | 0 | 0 | 0.9 | 20 | 0 | 0 | 0.9 | 20 |
Data and software availability
F1000Research: Dataset 1. CSV table with of depicted locations, and temporal progression in the motion picture “Forrest Gump”, 10.5256/f1000research.9536.d134823 11
F1000Research: Dataset 2. Python script to compute all descriptive statistics presented in the Data Note manuscript from the released annotations, 10.5256/f1000research.9536.d134824 12
Acknowledgements
We are grateful to Daniel Kottke for cross-checking the timing of the cuts in the movie using an automated detection routine, and Alex Waite for editing this manuscript. We would also like to thank Gavin Theren for patiently sharing his cinematographic knowledge during movie watching and for his high-level gastronomic skills.
Funding Statement
Michael Hanke was supported by funds from the German federal state of Saxony-Anhalt and the European Regional Development Fund (ERDF), Project: Center for Behavioral Brain Sciences.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
[version 1; referees: 3 approved]
References
- 1. Hasson U, Landesman O, Knappmeyer B, et al. : Neurocinematics: The neuroscience of film. Projections. 2008;2(1):1–26. 10.3167/proj.2008.020102 [DOI] [Google Scholar]
- 2. Hasson U, Malach R, Heeger DJ: Reliability of cortical activity during natural stimulation. Trends Cogn Sci. 2010;14(1):40–48. 10.1016/j.tics.2009.10.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Lankinen K, Saari J, Hari R, et al. : Intersubject consistency of cortical MEG signals during movie viewing. Neuroimage. 2014;92:217–224. 10.1016/j.neuroimage.2014.02.004 [DOI] [PubMed] [Google Scholar]
- 4. Cutting JE, Brunick KL, DeLong JE: The changing poetics of the dissolve in Hollywood film. Empirical Studies of the Arts. 2011;29(2):149–169. 10.2190/EM.29.2.b [DOI] [Google Scholar]
- 5. Hanke M, Baumgartner FJ, Ibe P, et al. : A high-resolution 7-Tesla fMRI dataset from complex natural stimulation with an audio movie. Sci Data. 2014;1: 140003. 10.1038/sdata.2014.3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Hanke M, Adelhöfer N, Kottke D, et al. : A studyforrest extension, simultaneous fMRI and eye gaze recordings during prolonged natural stimulation. bioRxiv. 2016. 10.1101/046581 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Hanke M, Dinga R, Häusler C, et al. : High-resolution 7-Tesla fMRI data on the perception of musical genres – an extension to the studyforrest dataset [version 1; referees: 2 approved with reservations]. F1000Res. 2015;4:174 10.12688/f1000research.6679.1 [DOI] [Google Scholar]
- 8. Labs A, Reich T, Schulenburg H, et al. : Portrayed emotions in the movie "Forrest Gump" [version 1; referees: 2 approved]. F1000Res. 2015;4:92. 10.12688/f1000research.6230.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Sengupta A, Kaule F, Guntupalli JS, et al. : A studyforrest extension, retinotopic mapping and localization of higher visual areas. Scientific Data, accepted. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Epstein R, Kanwisher N: A cortical representation of the local visual environment. Nature. 1998;392(6676):598-601. 10.1038/33402 [DOI] [PubMed] [Google Scholar]
- 11. Häusler CO, Hanke M: Dataset 1 in: An annotation of cuts, depicted locations, and temporal progression in the motion picture “Forrest Gump”. F1000Research. 2016. Data Source [DOI] [PMC free article] [PubMed]
- 12. Häusler CO, Hanke M: Dataset 2 in: An annotation of cuts, depicted locations, and temporal progression in the motion picture “Forrest Gump”. F1000Research. 2016. Data Source [DOI] [PMC free article] [PubMed]