Capturing the nature of events and event context using hierarchical event descriptors (HED)

Kay Robbins; Dung Truong; Stefan Appelhoff; Arnaud Delorme; Scott Makeig

doi:10.1016/j.neuroimage.2021.118766

. Author manuscript; available in PMC: 2022 Mar 16.

Published in final edited form as: Neuroimage. 2021 Nov 27;245:118766. doi: 10.1016/j.neuroimage.2021.118766

Capturing the nature of events and event context using hierarchical event descriptors (HED)

Kay Robbins ^a,^*, Dung Truong ^b, Stefan Appelhoff ^c, Arnaud Delorme ^b,^d, Scott Makeig ^b

PMCID: PMC8925904 NIHMSID: NIHMS1770954 PMID: 34848298

Abstract

Event-related data analysis plays a central role in EEG and MEG (MEEG) and other neuroimaging modalities including fMRI. Choices about which events to report and how to annotate their full natures significantly influence the value, reliability, and reproducibility of neuroimaging datasets for further analysis and meta- or mega-analysis. A powerful annotation strategy using the new third-generation formulation of the Hierarchical Event Descriptors (HED) framework and tools (hedtags.org) combines robust event description with details of experiment design and metadata in a human-readable as well as machine-actionable form, making event annotation relevant to the full range of neuroimaging and other time series data. This paper considers the event design and annotation process using as a case study the well-known multi-subject, multimodal dataset of Wakeman and Henson made available by its authors as a Brain Imaging Data Structure (BIDS) dataset (bids.neuroimaging.io). We propose a set of best practices and guidelines for event annotation integrated in a natural way into the BIDS metadata file architecture, examine the impact of event design decisions, and provide a working example of organizing events in MEEG and other neuroimaging data. We demonstrate how annotations using HED can document events occurring during neuroimaging experiments as well as their interrelationships, providing machine-actionable annotation enabling automated within- and across-experiment analysis and comparisons. We discuss the evolution of HED software tools and have made available an accompanying HED-annotated BIDS-formated edition of the MEEG data of the Wakeman and Henson dataset (openneuro.org, ds003645).

Keywords: Events, Event annotation, Hierarchical event descriptors, HED, BIDS, EEG, MEG, HED-3G, Time series

1. Introduction

EEG (electroencephalographic) and MEG (magnetoencephalographic) neuroimaging, collectively known as MEEG, are non-invasive brain imaging technologies for capturing neuroelectromagnetic brain dynamic records at millisecond-scale sampling rates. As MEEG records brain signals occurring on the time scale of individual thoughts and actions, event-related data analysis plays a central role in MEEG and other types of neuroimaging experiments. Because of the essential role that event markers and their annotations play in linking experimental data to the unfolding of the experiment, incomplete event reporting using event annotations that are inaccurate, overly simple, or absent represents a significant barrier to analysis of shared neuroimaging data. Thoughtful choices as to how events are measured, identified, and annotated can greatly improve the utility of the collected data for both immediate and long-term analyses.

Good annotation tools and standards can also incorporate useful information about experimental design, participant tasks, data features (for example eyeblinks, movement artifact, ictal activity), and other metadata into the collected and later shared data, thereby making the data ready for efficient within- and across-study analyses using a variety of approaches. Although here we focus on MEEG applications, event annotation standards and practices essential for MEEG data analysis can be applied equally well to other types of neuroimaging time series data including fMRI. For example, growing appreciation of the importance of embodied cognition on mental life (Shapiro, 2019), new lightweight, low cost methods of recording details of brain activities and motor behavior of experiment participants (Casson, 2019) (Jas et al., 2021) (Vitali and Perkins, 2020), and emergence of the practice of recording both brain activity and behavior (as well as psychophysiology) at higher resolution in a broader range of tasks and task environments (often termed Mobile Brain/Body Imaging or MoBI) (Makeig et al., 2009), make development of a suitable and more comprehensive data annotation framework ever more urgent.

Events.

In everyday life, we use the term “event” to describe some experience (or sequence of interrelated experiences) unfolding through time that has some significance distinguishing it from other preceding, concurrent, and succeeding events. Events in this sense may be brief (e.g., the experience of hearing an unexpected click) or may unfold over any time period (e.g., the experience of viewing a movie, or of repeatedly performing a cognitive task during a neuroimaging experiment).

Moreover, experiences we may refer to as events may be nested in time. For example we may recall, as a meaningful event, our emotional response to viewing the surprising first clip of a particular scene in a movie presented to us during a neuroimaging recording session. However, we may equally well recall, and think of as an event, our experience of viewing that clip, or our experience of viewing the whole scene, or the whole movie – or, of participating in the entire recording session. In recounting another experienced event, we typically recall and describe its critical transition points (e.g., “game kickoff”, “the final movie credits beginning to scroll”, “my feeling the moment after the electrode cap came off”). These we might liken to moments of phase transition in a time-limited dynamic process.

Event markers.

In neuroimaging time-series recordings, experiment events are typically recorded using event markers marking that each mark the time of some phase transition or other point of interest in the unfolding event or event process (most often, time of onset). Unfortunately, in practice these event markers are typically themselves labelled and referred to as “events”, risking conceptual confusion.

Each event marker designates a single time point, typically expressed as a time offset from the start of the time series recording. To be useful, the event marker must be associated with metadata that includes information about the type of event phase transition it marks, a reference to the ongoing event process it marks, as well as a description of the nature of that event. The description of the event is most conveniently associated with the event marker marking its onset. Event markers of later phase transitions in the event (e.g., its offset) need not repeat this description if they include an unequivocal reference to the event. As well as marking event onsets and offsets, event markers may mark other meaningful event phase transitions – for example the moments at which the trajectories of balls thrown by a participant in a juggling experiment reach their apex or a presented sound reaches maximum amplitude. Analyses aimed to better understand how brain activity supports skilled juggling or speech comprehension may well strongly benefit from identifying and then marking these moments in the experiment data record.

Fig. 1 illustrates these concepts schematically. During a task condition in which spatial target ‘+’ images are briefly presented at different screen positions, the participant is instructed to reach to touch the center of the current or most recently displayed target. HED annotations associated with the event markers provide essential linkage between the event processes and the measured data. Below, we also show how HED annotation can also capture the relation of events to the experiment design.

Event context.

An event occurring within longer-duration events (e.g., the experience of a stimulus presentation within a supervening task block in a neuroimaging session), and/or during temporally overlapping events, may be said to occur within the context of those events. Since event marker latencies use a common timeline, software tools may automatically add context information about other ongoing events (wholly concurrent or temporally overlapping) to the event marker metadata at their time of use in data search and analysis. In the future, tools dealing with event context might be extended to facilitate desired analyses relating recorded brain dynamics to the experienced preceding and/or anticipated succeeding events.

Overview.

This paper introduces a practical event design strategy and illustrates a set of best practices for event reporting and annotation based on combining the new third-generation formulation of the Hierarchical Event Descriptor (HED) annotation framework (Robbins et al., 2021) with the MEEG data storage architecture of the Brain Imaging Data Structure (BIDS) group (Gorgolewski et al., 2016) (Niso et al., 2018) (Pernet et al., 2019) (Holdgraf et al., 2019). The paper is organized around a case study using MEEG data from a publicly-available multi-participant, EEG/MEG and fMRI experiment by Daniel Wakeman and Richard Henson (Wakeman and Henson, 2015; abbreviated below as W-H) saved in conformity with the BIDS guidelines. The HED/BIDS integration of event annotation demonstrated and recommended here not only facilitates automatic and informative summarization of data; it also establishes a standardized interface for automated pipelines to search for, collect, read, preprocess, and perform automated event-related analysis using study-independent tools and vocabulary. In particular, the strategy enables analyses to be performed across stored datasets, even when these datasets do not have the same experiment design.

W-H.

The W-H experiment was conducted to develop methods for integrating multiple imaging modalities into analysis to increase the accuracy of functional and structural connectivity analyses. Nineteen participants completed two recording sessions spaced three months apart – one session recorded fMRI data (W-H-fMRI) and the other simultaneously recorded MEG and EEG data (W-H-MEEG). During each session, participants performed the same perceptual task, evaluating the symmetry of presented photographs of famous, unfamiliar, and scrambled faces. The participants pressed one of two keyboard keys with left or right index fingers, respectively, to indicate a subjective yes or no decision as to the relative spatial symmetry of each viewed image. The original, unannotated W-H dataset was made available on OpenNeuro (openneuro.org, ds000117). Recently, we have shared a BIDS version of the W-H joint EEG/MEG data on OpenNeuro (openneuro.org, ds003645) with the more complete event organization and annotation discussed in this paper. Although we here focus on the MEEG portion of the W-H data set, the methods we demonstrate are equally applicable to annotation of fMRI or other neuroimaging time series data.

Unlike most MEEG experiments, the W-H overt face-symmetry judgment task was not itself of interest to the experimenters, who thus made no effort to judge whether participant responses had some objective basis in the face images themselves. Rather, the experiment was designed to covertly test recognition memory for the three types of face images. To this end, each individual face image was presented twice during the session. For half of the presented faces, the second presentation immediately followed the first. For the other half, the second presentation occurred after 5–15 intervening face image presentations. Famous faces were feature-matched to unfamiliar faces, and half the faces were female. Following the neuroimaging sessions, the authors also collected behavioral recognition memory performance measures from participants to allow testing for interactions between MEEG responses associated with individual image presentations and subsequent recognition memory for those images. These behavioral recognition memory data were also provided by the data authors for inclusion in our revised MEEG dataset.

Fig. 2 shows a schematic view of a typical event sequence in the W-H experiment. All of the session recordings were conducted using the same equipment, with the participant seated and facing a computer monitor throughout (top black timeline). The bottom two timelines show the introduced sensory events (visual screen image presentations, green timeline) and participant actions (left or right index finger key presses, purple timeline).

Some of the participants were instructed to follow each face image presentation onset with a left index finger key press to indicate above average facial symmetry and a right index finger press to indicate below average facial symmetry. The remaining participants used the opposite key assignment. The key assignment was in effect for all of the recordings associated with a particular participant (orange timeline). The participants were also instructed to fixate on the white cross and asked not to blink while the fixation cross and face images were presented (thick gray gaze task timelines).

The fundamental problem addressed here is how to effectively describe events in a standardized form that is human-readable, machine-actionable, and analysis-ready – without placing undue burden on the annotator. The W-H-MEEG experiment has five regularly repeating types of events. We demonstrate how to create locally defined names (show_cross, show_face, show_circle, left_press, and right_press) using a standardized vocabulary (HED) and to associate these names with event markers, resulting in an analysis-ready annotated event stream.

The following section begins with a brief introduction to the HED system and, using the W-H MEEG experiment as a concrete example, explains the event annotation process including annotations relating event types to the experiment design. Section 3 shows how these annotations can be organized within a BIDS dataset to achieve machine-actionable, analysis-ready annotation. Using the example developed in Sections 2 and 3, Section 4 examines the event design process and proposes a set of guidelines for effective design and annotation in neuroimaging research. We discuss what events should be reported, how the events should be encoded, and sketch planned further work to extend this encoding to include the relationship of the encoded events to participant task(s) and intent. Section 4 also summarizes the importance and potential impact of best-practice annotation strategies in making both stored and shared data more reproducible, interpretable and usable, first to the annotators themselves, then in any subsequent analysis enabled by effective data storage and sharing. We give a brief review and roadmap for future HED development in Section 5.

2. Machine-actionable event annotation using HED

The HED system is based on a collection of hierarchically organized terms (the base HED schema) that can be used to describe experiment events, condition variables, participant tasks, metadata, and the recording’s temporal structure. HED was specifically designed to encode information in both human- and machine-actionable format to enable validation, search, identification, and analysis of events in neuroimaging or other time series datasets that include events with known timing.

The original HED implementation (first-generation) focused mainly on a description of stimuli and responses (Bigdely-Shamlo et al., 2013). The second-generation HED framework (Bigdely-Shamlo et al., 2016) included many vocabulary improvements, plus tools for validation, data search, and analysis. HED was accepted in 2019 as an optional standard for event annotation in BIDS formated data.

HED has recently undergone an extensive third-generation redesign (HED-3G) to enable capture not only of basic event and event marker descriptions, but also of experimental conditions, temporal structure, and event context (Robbins et al., 2021). HED-3G provides a readily extensible basis for easily interpretable annotation of time series datasets for use in analysis, re-analysis, and shared data mega-analysis. HED-3G was officially released in August 2021 and is ready for widespread use in data archiving, sharing, analysis, and mega-analysis. In this paper, we use the term HED to refer exclusively to HED-3G.

The remainder of Section 2 works through the W-H-MEEG case study step-by-step to illustrate the HED annotation process and the major features of HED. The examples are organized so that the end result is a fully-annotated BIDS dataset.

2.1. A starting point for HED dataset event annotation

The HED base schema has seven top-level or root nodes as shown by the partially expanded schema tree in Fig. 3, left. The very basic HED event annotation shown in the table inset on the right is our starting point for development of comprehensive annotation.

To annotate events, users create comma-separated lists of terms selected from the HED base schema to describe the main events and concepts. This can be done as a table such as the one shown on the right in Fig. 3. Users first select an item from the Event top-level subtree to give a basic characterization of the event category (e.g., Sensory-event, Agent-action, Data-feature) for each of the main types of event markers. The top-level event categorization tag often serves as a primary search key for identifying events of interest. In addition to the event category, tags describing the sensory modality for sensory events or the type of action for agent actions are included next. In some sense, the annotation process can be thought of as using keywords from a structured vocabulary to tag events. The tag group (Press, Keyboard-key) in Fig. 3 then resembles a verb phrase, and the (Index-finger, (Press, Keyboard-key)) tag group a sentence with a subject and verb clause.

Additional tags should then be added to provide a more detailed description. For follow-on analyses, particularly comparisons of MEEG dynamics across experiments, having still more detailed annotation can add significant and enduring value to the data. In this example, adding annotations answering questions such as: “Which fingers pressed the keys?”, “How large were the cross, face image, and circle?”, “What colors were they?”, “Where were these images presented on the screen?”, and “For how long were they shown?”, can add details to the annotation that could well prove of interest in further analyses and mega-analyses involving the data, even when (as here) the specific hypothesis testing for which the experiment was designed did not vary nor evaluate effects associated with answers to these questions.

While classical statistical testing assumes rigidly controlled experiments that involve controlled variation of at most a few features of interest, new statistical methods including machine learning can exploit diversity in labelled data to learn deep structure in the data – here, links between MEEG dynamics and human experience and behavior. In the past, the value of neuroimaging data for the researchers who created it depended primarily on the quality of the scientific paper they published using it. Increasingly, the value of neuroimaging data accruing to the data authors will also include the number and quality of further analyses that exploit the rich information contained in the dataset to power cross-study analysis.

2.2. Short and long form annotation

A critical usability innovation in third-generation HED is the requirement that each term in the HED schema must have a unique name (i.e., must only appear in one place in the schema). As a result, an annotator can tag using just a single end-node term (e.g., Circle in an Item hierarchy), rather than spelling out its full hierarchical schema path string (e.g., Item/Object/Geometric-object/2D-shape/Ellipse/Circle). Automated HED tools can then map such short-form tags to their complete (long-form) paths whenever the data are to be validated or analyzed. See the Tools section of the HED specification for links to tools written in Matlab, Python, and JavaScript to perform this mapping (https://hed-specification.readthedocs.io/en/latest/).

The expanded long-form annotations allow tools collecting related events for analysis to find HED strings that belong to more general categories – for example, searching for event markers whose HED strings contain the more general term 2D-shape, not only the more specific Circle. This type of organization is particularly useful for gathering data epochs time locked to a variety of events across datasets that have some feature or features in common, and/or have been annotated with different levels of detail.

The HED tag examples in this paper are given in short form for readability, and HED tags are always italicized. Supplementary Table 2 has examples of short form to long form tag expansion. While HED tags are case insensitive, by convention HED tags start with a capital letter and individual words in a tag are hyphenated. This convention makes it easier to pick out individual tags in a lengthy string of comma-separated tags. Also, HED tags cannot themselves contain blanks. In this paper we display locally-defined terms in fixed point type. Terms used in BIDS event files (e.g., show_face or event_type) use underbars as word separators to allow tools to directly map identifiers into program variables or structure fields.

2.3. Identifying event concepts using HED definitions

Fig. 3 (above) gives minimal HED annotations for the five most regularly occurring event types in the W-H dataset as described schematically in Fig. 2. This level of annotation allows analysts to isolate events of different types (e.g., stimulus events vs. participant actions), but does not provide sufficient detail to support advanced analysis and cross-study comparisons. Further, the annotation treats each event as occurring instantaneously, but the image presentation events have distinct onsets, durations, and offsets, all of which are known to affect brain dynamics measured by MEEG or fMRI.

HED user event definitions allow annotators to document the structure of the experiment, as laid out in Fig. 2, by “defining” or “declaring” experiment event-related concepts using names of their choosing and associating them with tag groups. During the annotation process, users can then use the defined names in place of the longer tag strings. HED definitions allow data authors to use shorthand terms from the colloquial lab jargon that they use in everyday lab conversations, while allowing data search and analysis to make use of the full HED annotations in analyses. Definitions also make it easier to initially identify and then later refine (all within the single definitions) annotations by adding tags to give further details. HED definitions thus can improve annotation process organization similar to the way first planning and then programming sub-functions can simplify the coding process and improve the resulting computer code.

Importantly, HED user definitions also play an integral role in assisting data authors in documenting experiment architecture, event temporal extent, and other dataset aspects. Consider a simple user definition (Face-image) for the presentation of a face image on a black background with a white fixation cross.

Here we embolden defined terms for ease of reading. For simplicity the definition uses short-form encoding (e.g., Visual-presentation instead of the full path string Property/Sensory-property/Sensory-presentation/Visual-presentation). Of course, this definition can be made more detailed, at any point in the annotation process. Note, however, that to avoid circularities HED definitions cannot be nested.

Once defined, annotators can use Def/Face-image in building annotations in place of the more complete but much longer and harder to remember tag string, thus increasing the readability of the dataset annotation while allowing the annotator to use (and more easily recall) terms that seem most natural to them.

Next, we focus on the use of HED definitions to annotate more of the temporal fine structure of the participant experience. The green timeline of Fig. 2 (Section 1) shows the time courses of the sensory events in the W-H data. The bright green bar marks the “pre-stimulus period” during which a white cross is displayed, while the dark green bar marks the time during which the face image is displayed, and the light green bar marks the period during which a white circle is displayed.

The boundaries between these displays are marked by the show_cross, show_face, and show_circle event markers, respectively. In the W-H experiment, face display ends when the circle image is presented. In addition, performance periods for two additional instructed eye-control tasks (represented by the thick gray timelines in Fig. 2) coincide with these events: 1) participants were asked to maintain eye gaze fixation on the white cross while it was displayed, and 2) to inhibit eye blinks during face image presentations.

Table 1 shows an expanded version of the table inset of Fig. 3 using definitions grouped with Onset and Offset tags to document temporal relationships between events indicated schematically in Fig. 2. (See Supplementary Table 1 for the complete annotation.)

Table 1.

The HED event marker annotations that capture repeating details of the W-H timeline.

Level (Column value)	HED Annotation
Factor: face_type
famous_face	Description: A face that should be recognized by the participants.
	HED: (Definition/Famous-face-cond, (Condition-variable/Face-type, (Image, (Face, Famous))))
unfamiliar_face	Description: A face that should not be recognized by the participants.
	HED: (Definition/Unfamiliar-face-cond, (Condition-variable/Face-type, (Image, (Face, Unfamiliar))))
scrambled_face	Description: A scrambled face image generated by the face 2D FFT.
	HED: (Definition/Scrambled-face-cond, (Condition-variable/Face-type, (Image, (Face, Disordered))))
Factor: rep_status
first_show	Description: Factor level indicating the first display of this face.
	HED: (Definition/First-show-cond, (Condition-variable/Repetition-status, Item-count/1))
immediate_repeat	Description: Factor level indicating this face was the same as previous.
	HED: (Definition/Immediate-repeat-cond, (Condition-variable/Repetition-status, Item-count/2, Item-interval/1))
delayed_repeat	Description: Factor level indicating face was seen 5 to 15 trials ago.
	HED: (Definition/Delayed-repeat-cond, (Condition-variable/Repetition-status, Item-count/2))

BIDS folder level	Information file	Function
Dataset	…events.json	Provides descriptions of the columns that are applicable to all the …events.tsv files in the dataset. [The ‘HED’ keys in this JSON dictionary link HED annotations to values in the events files.]
	participants.tsv	Lists the participants. [A HED column may be used to add participant-specific information in HED annotation.]
Subject	…sessions.tsv	Lists the sessions per participant. [A HED column may be used to add session-specific information in HED annotation.]
Session	…scans.tsv	Lists the scans in the session (optional). [A HED column may be used to add scan-specific information in HED annotation.]
Modality(Scan)	…events.tsv	Lists the events in the scan (run). The column meanings and associated HED tags are given in the dataset-level …events.json file or other applicable …events.json files in the hierarchy. [A HED column in …events.tsv gives event-specific information in HED annotation.]

onset	duration	sample	event_type	face_type	rep_status	rep_lag	value	stim_file
0.400	n/a	1	setup_left_sym	n/a	n/a	n/a	2	n/a
23.870	n/a	26275	show_face_initial	famous_face	first_show	n/a	7	f032.bmp
24.081	n/a	26488	left_press	n/a	n/a	n/a	256	n/a
24.750	n/a	27225	show_circle	n/a	n/a	n/a	0	circle.bmp
26.457	n/a	29095	show_cross	n/a	n/a	n/a	1	cross.bmp
26.940	n/a	29634	show_face	famous_face	immediate_repeat	1	8	f032.bmp
27.913	n/a	30701	show_circle	n/a	n/a	n/a	0	circle.bmp
27.990	n/a	30789	right_press	n/a	n/a	n/a	4096	n/a

onset	onset_sample	stim_type	trigger	stim_file
24.2073	26,628	Unfamiliar	13	meg/u032.bmp
27.2473	29,972	Unfamiliar	14	meg/u032.bmp
30.3545	33,390	Unfamiliar	13	meg/u088.bmp
33.3618	36,698	Unfamiliar	13	meg/u084.bmp

onset	duration	cross_duration^*	stim_type	trigger	button_pushed	response_time	stim_file
0	0.908	0.534	FAMOUS	5	4	2.158	func/f013.bmp
3.273	0.962	0.586	FAMOUS	6	4	1.233	func/f013.bmp
6.647	0.825	0.546	UNFAMILIAR	13	4	1.183	func/u014.bmp

PERMALINK

Capturing the nature of events and event context using hierarchical event descriptors (HED)

Kay Robbins

Dung Truong

Stefan Appelhoff

Arnaud Delorme

Scott Makeig

Abstract

1. Introduction

Events.

Event markers.

Fig. 1.

Event context.

Overview.

W-H.

Fig. 2.

2. Machine-actionable event annotation using HED

2.1. A starting point for HED dataset event annotation

Fig. 3.

2.2. Short and long form annotation

2.3. Identifying event concepts using HED definitions

Table 1.

2.4. Event context and temporal events

2.5. Annotating experiment design and condition variables

The W-H experiment design.

Documenting experiment control events.

Table 2.

Table 3.

3. HED annotation of a BIDS-formated dataset

Table 4.

Table 5.

3.1. BIDS events.tsv files

Table 7.

3.2. BIDS events.json sidecar files

Table 6.

3.3. Assembling and using the complete event annotation

4. Best practices in event design and annotation

4.1. Event design for the W-H experiment

Table 8.

Table 9.

4.2. Pitfalls in reporting events by-trial rather than by-event

Table 10.

4.3. Documenting sensory presentations

4.4. Documenting participant responses

4.5. Documenting experimental conditions, controls, and designs

4.6. Task specification

Explicit tasks.

Implicit tasks.

4.7. Documenting temporal organization and architecture

4.8. The event design process

5. Discussion and roadmap

HED Library Schema.

Supplementary Material

Acknowledgments

Footnotes

Data availability

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases