Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Oct 2.
Published in final edited form as: Neuroimage. 2012 Feb 14;62(2):1017–1023. doi: 10.1016/j.neuroimage.2012.02.015

The continuing challenge of understanding and modeling hemodynamic variation in fMRI

Daniel A Handwerker 1, Javier Gonzalez-Castillo 1, Mark D’Esposito 3,4, Peter A Bandettini 1,2
PMCID: PMC4180210  NIHMSID: NIHMS357101  PMID: 22366081

Abstract

Interpretation of fMRI data depends on our ability to understand or model the shape of the hemodynamic response (HR) to a neural event. Although the HR has been studied almost since the beginning of fMRI, we are still far from having robust methods to account for the full range of known HR variation in typical fMRI analyses. This paper reviews how the authors and others contributed to our understanding of HR variation. We present an overview of studies that describe HR variation across voxels, healthy volunteers, populations, and dietary or pharmaceutical modulations. We also describe efforts to minimize the effects of HR variation in intrasubject, group, population, and connectivity analyses and the limits of these methods.

Keywords: BOLD, fMRI, hemodynamic response, population studies, regional variation, blood vessels

INTRODUCTION

The hemodynamic response (HR) to neural activity is the basis of functional magnetic resonance imaging (fMRI). Neural activity changes can occur on the order of milliseconds. When activity across a population of neurons in a voxel changes, a hemodynamic response can be recorded using T2* weighted acquisitions in an MRI scanner. In contrast to most direct measures of neural activity, HRs are slow (on the order of seconds) and vary in shape and timing between voxels, cortical regions, subjects, subject populations, and experimental tasks. Because hemodynamics depend on the blood vessel structure, changes in this underlying structure will alter the HR even with consistent changes in neural activity.

Figure 1 is reprinted, with permission, from (Handwerker, Ollinger et al. 2004). It shows hemodynamic responses from 20 subjects in primary sensorimotor cortex in response to a single button press. The signal increases over several seconds and then drops below and then returns to baseline. Although we do not fully understand in detail all the nuances of how blood flow, volume, and oxygenation change in response to neural activity, the hemodynamic response is highly consistent. Yet, if one looks closer, it has significant variability. The 20 subjects in Figure 1 show a wide range of differences in response latency, width, shape, and magnitude.

Figure 1.

Figure 1

Hemodynamic responses from 20 subjects averaged across a region of interest in primary sensorimotor cortex in response to a single button press. Reprinted, with permission, from (Handwerker, Ollinger et al. 2004).

While we understand many of the hemodynamic mechanisms behind these differences in HR, predicting the precise response shape for a given stimulus and voxel region is still impossible. HR shape predictions are required for many fMRI statistical analyses. In addition, it is difficult to determine if HR shape variability is neural rather or vascular in origin. This article covers the history of our understanding of HR variation and the development of methods to try to account for hemodynamic variation in fMRI research.

UNDERSTANDING HR VARIATION

The HR becomes central to fMRI analysis

The earliest fMRI studies used block designs to maximize signal-to-noise ratio and increase the likelihood of response detection. The first few papers, for example, (Kwong, Belliveau et al. 1992), didn’t model the HR shape. Similar to Positron Emission Tomography analyses of the time, they created statistical measures by subtracting, magnitudes from an active condition from magnitudes during a rest condition. Direct observation of raw time series revealed features in the hemodynamic response that required further modeling to more effectively extract task-related signal fluctuations. Soon after the work by Kwong et al., Bandettini and colleagues modeled the full time series response as a box-car function — i.e., zeros for the rest periods, and ones for the active periods — and convolved it with a smoothing function. By calculating the correlation of this time series to the data, they were able to generate more precise statistical maps (Bandettini, Jesmanowicz et al. 1993). This approach of comparing an idealized HR to BOLD time series soon became the standard way of detecting sites of neuronal activity in fMRI experiments. Simple correlation analysis was soon substituted by general linear model (GLM) analyses (Friston 1994), which allow greater flexibility for the experimental design and response modeling. The GLM remains the primary analysis technique for most task-based fMRI studies.

While some early fMRI papers tested different HR shapes or models, the robust response obtained with block design tasks made HR variability a less urgent concern. This started to change with the use of event-related task designs. Contrasting hemodynamic responses to brief stimuli let researchers answer more complex neuroscience questions. The inherently lower signal to noise ratio of event related experiments combined with the increased importance of subtle BOLD signal changes in response of brief events, meant that HR variability had the potential to confound results. Although the risk that HR variation could affect results was acknowledged, it was assumed to be a relatively minor concern (Buckner, Koutstaal et al. 1998). As described below, later studies have shown that the problem can be serious in some contexts.

HR variation in healthy volunteers

Even before event-related fMRI started to push the temporal limits of the hardware, early fMRI studies were determining the relative T2* vs. T2 vs. inflow contribution in activated tissue. In addition to showing that fMRI signal changes result from a spatially variable contribution of many factors, these early fMRI studies showed that the HR shape – particularly the response magnitude – varied greatly depending on the vessel diameters and density within each voxel. Another review in this issue covers this topic in more depth (Menon 2011). Vessel structure differences across voxels can also cause delay differences, as shown by the early HR delay maps produced by Lee, Glover, et. al. 1995. To produce these delay maps, the authors collected data from a periodic flashing checkerboard task and spatially mapped the phase differences of the responses. They saw latency delays of 4–8s in gray matter and 8–14s in visible vessels & sulci (Lee, Glover et al. 1995). Later work, at higher temporal and spatial resolutions, showed evidence of hemodynamic magnitude and delay differences across cortical layers (Silva and Koretsky 2002; Yu, Glen et al. 2011). While the goal of much of this work was to identify MRI pulse sequences and field strengths that could best localize neural activity, it also clearly showed that one of the biggest sources of non-neural HR magnitude and latency variation in healthy volunteers was within voxel blood vessel structure.

While the 14s phase lags from Lee, Glover et. al. 1995 are more than is typically observed in the literature, other studies have shown HR lag variation in the order of several seconds. For example, Buckner et al. found that when signals were averaged across regions of interest, task responses across healthy subjects often showed a 4–6s range of time-to-peaks (with 13 subjects and a 2s TR) (Buckner, Koutstaal et al. 1998). In another study, Birn et al. showed 6–11s visual cortex response latency in response to a brief stimulus (Birn, Saad et al. 2001). Interestingly, in this last study, these latencies were not correlated with response magnitude. This is potentially explained by the understanding that while vessels are mostly “downstream” they can also exist more upstream as well – having shorter latencies.

Buckner et al. not only looked at delay differences, but also evaluated HR differences across subjects in terms of how much the HR from one subject could predict the HR on another subject for the same region. They found that HR from one subject was able, on average, to explain 72% of the variance in other subjects. Around the same time, Aguirre was asking how much the hemodynamic response to a simple visuo-motor task varies within the same subject across hours or days and across subjects. While they didn’t perform the same variance analysis as Buckner, they did see a time-to-peak range of 2.7–6.2s across 32 volunteers. This variation didn’t correlate with reaction time. The HR shapes across subjects in that paper have very different peak-times, widths, and post-undershoot shapes. This variability is visibly greater than the HR shapes Buckner et. al. used to estimate HR similarity. They also showed that the HRs estimated from a region of interest in the same subject in multiple scans varied much less than HRs estimated from the same region of interest across subjects – particularly for multiple scans in a single day (Aguirre, Zarahn et al. 1998).

Known causes of HR variation

As discussed above, one of the biggest causes of non-neural HR variation is vessel size and density within a voxel. Large vessels are typically “downstream” and therefore experience an activation-induced oxygenation change up to 3 seconds later than the capillaries within parenchyma. Additional factors that can contribute to differences in HR shape between two subsequent measurements of the response to the same task, in the same region of the same subject include: thermal noise, breathing and heart rate changes, hardware instabilities, changes in cognitive state and strategy used to approach the task. If the measurements are performed on different days or we attempt to compare across different subjects or regions then vascular state, region of interest selection, and tissue differences also contribute.

Past research has helped us start to understand the relative contributions of each of these factors to HR variation so that variation of neuronal origin can be segregated from variation due to the other non-neuronal sources. Much of the research is based on dynamic models of BOLD contrast, which try predict how variation in regional blood flow, volume, and vessel reactivity combine to cause observed BOLD signal changes. This special issue has another article that covers this topic in more depth (Buxton 2012).

In the particular case of studies across populations, the major concern should be that an HR difference is mistakenly considered a neural difference or that a true neuronal difference is obliterated by the HR variability. Virtually every examined disease state, age difference, or ingested substance causes changes in HR shape. Alcohol (Levin, Ross et al. 1998), caffeine (Mulderink, Gitelman et al. 2002), fat (oral canola oil) (Noseworthy, Alfonsi et al. 2003), inhaled CO2 and O2 concentration changes (Sicard and Duong 2005), and intravenous saline (Levin, Frederick et al. 2001) all alter HR response magnitude. Nicotine is one notable exception with one study showing that HR magnitude remains constant across dose even with a heart rate change (Jacobsen, Gore et al. 2002). Population and case studies have found HR differences that were likely vascular in origin in aging (D'Esposito, Zarahn et al. 1999), dementia (Buckner, Snyder et al. 2000), and stroke (Pineiro, Pendlebury et al. 2002; Hamzei, Knab et al. 2003). Even if variation across voxels due to vessel effects is much larger than these group differences, a systematic change under large amounts of within-group variance can still produce a population effect. The larger the population size, the more likely a population effect due to HR differences will be statistically significant.

In many of these examples, the underlying assumption is that anything that can systematically alter baseline cerebral blood flow (CBF) or hematocrit levels will alter the HR response. While others have systematically examined the effects of baseline CBF on the HR (Bandettini and Wong 1997), an elegantly designed study by Cohen et. al. provides a good illustration of the the effects of baseline CBF on HR responses. The authors had volunteers breath normocapnic, hypercapnic (raises baseline CBF), and hypocapnic (lowers baseline CBF) air. When the baseline CBF stabilized in each condition, they ran a visual stimulation task. Thus, in the same volunteers, they had HRs at three different baseline CBF levels. As CBF rose, the magnitude of the response to visual stimulation decreased. Interestingly, the onset time and time-to-peak also increased with baseline CBF. When there was a physiologic modulation in BOLD magnitude in an individual, it appeared that time-to-peak increased by 0.28s for every 1% change in the baseline BOLD signal (Cohen, Ugurbil et al. 2002). This lag difference could appear in any population with a baseline CBF change.

Hemodynamic response changes can be due to more than altered CBF. Arterial spin labeling measures depend primarily on blood flow. Multiple arterial spin labeling studies have shown lower HR response magnitude variability across scans within subjects and across subjects than BOLD (Aguirre, Detre et al. 2002; Tjandra, Brooks et al. 2005; Raoult, Petr et al. 2011). This means that variation in volume and vessel elasticity can increase variability of the BOLD hemodynamic response in ways that are distinct from blood flow variation.

Drug studies also show how HR variability across scans is due to more than just blood flow variation. Caffeine, a known vasoconstrictor, is one of very few drugs whose effects on the HR have been examined in multiple experiments. While an early study noted the CBF baseline shift and response magnitude increase and proposed intentionally giving caffeine as a BOLD contrast enhancer (Mulderink, Gitelman et al. 2002), later work creates a more complex story (Liu, Behzadi et al. 2004; Behzadi and Liu 2006). If all caffeine did was shift the baseline CBF then the BOLD response would shift to result in a similar estimate of oxygen metabolism in response to a task. Two groups have shown that caffeine alters the relationship between CBF and oxygen metabolism (Perthen, Lansing et al. 2008; Chen and Parrish 2009). In addition, BOLD response is measurably better modeled as a linear system after caffeine ingestion (Liu and Liau 2010) and caffeine alters poststimulus undershoot dynamics (Liu, Behzadi et al. 2004). Perhaps some of these effects are neural, but it appears that the effects of caffeine on the HR shape include more than a CBF baseline shift. Even with this information, there is no field-wide consensus or even significant public discussions regarding how we record or control for caffeine intake (or drug intake in general) in fMRI studies.

Another mechanism to alter neurovascular coupling involves cyclooxygenase (COX). COX has several key roles in how astrocytes and neurons interact to alter vessel dilation (Koehler, Gebremedhin et al. 2006). Indomethacin, a COX inhibitor, has been shown to decrease the BOLD response to a task despite being a vasoconstrictive drug that should lower baseline CBF (Bruhn, Fransson et al. 2001). HR magnitude also changes in humans depending on volunteer’s specific COX-1 genotype (Hahn, Heinzel et al. 2011). Given that Ibuprofen, an over-the-counter pain reliever, is a COX inhibitor, population differences in the intake of a drug of such common use as this have the potential to become a confound in fMRI studies. Caffeine and COX inhibitors are just two examples, but the broader question of what other common substances can cause similarly complex changes in the HR remains unanswered.

In population studies, there also seem to be changes that can’t simply be attributed to baseline CBF changes. In one case, disturbed cerebrovascular reserve capacity from an intracranial arterial occlusion, resulted in a negative HR (Hamzei, Knab et al. 2003). Aging studies have shown conflicting reports of differences in HR time-from-peak, time-to-baseline, and fit residuals (D'Esposito, Zarahn et al. 1999; Huettel, Singerman et al. 2001; Richter and Richter 2003). Other potential aging related variations in BOLD signal that might be hemodynamic in origin were summarized in a review article (D'Esposito, Deouell et al. 2003). For example, changes in vessel elasticity was implicated as causing a difference in the time course of cerebrovascular reactivity (Hajdu, Heistad et al. 1990). Some of the challenges mentioned in that review can be generalized to other populations.

THE EFFECT OF HR VARIATION IN DATA ANALYSIS

Significance testing across varying HR shapes

Any statistical method that includes a model of the HR risks missing the aspects of the response that are not included in the model. Some of the earliest analysis methods examined how significance varied depending on the modeled HR shape (Bandettini, Jesmanowicz et al. 1993; Friston 1994). In most of these cases, they were trying to identify a single HR shape that was best for most studies. Aguirre et al. suggested including a separate scan to estimate a separate HR for each subject (Aguirre, Zarahn et al. 1998). They made it easy to include these individualized estimates in GLM analyses using the VoxBo software. This proposal was not widely adapted. Hesitance to use 6 additional minutes of scan time to independently estimate a subject-specific HR played a role, but more prosaic issues also mattered. There was no “push-button way” to insert a custom HR into SPM, the very widely used fMRI data processing software. There was also skepticism that a motor HR from one region could better estimate an HR in other brain regions of the same person despite solid research showing this was true. Perhaps the primary reason that it was not adapted was, that no one showed precisely, at the time, how much an HR misestimate could alter results.

When Dan Handwerker joined Mark D’Eposito’s group as a graduate student, they decided to examine this issue. They repeated the basic experimental design of Aguirre, but designed a task to generate robust HR’s in 4 brain regions. These responses were used to set a range of HR shapes to simulate the effect of HR variation on GLM statistics. They discovered that a 1 s misestimate of the HR time-to-peak could decrease the fit magnitude by 10% and a 2 s misestimate decreases the magnitude by 38%. An accurate subject-specific HR estimate would clearly improve statistical results compared to using the same shape in all subjects. Still, variation across voxels within subjects were still large enough to cause non-trivial shifts in statistics to limit the benefits of using one HR shape per subject (Handwerker, Ollinger et al. 2004).

While a robust enough task would still show significance, the drop is large enough to create false negative results. Including a wider range of response HR shapes in an analysis, such as modeled transients, increases the extent of significant activation (Harms and Melcher 2003; Uludag 2008). While these studies hypothesized neural origins of HR shape differences, they clearly show that unmodeled variation affects significance. In addition, if there were a systematic HR shape difference between populations it would incorrectly appear as a response magnitude difference in many group analyses.

The basic approach to account for HR variability in GLM-based analyses is to include multiple basis functions for each condition. For example, a Taylor expansion using an HR and its derivatives allows for some lag and width variation around the initial HR shape (Henson, Price et al. 2002). While beneficial, it rapidly loses effectiveness if the true HR in a voxel has more than a 1s lag from the modeled HR. In addition, if all the basis functions aren’t included in the magnitude estimates in a group analysis (Calhoun, Stevens et al. 2004), the benefits of flexible HR modeling are lost (Handwerker, Ollinger et al. 2004). The opposite extreme would be to deconvolve any task-locked HR with a series of sinusoids in a GLM (Josephs, Turner et al. 1997), without a GLM (Glover 1999), or using a series of lagged impulse responses in a GLM (Ollinger, Corbetta et al. 2001). These are powerful methods for significance testing within subject, but they use many more degrees of freedom and it is more challenging to design and interpret contrasts between task conditions group statistics using these methods. Intermediate methods can model an unlimited range of lags for a specific HR shape (Saad, Ropella et al. 2001) or use a specially designed set of basis functions in a GLM and prior information to estimate the HR (Woolrich, Behrens et al. 2004). While it is possible to compare results across several methods, no one method is designed to handle the full range of known HR lag and shape variation observed across voxels in individual subjects.

Correcting for non-neural magnitude differences

HR magnitude variation based on the underlying vessel structure, adds noise to group results & can bias the foci of activation towards larger vessels. At best, this is just another source of noise in fMRI data. For studies across populations, a vascular difference could be interpreted as a neural difference. One robust way to address this issue is to make sure one condition of a study shows no population difference in the same voxels that the condition of interest does show a population difference. This is not always feasible.

A series of methods have sought to calibrate the fMRI signal to remove large vessel effects (or more precisely, to remove blood volume variation across voxels – the primary source of magnitude differences). Peter Bandettini and Eric Wong first proposed collecting a run with volunteers breathing CO2 enriched air. This would cause a global BOLD signal change what would scale with venous blood volume in each voxel. Simply dividing by vessel magnitudes would remove spatial vascular differences (Bandettini and Wong 1997). This concept was extended to collecting BOLD/CBF data at a single (Davis, Kwong et al. 1998) or multiple (Hoge, Atkinson et al. 1999) air CO2 concentrations to calibrate responses to a more direct measure of cerebral metabolism. A large methodological literature is being built around this concept, but these types of methods are used in only a small fraction of population studies. One reason is that CO2 inhalation systems might be unpleasant for volunteers, and the best methods require a lot of scan time. Also, these calibrations involve subtraction or division of data, decreasing the signal to noise ratio. These inconvenience factors tend to outweigh the benefit of this type of calibration for most studies. Handwerker and D’Esposito tested whether a more convenient, but slightly less controllable method, global signal changes from breath hold, could be sufficient to calibrate some signal changes with aging (Handwerker, Gazzaley et al. 2007). That manuscript includes a more thorough review of earlier work in this area. We showed that breath holding revealed age-related BOLD differences. Accounting for those global signal changes altered the HR magnitude in regions that showed significant task-related changes with aging. While simpler to implement than CO2 breathing, breath holding tasks still require people who are able to hold their breath for around 20s and follow instructions. This challenge, in addition to the requirements of extra scan time and the decreased SNR from dividing data for calibration, explain why this method also hasn’t been widely adapted.

All these approaches are trying to strike a balance between an accurate and precise calibration method and a practical protocol to add to a large population study. Until that happens, best research practice will continue to require study designs with a dissociation between population responses between conditions. Studies that only show a population difference to a single condition should continue to be treated with healthy skepticism.

HR variation and brain networks

With the rapid growth of network modeling of brain connections – particularly using fMRI data with spontaneous fluctuations – there is increased interest and attempts to derive functional brain networks; and to assign a temporal direction to information flow between brain regions. A recent paper, Smith, Miller et. al. 2011, provides a good summary of existing network modeling methods and uses simulations to compare method quality. The simulations included HR delay variation with a standard deviation of 0.5s. Since this was well below the observed variation across voxels, they considered this a “best-case scenario while remaining reasonably realistic.” They show that some methods are robust to this level of HR lag variation, but lag-based methods, like Granger causality perform particularly poorly. Even on stimulated data with best-case HR variation, none of the existing methods perform exceptionally at assigning directionality to the connections between regions. The best method tested, Patel's τ, showed a 65% accuracy with 50% being chance (Smith, Miller et al. 2011). While these results show some existing methods are robust to a best-case scenario of random HR noise, none were tested on the full range of known HR lag differences, other HR shape differences, or systematic HR differences between network nodes.

One well-known network directionality method that was not tested by Smith et. al. is Dynamic Causal Modeling (DCM). An appealing aspect of this method is that it attempts to use Baysean methods to fit an optimal HR for each node and then use that model to estimate the temporal patterns of the underlying neural activity (Friston, Harrison et al. 2003). Although the creators of this method have tested it using some HR variation (Stephan, Weiskopf et al. 2007), we decided to test the accuracy of DCM at making the simplest possible prediction. In a two node network, do the estimated neural events of node 1 predict the estimated neural events in node 2 or vice versa? These two models are depicted in Figure 2E. For each of the two nodes, we took a hemodynamic response function shape (Fig. 2A), convolved it with an event-related time series of neural events (Fig. 2B), and scaled and added it to noise (Fig. 2C,D). The HR was constant in node 1. The HR in node 2 was identical to node 1, had a 1 sec delayed peak or onset, or a larger relative post-stimulus undershoot. For the event-related design, there was a 50% chance of an event appearing at each time point (sampling rate = 2 sec). The identical event pattern with no time lag was used in both nodes. While a 1 sec HR onset delay is equivalent to a 1 sec shift in event timing, all other cases should be null results (i.e. neither node significantly predicts the other). To get realistic fMRI noise with minimal causality and correlation, each of the two nodes contained spontaneous fluctuations from a single voxel taken from a different subject and brain region (Fig. 2C). These data had 300 time points and were collected as part of (Murphy, Birn et al. 2009). The noise time series had a correlation magnitude of r=−0.08. Simulated data were created by adding these noise time series to the scaled synthetic responses (Fig. 2B). The data were scaled so that the correlations between the nodes when the HRs were identical would represent a higher (r=0.78) and lower (r=0.46) temporal signal-to-noise ratio (TSNR). A DCM analysis in the SPM8 software package was run on both of the networks in Figure 2E at both TNSR levels and all 4 pairings of HR shapes (Fig. 2D). The DCM analysis was also run on the noise time series (Fig. 2C). The underlying noise time series in nodes 1 and 2 were also switched to confirm that the presented results were a function of HR variation and not noise characteristics.

Figure 2.

Figure 2

The left column shows the inputs to node 1 of the dynamic casual modeling (DCM) simulation and the right column shows the inputs to node 2. (A) Node 1 always includes a single HR shape. Node 2 includes the same HR shape or HR shapes that have delayed onsets, peak times, or undershoot magnitudes. (B) The HRs are convolved with a event-relate design of neural event times (black dots). This shows a 150 sec window of the 300 sec time series (C) Each node has a noise time series from different subjects’ scans of spontaneous fluctuations. (D). The HR time series in B are scaled and added to the noise in C. This figure shows how the time series look for the lower TSNR condition. (E) Schematics of the two models that were compared using DCM. (F) Comparisons of the two models in E. For each HR shape tested, if the blue or green lines are higher, that means node 1 is more likely to predict node 2. If the red or yellow bars are higher, node 2 is more likely to predict node 1. The dashed line at 0.9 is a typical significance threshold.

Figure 2F shows the results comparing posterior probabilities from each model. If one model is higher than the other, it is more likely to represent the data. A typical significance cutoff is a posterior probability greater than 0.9. As expected, neither model is significantly better when they contain identical HR shapes or with the noise time series. The identical HR shapes show a causality bias towards node 1 predicting node 2, but this doesn’t cross significance. This non-significant bias might exist because there is very small bias towards the same causal model in the noise time series (“no HRs” in Fig. 2F). With just the noise time series, neither model fits the data well so there’s little evidence either is correct. With the identical stimuli included, there is the same bias for the node 1 predicts node 2 causal model as in the noise time series, but each node’s time series fits the stimulus timing better and shows more evidence for each model. Thus, even with identical HRs, one model seems more accurate.

When the HR in node 2 has a 1 sec peak or onset delay compared to node 1, the DCM analysis shows node 1 probably predicts node 2. Since an HR onset delay would look identical to a neural event delay, one would expect DCM to show this model difference. A 1 sec slower rise-time for an HR is well within the range of normal HR variation and shouldn’t be considered to represent a delay in neural timing. When the HR in node 2 had a larger post-stimulus undershoot, the DCM analysis shows that node 2 probably predicts node 1. While the tested undershoot is very similar to HRs observed in real data (Figure 1), it doesn’t perfectly fit into the standard HR estimation method used in DCM. We suspect that, if the HR is misestimated, any systematic variation in responses could alter the predicted timing for neural events. For all models with different HR shapes in each node, the higher the TSNR of the data, the more likely a significant model difference will be found – since the variability of the HR’s was more fully expressed.

While our test is on an overly simplistic model it does highlight that the hemodynamics estimation portion of the DCM analysis shouldn’t be assumed to be accurate and cannot be finessed away no matter how sophisticated the models. Even though DCM may model some HR variation it is important to both examine the Bayesian priors for the HR estimation and compare them to the actual HR shape in each voxel or region of interest.

THE CURRENT STATUS OF HR VARIATION

The challenges and solutions regarding HR variation seem to have changed little over the history of fMRI. We now have a better understanding of the range of HR variation and some factors that can systematically alter HR shape. We know observed HR variation is sufficient, at best, to increase noise in most fMRI analyses and, at worst, to systematically bias results. An array of methods have been developed to deal with some of this variation, but they are all far from perfect and quite a few studies don’t consider HR variation at all.

Since neurovascular coupling can vary based on how the vascular and tissue composition of voxels interact and can be altered in complex ways by disease or medication, we are not capable of robustly separating the voxel-wise, region-wise, subject-wise, and population-wise variability into variability of hemodynamic origin and variability of neuronal origin. While better scanners and pulse sequences will improve our temporal signal to noise ratio, the issues surrounding HR variation will continue to fundamentally limit the conclusions we can reach using fMRI data.

The best research uses known sources of HR variation and consistency and makes careful assumptions where possible and necessary. Even after 20 years, we are still developing a deeper understanding of the hemodynamic response and better methods to robustly characterize it, work with it, calibrate it, or work around it. In addition, we are finding that even with the dominance of the hemodynamic variability, still more subtle neuronal information can and will be extracted in the future.

Acknowledgments

Supported by the NIMH Intramural research program. Carlton Chu helped make sure the DCM analyses were properly run.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Aguirre GK, Detre JA, et al. Experimental design and the relative sensitivity of BOLD and perfusion fMRI. Neuroimage. 2002;15(3):488–500. doi: 10.1006/nimg.2001.0990. [DOI] [PubMed] [Google Scholar]
  2. Aguirre GK, Zarahn E, et al. The variability of human, BOLD hemodynamic responses. Neuroimage. 1998;8(4):360–369. doi: 10.1006/nimg.1998.0369. [DOI] [PubMed] [Google Scholar]
  3. Bandettini PA, Jesmanowicz A, et al. Processing strategies for time-course data sets in functional MRI of the human brain. Magnetic resonance in medicine : official journal of the Society of Magnetic Resonance in Medicine / Society of Magnetic Resonance in Medicine. 1993;30(2):161–173. doi: 10.1002/mrm.1910300204. [DOI] [PubMed] [Google Scholar]
  4. Bandettini PA, Wong EC. A hypercapnia-based normalization method for improved spatial localization of human brain activation with fMRI. NMR Biomed. 1997;10(4–5):197–203. doi: 10.1002/(sici)1099-1492(199706/08)10:4/5<197::aid-nbm466>3.0.co;2-s. [DOI] [PubMed] [Google Scholar]
  5. Behzadi Y, Liu TT. Caffeine reduces the initial dip in the visual BOLD response at 3 T. Neuroimage. 2006;32(1):9–15. doi: 10.1016/j.neuroimage.2006.03.005. [DOI] [PubMed] [Google Scholar]
  6. Birn RM, Saad ZS, et al. Spatial heterogeneity of the nonlinear dynamics in the FMRI BOLD response. Neuroimage. 2001;14(4):817–826. doi: 10.1006/nimg.2001.0873. [DOI] [PubMed] [Google Scholar]
  7. Bruhn H, Fransson P, et al. Modulation of cerebral blood oxygenation by indomethacin: MRI at rest and functional brain activation. Journal of magnetic resonance imaging : JMRI. 2001;13(3):325–334. doi: 10.1002/jmri.1047. [DOI] [PubMed] [Google Scholar]
  8. Buckner RL, Koutstaal W, et al. Functional-anatomic study of episodic retrieval. II. Selective averaging of event-related fMRI trials to test the retrieval success hypothesis. Neuroimage. 1998;7(3):163–175. doi: 10.1006/nimg.1998.0328. [DOI] [PubMed] [Google Scholar]
  9. Buckner RL, Snyder AZ, et al. Functional brain imaging of young, nondemented, and demented older adults. Journal of cognitive neuroscience. 2000;12(Suppl 2):24–34. doi: 10.1162/089892900564046. [DOI] [PubMed] [Google Scholar]
  10. Calhoun VD, Stevens MC, et al. fMRI analysis with the general linear model: removal of latency-induced amplitude bias by incorporation of hemodynamic derivative terms. Neuroimage. 2004;22(1):252–257. doi: 10.1016/j.neuroimage.2003.12.029. [DOI] [PubMed] [Google Scholar]
  11. Chen Y, Parrish TB. Caffeine's effects on cerebrovascular reactivity and coupling between cerebral blood flow and oxygen metabolism. Neuroimage. 2009;44(3):647–652. doi: 10.1016/j.neuroimage.2008.09.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Cohen ER, Ugurbil K, et al. Effect of basal conditions on the magnitude and dynamics of the blood oxygenation level-dependent fMRI response. J Cereb Blood Flow Metab. 2002;22(9):1042–1053. doi: 10.1097/00004647-200209000-00002. [DOI] [PubMed] [Google Scholar]
  13. D'Esposito M, Deouell LY, et al. Alterations in the BOLD fMRI signal with ageing and disease: a challenge for neuroimaging. Nat Rev Neurosci. 2003;4(11):863–872. doi: 10.1038/nrn1246. [DOI] [PubMed] [Google Scholar]
  14. D'Esposito M, Zarahn E, et al. The effect of normal aging on the coupling of neural activity to the bold hemodynamic response. Neuroimage. 1999;10(1):6–14. doi: 10.1006/nimg.1999.0444. [DOI] [PubMed] [Google Scholar]
  15. Davis TL, Kwong KK, et al. Calibrated functional MRI: mapping the dynamics of oxidative metabolism. Proc Natl Acad Sci U S A. 1998;95(4):1834–1839. doi: 10.1073/pnas.95.4.1834. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Friston KJ, Harrison L, et al. Dynamic causal modelling. Neuroimage. 2003;19(4):1273–1302. doi: 10.1016/s1053-8119(03)00202-7. [DOI] [PubMed] [Google Scholar]
  17. Friston KJ, Jezzard P, Turner R. Analysis of functional MRI time-series. Human Brain Mapping. 1994;1:153–171. [Google Scholar]
  18. Glover GH. Deconvolution of impulse response in event-related BOLD fMRI. Neuroimage. 1999;9(4):416–429. doi: 10.1006/nimg.1998.0419. [DOI] [PubMed] [Google Scholar]
  19. Hahn T, Heinzel S, et al. Neurovascular coupling in the human visual cortex is modulated by cyclooxygenase-1 (COX-1) gene variant. Cerebral cortex. 2011;21(7):1659–1666. doi: 10.1093/cercor/bhq236. [DOI] [PubMed] [Google Scholar]
  20. Hajdu MA, Heistad DD, et al. Effects of aging on mechanics and composition of cerebral arterioles in rats. Circ Res. 1990;66(6):1747–1754. doi: 10.1161/01.res.66.6.1747. [DOI] [PubMed] [Google Scholar]
  21. Hamzei F, Knab R, et al. The influence of extra- and intracranial artery disease on the BOLD signal in FMRI. Neuroimage. 2003;20(2):1393–1399. doi: 10.1016/S1053-8119(03)00384-7. [DOI] [PubMed] [Google Scholar]
  22. Handwerker DA, Gazzaley A, et al. Reducing vascular variability of fMRI data across aging populations using a breathholding task. Hum Brain Mapp. 2007;28(9):846–859. doi: 10.1002/hbm.20307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Handwerker DA, Ollinger JM, et al. Variation of BOLD hemodynamic responses across subjects and brain regions and their effects on statistical analyses. Neuroimage. 2004;21(4):1639–1651. doi: 10.1016/j.neuroimage.2003.11.029. [DOI] [PubMed] [Google Scholar]
  24. Harms MP, Melcher JR. Detection and quantification of a wide range of fMRI temporal responses using a physiologically-motivated basis set. Hum Brain Mapp. 2003;20(3):168–183. doi: 10.1002/hbm.10136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Henson RN, Price CJ, et al. Detecting latency differences in event-related BOLD responses: application to words versus nonwords and initial versus repeated face presentations. Neuroimage. 2002;15(1):83–97. doi: 10.1006/nimg.2001.0940. [DOI] [PubMed] [Google Scholar]
  26. Hoge RD, Atkinson J, et al. Linear coupling between cerebral blood flow and oxygen consumption in activated human cortex. Proc Natl Acad Sci U S A. 1999;96(16):9403–9408. doi: 10.1073/pnas.96.16.9403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Huettel SA, Singerman JD, et al. The effects of aging upon the hemodynamic response measured by functional MRI. Neuroimage. 2001;13(1):161–175. doi: 10.1006/nimg.2000.0675. [DOI] [PubMed] [Google Scholar]
  28. Jacobsen LK, Gore JC, et al. Impact of intravenous nicotine on BOLD signal response to photic stimulation. Magn Reson Imaging. 2002;20(2):141–145. doi: 10.1016/s0730-725x(02)00494-0. [DOI] [PubMed] [Google Scholar]
  29. Josephs O, Turner R, et al. Event-related fMRI. Human Brain Mapping. 1997;5(4):243–248. doi: 10.1002/(SICI)1097-0193(1997)5:4<243::AID-HBM7>3.0.CO;2-3. [DOI] [PubMed] [Google Scholar]
  30. Koehler RC, Gebremedhin D, et al. Role of astrocytes in cerebrovascular regulation. Journal of applied physiology. 2006;100(1):307–317. doi: 10.1152/japplphysiol.00938.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kwong KK, Belliveau JW, et al. Dynamic magnetic resonance imaging of human brain activity during primary sensory stimulation. Proc Natl Acad Sci U S A. 1992;89(12):5675–5679. doi: 10.1073/pnas.89.12.5675. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Lee AT, Glover GH, et al. Discrimination of large venous vessels in time-course spiral blood-oxygen-level-dependent magnetic-resonance functional neuroimaging. Magnetic resonance in medicine : official journal of the Society of Magnetic Resonance in Medicine / Society of Magnetic Resonance in Medicine. 1995;33(6):745–754. doi: 10.1002/mrm.1910330602. [DOI] [PubMed] [Google Scholar]
  33. Levin JM, Frederick BB, et al. Influence of baseline hematocrit and hemodilution on BOLD fMRI activation. Magn Reson Imaging. 2001;19(8):1055–1062. doi: 10.1016/s0730-725x(01)00460-x. [DOI] [PubMed] [Google Scholar]
  34. Levin JM, Ross MH, et al. Reduction in BOLD fMRI response to primary visual stimulation following alcohol ingestion. Psychiatry Res. 1998;82(3):135–146. doi: 10.1016/s0925-4927(98)00022-5. [DOI] [PubMed] [Google Scholar]
  35. Liu TT, Behzadi Y, et al. Caffeine alters the temporal dynamics of the visual BOLD response. Neuroimage. 2004;23(4):1402–1413. doi: 10.1016/j.neuroimage.2004.07.061. [DOI] [PubMed] [Google Scholar]
  36. Liu TT, Liau J. Caffeine increases the linearity of the visual BOLD response. Neuroimage. 2010;49(3):2311–2317. doi: 10.1016/j.neuroimage.2009.10.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Menon RS. The great brain versus vein debate. Neuroimage. 2011 doi: 10.1016/j.neuroimage.2011.09.005. [DOI] [PubMed] [Google Scholar]
  38. Mulderink TA, Gitelman DR, et al. On the use of caffeine as a contrast booster for BOLD fMRI studies. Neuroimage. 2002;15(1):37–44. doi: 10.1006/nimg.2001.0973. [DOI] [PubMed] [Google Scholar]
  39. Murphy K, Birn RM, et al. The impact of global signal regression on resting state correlations: are anti-correlated networks introduced? Neuroimage. 2009;44(3):893–905. doi: 10.1016/j.neuroimage.2008.09.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Noseworthy MD, Alfonsi J, et al. Attenuation of brain BOLD response following lipid ingestion. Hum Brain Mapp. 2003;20(2):116–121. doi: 10.1002/hbm.10131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Ollinger JM, Corbetta M, et al. Separating processes within a trial in event-related functional MRI. Neuroimage. 2001;13(1):218–229. doi: 10.1006/nimg.2000.0711. [DOI] [PubMed] [Google Scholar]
  42. Perthen JE, Lansing AE, et al. Caffeine-induced uncoupling of cerebral blood flow and oxygen metabolism: a calibrated BOLD fMRI study. Neuroimage. 2008;40(1):237–247. doi: 10.1016/j.neuroimage.2007.10.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Pineiro R, Pendlebury S, et al. Altered hemodynamic responses in patients after subcortical stroke measured by functional MRI. Stroke. 2002;33(1):103–109. doi: 10.1161/hs0102.100482. [DOI] [PubMed] [Google Scholar]
  44. Raoult H, Petr J, et al. Arterial spin labeling for motor activation mapping at 3T with a 32-channel coil: reproducibility and spatial accuracy in comparison with BOLD fMRI. Neuroimage. 2011;58(1):157–167. doi: 10.1016/j.neuroimage.2011.06.011. [DOI] [PubMed] [Google Scholar]
  45. Richter W, Richter M. The shape of the fMRI BOLD response in children and adults changes systematically with age. Neuroimage. 2003;20(2):1122–1131. doi: 10.1016/S1053-8119(03)00347-1. [DOI] [PubMed] [Google Scholar]
  46. Saad ZS, Ropella KM, et al. Analysis and use of FMRI response delays. Hum Brain Mapp. 2001;13(2):74–93. doi: 10.1002/hbm.1026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Sicard KM, Duong TQ. Effects of hypoxia, hyperoxia, and hypercapnia on baseline and stimulus-evoked BOLD, CBF, and CMRO2 in spontaneously breathing animals. Neuroimage. 2005;25(3):850–858. doi: 10.1016/j.neuroimage.2004.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Silva AC, Koretsky AP. Laminar specificity of functional MRI onset times during somatosensory stimulation in rat. Proceedings of the National Academy of Sciences of the United States of America. 2002;99(23):15182–15187. doi: 10.1073/pnas.222561899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Smith SM, Miller KL, et al. Network modelling methods for FMRI. Neuroimage. 2011;54(2):875–891. doi: 10.1016/j.neuroimage.2010.08.063. [DOI] [PubMed] [Google Scholar]
  50. Stephan KE, Weiskopf N, et al. Comparing hemodynamic models with DCM. Neuroimage. 2007;38(3):387–401. doi: 10.1016/j.neuroimage.2007.07.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Tjandra T, Brooks JC, et al. Quantitative assessment of the reproducibility of functional activation measured with BOLD and MR perfusion imaging: implications for clinical trial design. Neuroimage. 2005;27(2):393–401. doi: 10.1016/j.neuroimage.2005.04.021. [DOI] [PubMed] [Google Scholar]
  52. Uludag K. Transient and sustained BOLD responses to sustained visual stimulation. Magn Reson Imaging. 2008;26(7):863–869. doi: 10.1016/j.mri.2008.01.049. [DOI] [PubMed] [Google Scholar]
  53. Woolrich MW, Behrens TE, et al. Constrained linear basis sets for HRF modelling using Variational Bayes. Neuroimage. 2004;21(4):1748–1761. doi: 10.1016/j.neuroimage.2003.12.024. [DOI] [PubMed] [Google Scholar]
  54. Yu X, Glen D, et al. Direct imaging of macrovascular and microvascular contributions to BOLD fMRI in layers IV-V of the rat whisker-barrel cortex. Neuroimage. 2011 doi: 10.1016/j.neuroimage.2011.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES