Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

ArXiv logoLink to ArXiv
[Preprint]. 2023 Sep 12:arXiv:2303.01592v3. [Version 3]

JOSA: Joint surface-based registration with atlas construction enables accurate alignment of the brain geometry and function

Jian Li 1,2, Greta Tuckute 3,4, Evelina Fedorenko 3,4,5, Brian L Edlow 1,2, Adrian V Dalca 1,6,*, Bruce Fischl 1,6,*
PMCID: PMC10516111  PMID: 37744470

Abstract

Surface-based cortical registration is an important topic in medical image analysis and facilitates many downstream applications. Current approaches for cortical registration are mainly driven by geometric features, such as sulcal depth and curvature, and often assume that registration of folding patterns leads to alignment of brain function. However, functional variability of anatomically corresponding areas across subjects has been widely reported, particularly in higher-order cognitive areas. In this work, we present JOSA, a novel cortical registration framework that jointly models the mismatch between geometry and function while simultaneously learning an unbiased population-specific atlas. Using a semi-supervised training strategy, JOSA achieves superior registration performance in both geometry and function without requiring functional data at inference. This learning framework can be extended to any auxiliary data to guide spherical registration that is available during training but is difficult or impossible to obtain during inference, such as parcellations, architectonic identity, transcriptomic information, and molecular profiles.

Keywords: Cortical Registration, Semi-supervised Learning

1. Introduction

Image registration is a key research topic in medical image analysis. Deformable image registration establishes spatial correspondence between a pair of images via a nonlinear spatial transformation. This transformation is often obtained through an optimization procedure that maximizes a similarity measure. Approaches for registration of the brain are either directly based on 3D images from scans of the brain (volume-based) or based on features resampled onto the brain surface (surface-based) (Maintz and Viergever, 1998) or a combination of these (Postelnicu et al., 2009; Joshi et al., 2009). Surface-based registration of the human cerebral cortex, often referred to as cortical registration, extracts a variety of representative information from the brain images and solves the registration problem based on a surface matching framework.

Cortical registration methods achieve high accuracy in aligning the complex folding patterns of the cerebral cortex by matching the geometric features of the brain (Davatzikos and Bryan, 1996; Fischl et al., 1999b). It has been shown that inter-subject cortical registration can improve not only the statistical power of group functional analyses (van Atteveldt et al., 2004; Frost and Goebel, 2012), but also the predictability of cytoarchitecture of the brain (Yeo et al., 2010b; Fischl et al., 2008). Due to the geometric complexity of the cortex and the large variability among individuals, cortical registration remains a challenging task. Inter-subject surface alignment is commonly driven by geometric features that describe measures of cortical folding patterns, such as sulcal depth and mean curvature (Fischl et al., 1999b; Yeo et al., 2010a; Conroy et al., 2013). It is often assumed that an accurate registration of folding patterns will also successfully align brain function. However, the functional variability of anatomically corresponding areas across subjects has been widely reported, particularly in regions implicated in higher-order cognition (Fischl et al., 2008; Frost and Goebel, 2012). This variability implies that regions with different functional specializations may not be optimally aligned even when a perfect anatomical registration is achieved. There may exist many equally-good solutions for geometric registration with substantially different performance in registration of function. Another possible cause is the mismatch between geometry and function within each subject, i.e., different individuals may use a slightly different region to process the same function.

Traditional model-based deformable registration has been extensively studied (Fischl et al., 1999b; Yeo et al., 2010a; Sabuncu et al., 2010; Guntupalli et al., 2016; Robinson et al., 2014; Vercauteren et al., 2009; Nenning et al., 2017; Avants et al., 2008; Beg et al., 2005; Christensen et al., 1997; Van Essen et al., 1998). Typical strategies employ an iterative approach that seeks an optimal deformation field to warp a moving image to a fixed image. Methods usually involve optimization of a similarity measure between two feature maps, e.g., minimizing mean squared error (MSE) or maximizing normalized cross correlation, while regularizing the deformation field to have some desired property, such as smoothness and/or diffeomorphism. Widely used techniques for cortical surface registration map the surface onto the unit sphere and establish correspondence between feature maps in the spherical space (Fischl et al., 1999a). Conventional approaches, such as FreeSurfer (Fischl et al., 1999b), register an individual subject to a probabilistic population atlas by minimizing the geometry MSE weighted by the inverse variance of the atlas convexity, in a maximum a posteriori formulation. These anatomical registration methods have been adapted to functional registration by minimizing MSE on functional connectivity computed from functional magnetic resonance imaging (fMRI) data (Sabuncu et al., 2010). Spatial correspondence can also be maximized in a non-diffeomorphic manner by finding local orthogonal transforms that linearly combine features around each local neighborhood (Guntupalli et al., 2016, 2018). Recent discrete optimization approaches iteratively align local features using spherical meshes from low-resolution to high-resolution (Robinson et al., 2014, 2018). To encourage invertibility of the deformation field and preserve anatomical topology, diffeomorphism has been introduced into registration using an exponentiated Lie algebra, most often assuming a stationary velocity field (SVF) (Ashburner, 2007; Vercauteren et al., 2009). These strategies were extended to the sphere by regularizing the deformation using spherical thin plate spline interpolation (Yeo et al., 2010a). Several methods align functional regions, for example using Laplacian eigen embeddings computed from fMRI data (Nenning et al., 2017). These methods are successful but solve an optimization problem for each image pair, resulting in a high computational cost.

With the increased of popularity of deep learning techniques, many learning-based registration methods have been proposed. They can be categorized into supervised registration (Krebs et al., 2017; Sokooti et al., 2017; Yang et al., 2016; Cao et al., 2017) and unsupervised registration (Dalca et al., 2018a; Balakrishnan et al., 2019; Cheng et al., 2020a; Niethammer et al., 2019; Krebs et al., 2019; de Vos et al., 2019). Supervised registration aims to predict the deformation fields directly and evaluate the performance by comparing the predicted deformation with a “ground truth” deformation. However, due to the lack of the “ground truth”, deformation generated by iterative methods or simulation is often used as the target, which fundamentally limits the performance of supervised approaches (Sokooti et al., 2017; Yang et al., 2016; Cao et al., 2017). Further, the iterative registration methods are relatively slow which makes it impractical to generate large numbers of samples for supervised training, often resulting in overfitting. In contrast, Unsupervised registration methods predict a deformation field that is used to warp the moving image to the fixed image. The core idea of these methods is to employ a classical loss in the image space, thus forming an end-to-end training pipeline (de Vos et al., 2019; Balakrishnan et al., 2019). Semi-supervised methods employ additional information, such as segmentation or parcellation maps, to guide registration without requiring them during inference (Balakrishnan et al., 2019). Recent methods extend this strategy to the spherical domain by parameterizing the brain surfaces in a 2D grid that accounts for distortions (Cheng et al., 2020a) or directly on the sphere using spherical kernels (Zhao et al., 2021). These learning-based registration methods substantially improved registration speed by orders of magnitude at inference while achieving a superior or comparable registration accuracy relative to the iterative methods. However, these methods do not directly model structure-function mismatches within a subject.

In this work, we present a diffeomorphic cortical registration framework that 1) explicitly uses functional data to drive the geometric registration to optimally align function, and 2) explicitly models mismatches between geometry and function. We build on recent unsupervised spherical registration strategies (Balakrishnan et al., 2019; Cheng et al., 2020a) and use a joint deformation field shared by geometry and function to capture the relatively large differences among subjects. We introduce deformation fields that describe relatively small variations between geometry and function within each subject. To avoid the biases of existing anatomical templates, we simultaneously learn a population-specific atlas during training (Dalca et al., 2019). We develop a semi-supervised training strategy that uses task fMRI data to improve functional registration but without requiring task fMRI data during inference. However, in contrast to the term “unsupervised” commonly used in the literature (Balakrishnan et al., 2019; Cheng et al., 2020a), here we borrow the term “semi-supervised” to describe our strategy to highlight that auxiliary information, such as functional data, can be incorporated during training but are not required during inference. This semi-supervised training framework can also be easily extended to any auxiliary data that could be helpful to guide spherical registration but is difficult or impossible to obtain or undesirable to use during inference, such as parcellations, architectonic identity, transcriptomic information, and molecular profiles. Therefore, the registration at inference is solely based on the geometry of the subject surface, which further avoids the potential circularity issue in any subsequent analyses of functional or other auxiliary data. We demonstrate experimentally that the proposed framework yields improved registration performance in both anatomical and functional domains.

Some of the preliminary results have been presented in (Li et al., 2023). The current paper provides a more detailed description of the method, novel experimental results with a comparison to an extensive list of the state-of-the-art methods, as well as detailed ablation studies.

2. Methods

We describe Joint Spherical registration and Atlas building (JOSA), a method for surface-based cortical registration with simultaneous atlas construction that explicitly models the cross-subject variation in the relationship between anatomy and functional properties in each subject.

2.1. Generative model

Fig. 1 shows the graphical representation for the proposed generative model. Let A be an unknown population atlas with all geometric and functional cortical features of interest. We propose a generative model that describes the formation of the subject geometric Ig and functional If features by first warping the atlas A by a subject deformation field ϕj. This model characterizes the differences between subjects and results in a joint multi-feature image Ij. Geometric feature Ig is formed given an additional field ϕg that deforms the geometric features in Ij, and similarly for If and ϕf.

Figure 1:

Figure 1:

Graphical representation of the generative model. Circles are random variables. Rounded squares indicate parameters. Shaded quantities are observations. The big plate represents replication. A represents the global atlas, I the input image, ϕ deformation field. The subscript j, g, and f stand for joint, geometry, and function, respectively.

2.1.1. Deformation priors

Let ϕij, ϕig, ϕif be the joint, geometric, and functional deformation fields for each subject i, respectively. All variables in the model are subject-specific, except for the global atlas A, and we omit i for our derivation. We impose the deformation priors

p(ϕj)~exp{(λjuj2+αju¯j2)}p(ϕg)~exp{λgug2}p(ϕf)~exp{λfuf2} (1)

where uj is the spatial displacement for ϕj = Id + uj, ∇uj is its spatial gradient, and ūj = 1/N i uij, N is the number of subjects, and similarly for ug and uf. The gradient term encourages smooth deformations, while the mean term encourages an unbiased atlas A by penalizing the average deformation over the entire dataset and hence favouring atlases that are “close” to every subject in the training set. (Dalca et al., 2019).

2.1.2. Data likelihood

We treat the latent joint image Ij as a noisy warped atlas,

p(Ijϕj;A)=𝒩(Ij;ϕjA,σ2I) (2)

where N (·; μ, Σ) is the multivariate Gaussian distribution with mean μ and covariance Σ, ◦ represents spatial transformation, σ represents additive noise, and I is the identity matrix. The geometric feature image Ig is then a noisy observation of a further-moved joint image Ij:

p(Igϕg,Ij)=𝒩(Ig;ϕgIj,σ2I). (3)

Therefore, the complete geometric image likelihood is given by:

p(Igϕg,ϕj;A)=Ijp(Igϕg,Ij)p(Ijϕj;A)=Ij𝒩(Ig;ϕgIj,σI)𝒩(Ij;ϕjA,σI)=Ij𝒩(Ij;ϕg1Ig,σI)𝒩(Ij;ϕjA,σI)=*Ij𝒩(Ij;μc,Σc)𝒩(ϕg1Ig;ϕjA;2σ2I)=𝒩(ϕg1Ig;ϕjA;2σ2I)=𝒩(Ig;ϕgϕjA;2σ2I) (4)

where in * we used an identity of the product of two Gaussian distributions, and μc, Σc are constants. We use a similar model for the functional image If.

2.1.3. Learning

Let Φ = j, ϕg, ϕf} and I = {Ig, If}. We estimate Φ by minimizing the negative log likelihood,

(ΦI;A)=logp(ΦI;A)=logp(IΦ;A)logp(Φ)=logk{g,f}p(Ikϕj,ϕk,A)logk{j,g,f}p(ϕk)=12σ2(IgϕgϕjA2+IfϕfϕjA2)+λjuj2+λgug2+λfuf2+αju¯j2+ const. (5)

2.2. Neural network approach with semi-supervised training strategy

We use a neural network to approximate the function hθ,A(I) = Φ, where θ are network parameters. Fig. 2 shows the proposed network architecture. To work with surface-based data, the cortical surface of each subject is inflated into a sphere and then rigidly registered to an average space using FreeSurfer (Fischl, 2012). Geometric and functional features are parameterized onto a 2D grid using a standard conversion from Cartesian coordinates to spherical coordinates, resulting in a 2D image for each input (Cheng et al., 2020a).

Figure 2:

Figure 2:

Network architecture and preprocessing pipeline. The network takes the geometric features from the subject and outputs one joint and two separate deformation fields. The joint and geometric deformation are composted for registration of folding patterns, and likewise for function. Task fMRI data were used for evaluating the functional loss only in a semi-supervised manner, hence, functional task data are not required during inference.

The network takes such a parameterized geometric image as input and outputs three velocity fields in the parameterized coordinates, each followed by an integration layer generating the corresponding deformation field. The joint deformation ϕj, which models the relatively large inter-subject variance, is shared among and composed with individual deformations ϕg and ϕf. We note that the separation of ϕg from ϕf enables us to explicitly model structure-function variability across subjects. This is of critical importance from a neuroscientific perspective because it has been shown that some functional areas are better predicted by folding patterns than others (Fischl et al., 2008). The losses ℒgeom and Lfunc shown in Fig. 2 represents the geometric part and the functional part of the data fidelity terms in Eq.(5). They are evaluated in the atlas and subject space, which also helps avoid atlas drift (Aganj et al., 2017) during atlas construction. Lreg represents the regularization and centrality terms which encourages smooth deformations and an unbiased estimation of the atlas.

In this study, we learn network parameters and use task fMRI data in a semi-supervised manner. As shown in the green block in Fig. 2, the task fMRI data and the corresponding functional atlas are not input into the neural network. Rather they are used only for evaluating the functional terms in the loss function (5). This obviates the need for functional data during inference, as the deformation fields can be inferred using only geometric features. The proposed framework is flexible in the sense that auxiliary data, perhaps from different modalities, can be integrated into the framework for simultaneous multi-modality registration.

2.3. Implementation

We implemented a Unet-like (Ronneberger et al., 2015) network based on the core architecture in VoxelMorph (https://voxelmorph.net) (Balakrishnan et al., 2019; Dalca et al., 2019). We used a 5-layer encoder with [128, 256, 384, 512, 640] filters and a symmetric decoder followed by 2 more convolutional layers with [64, 32] filters. Each layer involves convolution, max-pooling/down-sampling, and LeakyReLU activation. The spherical parameterization leads to denser sampling grids for regions at higher latitudes. To account for the difference in sampling density, we performed prior and distortion corrections identical to that described in (Cheng et al., 2020a). In short, weights proportional to sin(θ), where θ is the elevation, were used to correct the distortion. (Cheng et al., 2020a) also found that varying the locations of the poles in the projection had little impact on the resulting registration. The parameterized images were standardized identically but separately for structural and functional features, where the median was subtracted for each feature image followed by a division by the standard deviation.

During training, we randomly sampled the training data into mini-batches with a batch size of 8. For each batch, we augmented the data by adding Gaussian random deformations with a maximum σ = 8 and a proper spherical topology and distortion correction at each spatial location. We further augmented the data by adding Gaussian noise with σ = 1 for geometric features and σ = 6 for functional features. We used the Adam optimizer (Kingma and Ba, 2014) with an initial learning rate of 10−3. The learning rate was scheduled to decrease linearly to 10−4 within the first 500 epochs and then reduced by a factor of 0.9 if the validation loss does not decrease after every 100 epochs. The relative weights between functional loss and geometric loss were set to 0.7:0.3 empirically. We set the regularization hyperparameter λj in Eq. (5) for the joint (large) deformation to be 0.1 and λg and λf for the individual (small) deformations to be 0.2.

The atlases, as part of the network parameters, were initialized using Gaussian random noise and automatically learned during training. We exploited a three-stage training strategy to further avoid atlas expansion or drift (Aganj et al., 2017): 1) we performed a training run using a single deformation with geometric features only, so that the network learned a geometric atlas; 2) we used and fixed the geometric atlas obtained in step one and trained a JOSA network to learn the functional atlas; and 3) we fixed both the geometric and the functional atlas from the previous two steps and performed a final training for the composite deformation fields. We used TensorFlow (Abadi et al., 2016) with Keras front-end (Chollet, 2018) and the Neurite package (Dalca et al., 2018b), and all experiments were conducted in a Dell Workstation with dual Intel Xeon Silver 6226R CPUs and an Nvidia RTX6000 GPU. The source code and the trained model of JOSA will be released to the public at https://voxelmorph.net as well as integrated as part of FreeSurfer (Fischl, 2012) package.

3. Experiments

3.1. Language task fMRI data

We used task fMRI data from a large-scale language mapping study (Lipkin et al., 2022) (N = 800), where the language network was functionally localized using a task that contrasts reading/listening of sentences versus a perceptually matched control (such as strings of nonwords or degraded speech), collected using a standard blocked design. Ten different versions of the language localizer task were used (Table 2 in Lipkin et al. (2022)), with the vast majority of subjects (~77%) completing a version that contrasted reading of sentences with nonwords strings. In this localizer version, each stimulus was presented one word/nonword at a time at the rate of 450 ms per word/nonwords (12 words/nonwords per stimulus). Each stimulus was preceded by a 100 ms blank screen and followed by a 400 ms screen showing a picture of a finger pressing a button, and a blank screen for another 100ms, for a total trial duration of 6s. Experimental blocks lasted 18s, and fixation blocks lasted 14s. Each run (consisting of 5 fixation blocks and 16 experimental blocks) lasted 358s. Subjects completed 2 runs. Subjects were instructed to read attentively and press a button whenever they saw the finger-pressing picture on the screen. Structural and functional data were collected on a 3T Siemens Trio scanner at the Athinoula A. Martinos Imaging Center at the McGovern Institute for Brain Research at MIT. T1-weighted images were collected in 176 sagittal slices with 1 mm isotropic resolution. Functional data (BOLD) were acquired using an EPI sequence with 4 mm thick near-axial slices, 2.1 mm × 2.1 mm in-plane resolution, TR = 2,000 ms, and TE = 30 ms. We preprocessed the data using FreeSurfer v6.0.0 as described in (Lipkin et al., 2022). In brief, data were sampled onto the FreeSurfer average space (fsaverage), motion corrected, registered using the middle time point of each run, and spatially smoothed with a 4mm FWHM Gaussian filter. The subjects’ surfaces were reconstructed from the T1 images (default recon-all parameters). A “sentence vs. nonword” contrast t-map was originally generated for each subject using first-level GLM analysis based on the blocked design, and sampled on the reconstructed surface in the fsaverage space (Lipkin et al., 2022). The t-maps were resampled back onto the individual surfaces using FreeSurfer’s ‘mri_surf2surf’ utility as functional features for registration in this work. We randomly split the data into a training set with 600 subjects, a validation set with 100 subjects, and a test set with the remaining 100 subjects.

3.2. Baseline

We compare the registration performance of JOSA to FreeSurfer (Fischl et al., 1999b), Multimodel Surface Matching (MSM) (Robinson et al., 2014), the improved version of MSM with higher-order constraint (MSM-HOC) (Robinson et al., 2018), and SphereMorph (Cheng et al., 2020a) as surface registration baselines. For all baseline methods, we used three geometric features that quantify the folding patterns of the cortex to drive the registration, namely the mean curvature of the inflated surface (?h.inflated.H), the sulcal depth map (?h.sulc) and the mean curvature (?h.curv) of the non-inflated white matter surface, all available as part of the FreeSurfer standard outputs. For FreeSurfer registration, we ran mris_register (https://surfer.nmr.mgh.harvard.edu/fswiki/mris_register) to register each subject to fsaverage with the configurations used in their 2014 and 2018 papers, respectively. For a fair comparison, both versions of MSM were executed in a hierarchical manner such that ?h.inflated.H was used for initial coarse alignment, then refined by ?h.sulc and ?h.curv, consecutively. For SphereMorph (https://voxelmorph.net), we used fsaverage as the registration target and trained the network to predict a single deformation field purely based on the three geometric features. The predicted deformation is then used in turn to warp each subject’s functional data to the fsaverage space at the inference time.

3.3. Evaluation

Qualitatively, we computed the group mean images of both the geometric and the functional data in the test set for each registration method. We then visualized them by superimposing the functional group mean map with the curvature group mean map using Freeview (Fischl, 2012).

We quantified the geometric registration accuracy using the correlation between the registered individual data and the group mean (Cheng et al., 2020a). Specifically, we computed the Pearson correlation ck = corr(Ik,Ī) between the individual image Ik to the group mean image Ī = 1/N k Ik for subject k, where N is the number of subjects. We quantified the functional registration accuracy using the size of the overlapped regions between the suprathreshold individual functional maps and the suprathreshold group mean. We expect a better registration will have a larger surface area in the individual functional active regions as well as a larger overlap with the group mean for a reasonably conservative threshold. This also helps avoid a potential evaluation bias when a single-task contrast is used where only a fraction of the surface area is active during tasks. Specifically, we computed |sk=|IkthI¯th|, where Ikth is the suprathreshold individual image for subject k at threshold th and similarly for the group image Īth, and is the measure of cardinality. We used a threshold of 3 in this evaluation, where the suprathreshold regions for the group mean are shown in Fig. 3.

Figure 3:

Figure 3:

Group average curvature map of 100 test subjects overlaid with the group average functional language activation map. (a) Rigid; (b) FreeSurfer; (c) MSM; (d) MSM-HOC; (e) SphereMorph; (f) JOSA. The language activation map is defined based on the contrast between well-formed sentences and a perceptually matched control stimulus. In the curvature maps, dark gray indicates sulci and light gray indicates gyri. The color bar for functional data (t-values) is shown on the right. White arrows highlight the key regions in functional alignment where large functional variability across subjects is often observed. JOSA achieved a substantially better alignment in both folding patterns and function with stronger functional response and sharper transition from task-active to non-active regions.

To better demonstrate the consistency in registration improvement across subjects, we assessed the pair-wise geometric correlation difference ckJOSAckRigid between our proposed method JOSA and the rigid alignment, and similarly for FreeSurfer, MSM, MSM-HOC, and SphereMorph (i.e., the correlation difference for the same subject after registration in comparison to the rigidly aligned version). We similarly assessed the pair-wise functional difference skJOSAskRigid.

In functional evaluations, to further investigate the impact of the selected threshold th, we computed the sum of the superthreshold t-values in the group mean jI¯jth as a function of the threshold th ∈ {3, …, 5} We also looked into the two individual factors that have impact on the sum of the suprathreshold t-values, namely the number of vertices that have t-values above the threshold M=|I¯th| as well as the average of the suprathreshold t-values within those regions 1/MjI¯jth The former reflects the size of the functional active regions and the latter reflects the strength of the functional response.

We also compared the computational efficiency of the registration methods by measuring the run time the registration procedure for each of the test subjects. Iterative methods (FreeSurfer, MSM, and MSM-HOC) were run on a single CPU thread, while the learning-based methods (SphereMorph and JOSA) were evaluated using both CPU and a single GPU. See Section 2.3 for hardware details.

3.4. Stability of Atlas Construction

Bias towards the initial selected target is a well known issue in atlas construction (Fischl et al., 1999b; Dickie et al., 2017; Cheng et al., 2020b). To investigate the stability and the potential bias of atlas construction using JOSA, we repeated the geometric atlas learning procedure 6 times, each time with randomly selected batches of subjects and their orders through the training process. We qualitatively visualized the difference between each run to the average of all runs. To quantitatively measure the stability of the learned atlases, for each pair of the learned atlases with the 6 trials (hence 15 pairs in total), we computed Pearson correlation between the pairs of atlases.

3.5. Ablation studies

We conducted the following two ablation studies: 1) To illustrate benefit of learning the population-specific and unbiased atlas, we trained two JOSA networks with the only difference being in the registration target, where we used rigid average and FreeSurfer average for comparison to our learned atlas; 2) To illustrate the power of modeling the difference between geometry and function, we trained an additional JOSA network with a single deformation field that was used to align both geometry and function for comparison to our two separate deformations.

4. Results

4.1. Qualitative Comparison

Fig. 3 shows the group mean curvature maps (one of the geometric features) of the 100 test subjects overlaid with the group mean functional language activation maps for each of the registration methods. A better registration leads to a stronger average response (larger yellow areas) to language stimuli and a sharper transition from task-positive to task-negative regions (narrower margin from yellow to transparent regions). FreeSurfer, MSM, MSM-HOC, and SphereMorph significantly improved the registration of the folding patterns compared to the rigid alignment, albeit with some noticeable but minor differences in the average curvature patterns. However, all these methods only yield marginal improvement of the functional alignment. In contrast, JOSA achieved a substantially better alignment in both folding patterns and function. In particular, the predominant language region in the superior temporal gyrus shows a substantially stronger response and clearer functional boundaries. We also find additional/stronger language-responsive regions in the frontal eye field, the area in the inferior frontal gyrus (BA 44/45 near the classic Broca’s area), as well as the posterior section of the inferior parietal lobule as indicated by the white arrows, all consistent with the probabilistic language atlas in the earlier work (Lipkin et al., 2022).

4.2. Quantitative Evaluation

Quantitatively, Fig. 4 (a) shows the pair-wise geometric correlation difference between each of the registration methods to the rigid alignment, and Fig. 4 (b) shows the pair-wise functional suprathreshold overlap difference. Values higher than zero indicate consistent registration improvement over rigid alignment. Fig. S1 in the supplementary material shows the original correlation/overlap values without taking pair-wise differences. Iterative methods such as FreeSurfer and MSM substantially improved geometric correlations over rigid alignment (by ~0.3). Learning-based methods achieved a further significant improvement (by ~0.43) and were comparable between SphereMorph and JOSA. In contrast, only JOSA substantially improved functional registration, while all other methods did not improve or only marginally improved functional registration compared to rigid alignment.

Figure 4:

Figure 4:

Violin plot of pairwise difference between each registration method to rigid alignment. (a) Geometric difference measured by correlation; (b) Functional difference measured by the suprathreshold (t-values >= 3) functional overlaps. JOSA yielded substantial higher correlation/larger overlap against rigid alignment compared to baseline methods and the improvement is consistent across subjects as shown by the pairwise difference measures.

Fig. 5 shows the total suprathreshold t-values of the group mean functional image for each registration method as a function of the threshold. The total suprathreshold t-values are measured as the sum of the t-scores over the regions that have values above the threshold. As shown in Fig. 5, all geometry-based registration methods yielded similar functional total t-values to that using only rigid alignment, while JOSA, in contrast, produces substantially higher functional total t-values. This significant difference between JOSA and baseline methods were consistent regardless of the choice of the threshold. We also note that the higher total t-values may come from either a larger region where the functional values exceeded the threshold or a stronger response or both. To further illustrate the effect, we plotted the curves for (a) the number of vertices within regions that have functional values above the threshold and (b) the averaged functional values within those regions in the Supplementary Fig. S2. We can see that JOSA has both larger functional regions as well as a stronger functional response within those regions, and the former contributes more than the latter.

Figure 5:

Figure 5:

Functional total suprathreshold t-values of the group means as a function of the threshold. The total suprathreshold t-values in JOSA is substantially higher than that using baseline methods regardless of the selected threshold.

4.3. Run time

Table 1 summarizes the run time for each registration method. For a single subject registration, all methods started with the spherical cortical surface, and the output is the warped spherical surface. The default FreeSurfer takes approximately 9 min to complete the spherical registration procedure. MSM and MSM-HOC operated directly on the sphere without parameterization but took significantly longer to execute. In contrast, the learning-based methods take, on average, only 0.097 min ( 5.8 sec!), providing a speed up by two orders of magnitude compared to the iterative methods. The registration speed can be further accelerated by ~ 0.2 sec per subject with a GPU implementation. We note that this 0.2 sec CPU-to-GPU improvement is significant because, in both SphereMorph and JOSA, the parameterization and resampling back to sphere take most of the time during registration, and the actual inference time through the convolutional neural network is ~ 0.06 sec using GPU. That means if the parameterization and resampling can be parallelized and precomputed on a computer cluster, registering 1000 subjects can be done in under 1 minute.

Table 1:

Comparison of the run time for individual subject registration.

Method CPU run time (min) GPU run time (min)
FreeSurfer 8.82 ± 2.47
MSM 21.05 ± 1.87
MSM-HOC 57.09 ± 8.86
SphereMorph 0.0969 ± 0.0081 0.0947 ± 0.0085
JOSA 0.0972 ± 0.0089 0.0945 ± 0.0083

4.4. Atlas Stability

Fig. 6 shows the geometric atlases learned from the 6 repeated training runs, which are identical in all respects except that the subjects and their orders are randomly selected as the input to the training. Fig. 6 (a) shows the mean averaged over the 6 runs and (b) - (g) shows the absolute difference between each run and the mean. These atlases are highly consistent across runs with all major folds nearly identical to each other and subtle differences in the prefrontal, inferior parietal, and middle temporal regions as reflected in the difference images. This strong consistency is quantitatively confirmed by the Pearson correlations 0.94 ± 0.01 among all 15 possible pairs of these atlases. The results demonstrate that the atlas construction and its learning procedure in our proposed JOSA method are stable and not biased towards any specific subject.

Figure 6:

Figure 6:

Learned geometric atlases from 6 repeated training with randomly selected subjects. (a) The mean atlas averaged over the 6 runs. (b) - (g) The absolute difference between each run to the mean. Red indicates sulci and blue indicates gyri. The learned atlases are highly consistent across runs all major folds nearly identical to each other and subtle difference in prefrontal, inferior parietal, and middle temporal areas.

4.5. Ablation Study

The results for the ablation studies are shown in Fig. 7. The first and the second column shows the result using separate deformations for geometry and function as with JOSA but varying the atlas used as the registration target. The third column shows the result using a single deformation for both geometry and function but the same learned atlas as with JOSA. The last column shows the full JOSA result again for easy reference. Again these are pair-wise differences from each registration method to the rigid result. From these ablation results, we can see that the substantial improvement in geometry is mainly attributable to a better atlas, whereas the improvement in the registration of function is primarily due to the separate modeling of deformation between geometry and function. We highlight the importance of this separate modeling of deformations. It is evident from the third column that the registration performance is sub-optimal for both geometry and function if they share the same deformation, which is commonly assumed (anatomy predicts function).

Figure 7:

Figure 7:

Violin plot of pair-wise difference between each registration method to rigid alignment for ablation studies. (a) Geometric difference measured by correlation; (b) Functional difference measured by the suprathreshold overlaps. Geometric improvement is mainly attributable to a better atlas where as functional improvement is primarily due to the separate modeling of the deformation fields.

5. Conclusion and discussion

In this work, we presented JOSA, a novel registration framework that jointly models the relationship between folding patterns and cortical function. In the early 1900s, the correspondence between the two was discovered in primary motor and sensory regions where a cortical homunculus was drawn to illustrate the distorted mapping of human body parts to sensorimotor functions (Penfield and Boldrey, 1937). Similarly, the functional properties of primary visual cortex have been well studied, where the central and peripheral visual fields are located in the posterior and anterior part of the calcarine sulcus, respectively (Dougherty et al., 2003; Sereno et al., 1995; Tootell et al., 1998; Engel, 1997; Wandell et al., 2007; Engel et al., 1994), and the topographic mapping of the visual field to the visual cortex follows the complex-logarithm rule (Schwartz, 1980; Polimeni et al., 2010). However, the anatomo-functional mapping in higher-order functional areas, e.g., language, emotion, etc., is highly variable among individuals compared with the primary regions (Steinmetz and Seitz, 1991; Fischl et al., 2008; Frost and Goebel, 2012; Robinson et al., 2014). This variability is mainly attributable to the non-unique optimal solutions for geometric registration, where, without the guidance from functional data, their performance can be sub-optimal in registration of function. For example, there are several secondary folds around the active language region near the frontal eye field. Aligning any of them may yield equally good geometric registration, but only one of them may be optimal for functional registration. This is not obviously without access and visualization of the functional data of each individual subject. By using a semi-supervised training strategy with functional data as the guidance, JOSA is able to choose the “right” deformation from many valid geometric solutions that produces optimal functional registration. As evidenced in Fig. 4, it is clear that both JOSA and SphereMorph (no functional data were used) achieved comparable performance in geometric alignment, but only JOSA substantially improved functional registration. Moreover, a structure-function mismatch may exist and be different across individuals, i.e., different subjects may use slightly different regions to process similar function. By allowing small deviations of the functional alignment from geometric alignment, JOSA yields superior performance in registration of both folding patterns and functional properties relative to other traditional or learning-based methods. The result in the ablation study shown in Fig. 7, the 3rd column, illustrated this point, where the both the geometric and functional registration are sub-optimal when a single (shared between geometry and function) deformation is used.

In contrast to many other learning-based methods, JOSA also estimates a population-specific atlas as the registration target during training. Fig. 8 shows the learned atlas in (c) compared to atlases constructed by averaging over rigidly registered training subjects (a) and averaged FreeSurfer registered training subjects in (b). We find that the JOSA atlas provides more anatomical definition, supporting registration with higher resolution and finer details, which also contributes to the improved performance of JOSA (particularly in geometric registration).

Figure 8:

Figure 8:

Comparison of atlases. (a) Rigid atlas; (b) FreeSurfer atlas; (c) JOSA atlas. The learned atlas from JOSA provides more anatomical definition than rigid or FreeSurfer atlas, thus supporting registration with higher resolution and finer details.

Atlas or template construction has been widely studied in classical iterative approaches (Collins et al., 1995; Fischl et al., 1999b; Desikan et al., 2006; Destrieux et al., 2010; Joshi et al., 2022). These methods build atlases by repeatedly registering subjects to an estimated atlas, and estimating a new atlas by averaging the registered subjects (Dickie et al., 2017). Inherited from this iterative construction approach, a bias towards the initial subject may be introduced to the atlas. JOSA, as one of the very recent learning-based methods, not only facilitates faster atlas construction (Lee et al., 2022b,a), but also potentially avoids the bias to any individual subject by random selection of subjects as inputs during training (Cheng et al., 2020b), as illustrated in Fig. 6. In this work, JOSA learned an atlas specific to the population that was used for training (Lipkin et al., 2022). It is, however, straightforward to extend this framework to learn any population-specific atlases, optionally conditioned on clinical attributes if desired (Dalca et al., 2019; Dey et al., 2021; Ding and Niethammer, 2022).

Using a semi-supervised training strategy, the deformation fields can be solely determined from geometric features, which can be derived from a single T1-weighted MR scan. Therefore, JOSA not only lifts the burden of acquiring functional data during inference, but also provides capacity for extension to other comprehensive data modalities that are related to geometry but difficult or even impossible to acquire, such as architectonics, transcriptomics, or proteomics. This geometry-only-based registration model further avoids the circularity issue in any subsequent analyses of functional or any other auxiliary data. In another words, if the functional data is used as one of the inputs to drive the registration, then any evaluation or analysis of the functional data after registration will be potentially biased as the network has observed the data before.

The current approach for better registration of brain function is limited by the use of one task fMRI localizer, the language localizer, used in this study. The partial coverage of the brain by the language-active regions may not provide useful registration guidance for functions that use different parts of the cerebral cortex. We therefore plan to extend JOSA to training with multi-modal datasets, enabling accurate registration of an array of brain functional responses. Although the language dataset used in this study (Lipkin et al., 2022) is one of the largest task fMRI datasets publicly available, another caveat in the training of the registration network is the limited number of samples (subjects). Because the functional data were used only in loss evaluation through the semi-supervised block (i.e., they were not seen by the convolutional layers directly), it poses a significant challenge for the network to learn the anatomo-functional relationships, hence often resulting in an overfitting issue when the sample size is small. We also acknowledge the difficulty in data acquisition. One potential solution in the future is to use a pooled dataset from different task contrasts or even resting-state fMRI data, where subjects may have different functional data available, and the network can be trained dynamically using the corresponding available features, atlases, and loss functions.

Supplementary Material

1

Acknowledgments

Support for this research was provided in part by the BRAIN Initiative Cell Census Network grant U01MH117023, the National Institute for Biomedical Imaging and Bioengineering (R01EB023281, R01EB006758, R21EB018907, R01EB019956, P41EB030006), the National Institute on Aging (R56AG064027, R01AG064027, R01AG008122, R01AG016495, R01AG070988), the National Institute of Mental Health (UM1MH130981, R01MH123195, R01MH121885, RF1MH123195), the National Institute for Neurological Disorders and Stroke (R01NS0525851, R21NS072652, R01NS070963, R01NS083534, U01NS086625, U24NS10059103, R01NS105820, R21NS109627, RF1NS115268, U01NS132181), the National Institute on Deafness and Other Communication Disorders (R01DC016607, R01DC016950), the NIH Director’s Office (DP2HD101400), the James S. McDonnell Foundation. This work was also made possible by the resources provided by Shared Instrumentation Grants S10RR023401, S10RR019307, and S10RR023043, the NIH Blueprint for Neuroscience Research (U01MH093765), part of the multi-institutional Human Connectome Project. Additional support was provided by the Amazon Fellowship from the Science Hub, administered by the MIT Schwarzman College of Computing, and the International Doctoral Fellowship from the American Association of University Women (AAUW). Much of the computation resources required for this research was performed on computational hardware generously provided by the Massachusetts Life Sciences Center (https://www.masslifesciences.com).

References

  1. Abadi M., Agarwal A., Barham P., Brevdo E., Chen Z., Citro C., Corrado G.S., Davis A., Dean J., Devin M., Ghemawat S., Goodfellow I., Harp A., Irving G., Isard M., Jia Y., Jozefowicz R., Kaiser L., Kudlur M., Levenberg J., Mane D., Monga R., Moore S., Murray D., Olah C., Schuster M., Shlens J., Steiner B., Sutskever I., Talwar K., Tucker P., Vanhoucke V., Vasudevan V., Viegas F., Vinyals O., Warden P., Wattenberg M., Wicke M., Yu Y., Zheng X., 2016. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. URL: http://arxiv.org/abs/1603.04467, doi:doi: 10.48550/arXiv.1603.04467, arXiv:1603.04467. [DOI] [Google Scholar]
  2. Aganj I., Iglesias J.E., Reuter M., Sabuncu M.R., Fischl B., 2017. Mid-space-independent deformable image registration. NeuroImage 152, 158–170. URL: https://www.sciencedirect.com/science/article/pii/S1053811917301660, doi:doi: 10.1016/j.neuroimage.2017.02.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Ashburner J., 2007. A fast diffeomorphic image registration algorithm. NeuroImage 38, 95–113. URL: https://www.sciencedirect.com/science/article/pii/S1053811907005848, doi:doi: 10.1016/j.neuroimage.2007.07.007. [DOI] [PubMed] [Google Scholar]
  4. Avants B.B., Epstein C.L., Grossman M., Gee J.C., 2008. Symmetric diffeomorphic image registration with cross-correlation: Evaluating automated labeling of elderly and neurodegenerative brain. Medical Image Analysis 12, 26–41. URL: https://www.sciencedirect.com/science/article/pii/S1361841507000606, doi:doi: 10.1016/j.media.2007.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Balakrishnan G., Zhao A., Sabuncu M.R., Guttag J., Dalca A.V., 2019. VoxelMorph: A learning framework for deformable medical image registration. IEEE Transactions on Medical Imaging 38, 1788–1800. URL: https://ieeexplore.ieee.org/document/8633930/, doi:doi: 10.1109/TMI.2019.2897538. [DOI] [PubMed] [Google Scholar]
  6. Beg M.F., Miller M.I., Trouvé A., Younes L., 2005. Computing large deformation metric mappings via geodesic flows of diffeomorphisms. International Journal of Computer Vision 61, 139–157. URL: 10.1023/B:VISI.0000043755.93987.aa, doi:doi:10.1023/B:VISI.0000043755.93987.aa. [DOI] [Google Scholar]
  7. Cao X., Yang J., Zhang J., Nie D., Kim M., Wang Q., Shen D., 2017. Deformable image registration based on similarity-steered CNN regression, in: Medical Image Computing and Computer Assisted Intervention, Cham. pp. 300–308. URL: http://link.springer.com/10.1007/978-3-319-66182-7_35, doi:doi: 10.1007/978-3-319-66182-7_35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Cheng J., Dalca A.V., Fischl B., Zöllei L., 2020a. Cortical surface registration using unsupervised learning. NeuroImage 221, 117161. URL: http://www.sciencedirect.com/science/article/pii/S1053811920306479, doi:doi: 10.1016/j.neuroimage.2020.117161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Cheng J., Dalca A.V., Zollei L., 2020b. Unbiased atlas construction for neonatal cortical surfaces via unsupervised learning, in: Hu Y., Licandro R., Noble J.A., Hutter J., Aylward S., Melbourne A., Abaci Turk E., Torrents Bar-rena J. (Eds.), Medical Ultrasound, and Preterm, Perinatal and Paediatric Image Analysis, Springer International Publishing, Cham. pp. 334–342. URL: https://link.springer.com/10.1007/978-3-030-60334-2_33, doi:doi: 10.1007/978-3-030-60334-2_33. [DOI] [Google Scholar]
  10. Chollet F., 2018. Keras: the Python deep learning library. Astrophysics Source Code Library , ascl:1806.022URL: https://ui.adsabs.harvard.edu/abs/2018ascl.soft06022C. [Google Scholar]
  11. Christensen G., Joshi S., Miller M., 1997. Volumetric transformation of brain anatomy. IEEE Transactions on Medical Imaging 16, 864–877. URL: http://ieeexplore.ieee.org/document/650882/, doi:doi: 10.1109/42.650882. [DOI] [PubMed] [Google Scholar]
  12. Collins D.L., Holmes C.J., Peters T.M., Evans A.C., 1995. Automatic 3D model-based neuroanatomical segmentation. Human Brain Mapping 3, 190–208. URL: https://onlinelibrary.wiley.com/doi/10.1002/hbm. 460030304, doi:doi: 10.1002/hbm.460030304. [DOI] [Google Scholar]
  13. Conroy B.R., Singer B.D., Guntupalli J.S., Ramadge P.J., Haxby J.V., 2013. Inter-subject alignment of human cortical anatomy using functional connectivity. NeuroImage 81, 400–411. URL: https://linkinghub.elsevier.com/retrieve/pii/S1053811913004990, doi:doi: 10.1016/j.neuroimage.2013.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Dalca A., Rakic M., Guttag J., Sabuncu M., 2019. Learning conditional deformable templates with convolutional networks, in: Advances in Neural Information Processing Systems, Curran Associates, Inc. URL: https://papers.nips.cc/paper/2019/hash/bbcbff5c1f1ded46c25d28119a85c6c2-Abstract.html. [Google Scholar]
  15. Dalca A.V., Balakrishnan G., Guttag J., Sabuncu M.R., 2018a. Unsupervised learning for fast probabilistic diffeomorphic registration, in: Medical Image Computing and Computer Assisted Intervention, Cham. pp. 729–738. doi:doi: 10.1007/978-3-030-00928-1_82. [DOI] [Google Scholar]
  16. Dalca A.V., Guttag J., Sabuncu M.R., 2018b. Anatomical priors in convolutional networks for unsupervised biomedical segmentation, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 9290–9299. URL: https://openaccess.thecvf.com/content_cvpr_2018/html/Dalca_Anatomical_Priors_in_CVPR_2018_paper.html. [Google Scholar]
  17. Davatzikos C., Bryan N., 1996. Using a deformable surface model to obtain a shape representation of the cortex. IEEE Transactions on Medical Imaging 15, 785–795. doi:doi: 10.1109/42.544496. [DOI] [PubMed] [Google Scholar]
  18. de Vos B.D., Berendsen F.F., Viergever M.A., Sokooti H., Staring M., Isgum I., 2019. A deep learning framework for unsupervised affine and deformable image registration. Medical Image Analysis 52, 128–143. URL: https://www.sciencedirect.com/science/article/pii/S1361841518300495, doi:doi: 10.1016/j.media.2018.11.010. [DOI] [PubMed] [Google Scholar]
  19. Desikan R.S., Segonne F., Fischl B., Quinn B.T., Dickerson B.C., Blacker D., Buckner R.L., Dale A.M., Maguire R.P., Hyman B.T., Albert M.S., Killiany R.J., 2006. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. NeuroImage 31, 968–980. URL: https://linkinghub.elsevier.com/retrieve/pii/S1053811906000437, doi:doi: 10.1016/j.neuroimage.2006.01.021. [DOI] [PubMed] [Google Scholar]
  20. Destrieux C., Fischl B., Dale A., Halgren E., 2010. Automatic parcellation of human cortical gyri and sulci using standard anatomical nomenclature. NeuroImage 53, 1–15. URL: https://linkinghub.elsevier.com/retrieve/pii/S1053811910008542, doi:doi: 10.1016/j.neuroimage.2010.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Dey N., Ren M., Dalca A.V., Gerig G., 2021. Generative adversarial registration for improved conditional deformable templates, in: Proceedings of the IEEE/CVF international conference on computer vision, pp. 3929–3941. [Google Scholar]
  22. Dickie D.A., Shenkin S.D., Anblagan D., Lee J., Blesa Cabez M., Rodriguez D., Boardman J.P., Waldman A., Job D.E., Wardlaw J.M., 2017. Whole brain magnetic resonance image atlases: A systematic review of existing atlases and caveats for use in population imaging. Frontiers in Neuroinformatics 11. URL: http://journal.frontiersin.org/article/10.3389/fninf.2017.00001/full, doi:doi: 10.3389/fninf.2017.00001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Ding Z., Niethammer M., 2022. Aladdin: Joint atlas building and diffeomorphic registration learning with pairwise alignment, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 20784–20793. [Google Scholar]
  24. Dougherty R.F., Koch V.M., Brewer A.A., Fischer B., Modersitzki J., Wandell B.A., 2003. Visual field representations and locations of visual areas V1/2/3 in human visual cortex. Journal of Vision 3, 1. URL: http://jov.arvojournals.org/article.aspx?doi=10.1167/3.10.1, doi:doi: 10.1167/3.10.1. [DOI] [PubMed] [Google Scholar]
  25. Engel S., 1997. Retinotopic organization in human visual cortex and the spatial precision of functional MRI. Cerebral Cortex 7, 181–192. URL: https://academic.oup.com/cercor/article-lookup/doi/10.1093/cercor/7.2.181, doi:doi: 10.1093/cercor/7.2.181. [DOI] [PubMed] [Google Scholar]
  26. Engel S.A., Rumelhart D.E., Wandell B.A., Lee A.T., Glover G.H., Chichilnisky E.J., Shadlen M.N., 1994. fMRI of human visual cortex. Nature 369, 525–525. URL: https://www.nature.com/articles/369525a0, doi:doi: 10.1038/369525a0. [DOI] [PubMed] [Google Scholar]
  27. Fischl B., 2012. FreeSurfer. NeuroImage 62, 774–781. URL: https://linkinghub.elsevier.com/retrieve/pii/S1053811912000389, doi:doi: 10.1016/j.neuroimage.2012.01.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Fischl B., Rajendran N., Busa E., Augustinack J., Hinds O., Yeo B.T.T., Mohlberg H., Amunts K., Zilles K., 2008. Cortical folding patterns and predicting cytoarchitecture. Cerebral Cortex 18, 1973–1980. URL: https://academic.oup.com/cercor/article-lookup/doi/10.1093/cercor/bhm225, doi:doi: 10.1093/cercor/bhm225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Fischl B., Sereno M.I., Dale A.M., 1999a. Cortical surface-based Analysis. NeuroImage 9, 195–207. URL: https://linkinghub.elsevier.com/retrieve/pii/S1053811998903962, doi:doi: 10.1006/nimg.1998.0396. [DOI] [PubMed] [Google Scholar]
  30. Fischl B., Sereno M.I., Tootell R.B.H., Dale A.M., 1999b. High-resolution intersubject averaging and a coordinate system for the cortical surface. Human Brain Mapping 8, 272–284. URL: https://onlinelibrary.wiley.com/doi/abs/10.1002/%28SICI%291097-0193%281999%298%3A4%3C272%3A%3AAID-HBM10%3E3.0.CO%3B2-4, doi:doi:. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Frost M.A., Goebel R., 2012. Measuring structural–functional correspondence: Spatial variability of specialised brain regions after macro-anatomical alignment. NeuroImage 59, 1369–1381. URL: https://www.sciencedirect.com/science/article/pii/S1053811911009281, doi:doi: 10.1016/j.neuroimage.2011.08.035. [DOI] [PubMed] [Google Scholar]
  32. Guntupalli J.S., Feilong M., Haxby J.V., 2018. A computational model of shared fine-scale structure in the human connectome. PLOS Computational Biology 14, e1006120. URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006120, doi:doi: 10.1371/journal.pcbi.1006120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Guntupalli J.S., Hanke M., Halchenko Y.O., Connolly A.C., Ramadge P.J., Haxby J.V., 2016. A model of representational spaces in human cortex. Cerebral Cortex 26, 2919–2934. URL: 10.1093/cercor/bhw068, doi:doi:10.1093/cercor/bhw068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Joshi A., Leahy R., Toga A.W., Shattuck D., 2009. A framework for brain registration via simultaneous surface and volume flow, in: Information Processing in Medical Imaging. Springer Berlin Heidelberg, Berlin, Heidelberg. volume 5636, pp. 576–588. URL: http://link.springer.com/10.1007/978-3-642-02498-6_48, doi:doi: 10.1007/978-3-642-02498-6_48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Joshi A.A., Choi S., Liu Y., Chong M., Sonkar G., Gonzalez-Martinez J., Nair D., Wisnowski J.L., Haldar J.P., Shattuck D.W., Damasio H., Leahy R.M., 2022. A hybrid high-resolution anatomical MRI atlas with subparcellation of cortical gyri using resting fMRI. Journal of Neuroscience Methods 374, 109566. URL: https://linkinghub.elsevier.com/retrieve/pii/S0165027022000930, doi:doi: 10.1016/j.jneumeth.2022.109566. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Kingma D.P., Ba J., 2014. Adam: A method for stochastic optimization. URL: https://arxiv.org/abs/1412.6980, doi:doi: 10.48550/ARXIV.1412.6980. [DOI] [Google Scholar]
  37. Krebs J., Delingette H., Mailhé B., Ayache N., Mansi T., 2019. Learning a probabilistic model for diffeomorphic registration. IEEE Transactions on Medical Imaging 38, 2165–2176. doi:doi: 10.1109/TMI.2019.2897112. [DOI] [PubMed] [Google Scholar]
  38. Krebs J., Mansi T., Delingette H., Zhang L., Ghesu F.C., Miao S., Maier A.K., Ayache N., Liao R., Kamen A., 2017. Robust non-rigid registration through agent-based action learning, in: Medical Image Computing and Computer Assisted Intervention, Cham. pp. 344–352. doi:doi: 10.1007/978-3-319-66182-7_40. [DOI] [Google Scholar]
  39. Lee H.H., Tang Y., Bao S., Xu Y., Yang Q., Yu X., Fogo A.B., Harris R., de Caestecker M.P., Spraggins J.M., Heinrich M.P., Huo Y., Landman B.A., 2022a. Supervised deep generation of high-resolution arterial phase computed tomography kidney substructure atlas, in: Isgum I., Colliot O. (Eds.), Medical Imaging 2022: Image Processing, SPIE, San Diego, United States. p. 97. URL: https://www.spiedigitallibrary.org/conference-proceedings-of-spie/12032/2608290/Supervised-deep-generation-of-high-resolution-arterial-phase-computed-tomography/10.1117/12.2608290.full, doi:doi: 10.1117/12.2608290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Lee H.H., Tang Y., Xu K., Bao S., Fogo A.B., Harris R., de Caestecker M.P., Heinrich M., Spraggins J.M., Huo Y., Landman B.A., 2022b. Multi-contrast computed tomography healthy kidney atlas. Computers in Biology and Medicine 146, 105555. URL: https://linkinghub.elsevier.com/retrieve/pii/S001048252200347X, doi:doi: 10.1016/j.compbiomed.2022.105555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Li J., Tuckute G., Fedorenko E., Edlow B.L., Fischl B., Dalca A.V., 2023. Joint cortical registration of geometry and function using semi-supervised learning, in: Proceedings of the Sixth Conference on Medical Imaging with Deep Learning, PMLR. [Google Scholar]
  42. Lipkin B., Tuckute G., Affourtit J., Small H., Mineroff Z., Kean H., Jouravlev O., Rakocevic L., Pritchett B., Siegelman M., Hoeflin C., Pongos A., Blank I.A., Struhl M.K., Ivanova A., Shannon S., Sathe A., Hoffmann M., Nieto-Castanon A., Fedorenko E., 2022. Probabilistic atlas for the language network based on precision fMRI data from >800 individuals. Scientific Data 9, 529. URL: https://www.nature.com/articles/s41597-022-01645-3, doi:doi: 10.1038/s41597-022-01645-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Maintz J.B.A., Viergever M.A., 1998. A survey of medical image registration. Medical Image Analysis 2, 1–36. URL: https://www.sciencedirect.com/science/article/pii/S1361841501800268, doi:doi: 10.1016/S1361-8415(01)80026-8. [DOI] [PubMed] [Google Scholar]
  44. Nenning K.H., Liu H., Ghosh S.S., Sabuncu M.R., Schwartz E., Langs G., 2017. Diffeomorphic functional brain surface alignment: Functional demons. NeuroImage 156, 456–465. URL: https://www.sciencedirect.com/science/article/pii/S105381191730321X, doi:doi: 10.1016/j.neuroimage.2017.04.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Niethammer M., Kwitt R., Vialard F.X., 2019. Metric learning for image registration, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8463–8472. URL: https://openaccess.thecvf.com/content_CVPR_2019/html/Niethammer_Metric_Learning_for_Image_Registration_CVPR_2019_paper.html. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Penfield W., Boldrey E., 1937. Somatic motor and sensory representation in the cerebral cortex of man as studied by electrical stimulation. Brain 60, 389–443. URL: https://academic.oup.com/brain/article-lookup/doi/10.1093/brain/60.4.389, doi:doi: 10.1093/brain/60.4.389. [DOI] [Google Scholar]
  47. Polimeni J.R., Fischl B., Greve D.N., Wald L.L., 2010. Laminar analysis of 7T BOLD using an imposed spatial activation pattern in human V1. NeuroImage 52, 1334–1346. URL: https://linkinghub.elsevier.com/retrieve/pii/S1053811910007111, doi:doi: 10.1016/j.neuroimage.2010.05.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Postelnicu G., Zollei L., Fischl B., 2009. Combined volumetric and surface registration. IEEE Transactions on Medical Imaging 28, 508–522. URL: http://ieeexplore.ieee.org/document/4601460/, doi:doi: 10.1109/TMI.2008.2004426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Robinson E.C., Garcia K., Glasser M.F., Chen Z., Coalson T.S., Makropoulos A., Bozek J., Wright R., Schuh A., Webster M., Hutter J., Price A., Cordero Grande L., Hughes E., Tusor N., Bayly P.V., Van Essen D.C., Smith S.M., Edwards A.D., Hajnal J., Jenkinson M., Glocker B., Rueckert D., 2018. Multimodal surface matching with higher-order smoothness constraints. NeuroImage 167, 453–465. URL: https://www.sciencedirect.com/science/article/pii/S1053811917308649, doi:doi: 10.1016/j.neuroimage.2017.10.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Robinson E.C., Jbabdi S., Glasser M.F., Andersson J., Burgess G.C., Harms M.P., Smith S.M., Van Essen D.C., Jenkinson M., 2014. MSM: A new flexible framework for Multimodal Surface Matching. NeuroImage 100, 414–426. URL: https://www.sciencedirect.com/science/article/pii/S1053811914004546, doi:doi: 10.1016/j.neuroimage.2014.05.069. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Ronneberger O., Fischer P., Brox T., 2015. U-Net: Convolutional networks for biomedical image segmentation. arXiv URL: https://arxiv.org/abs/1505.04597, doi:doi: 10.48550/ARXIV.1505.04597. [DOI] [Google Scholar]
  52. Sabuncu M.R., Singer B.D., Conroy B., Bryan R.E., Ramadge P.J., Haxby J.V., 2010. Function-based intersubject alignment of human cortical anatomy. Cerebral Cortex 20, 130–140. URL: https://academic.oup.com/cercor/article-lookup/doi/10.1093/cercor/bhp085, doi:doi: 10.1093/cercor/bhp085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Schwartz E.L., 1980. Computational anatomy and functional architecture of striate cortex: A spatial mapping approach to perceptual coding. Vision Research 20, 645–669. URL: https://linkinghub.elsevier.com/retrieve/pii/0042698980900905, doi:doi: 10.1016/0042-6989(80)90090-5. [DOI] [PubMed] [Google Scholar]
  54. Sereno M.I., Dale A.M., Reppas J.B., Kwong K.K., Belliveau J.W., Brady T.J., Rosen B.R., Tootell R.B.H., 1995. Borders of multiple visual areas in humans revealed by functional magnetic resonance imaging. Science 268, 889–893. URL: https://www.science.org/doi/10.1126/science.7754376, doi:doi: 10.1126/science.7754376. [DOI] [PubMed] [Google Scholar]
  55. Sokooti H., de Vos B., Berendsen F., Lelieveldt B.P.F., Isgum I., Staring M., 2017. Nonrigid image registration using multi-scale 3D convolutional neural networks, in: Medical Image Computing and Computer Assisted Intervention, Cham. pp. 232–239. doi:doi: 10.1007/978-3-319-66182-7_27. [DOI] [Google Scholar]
  56. Steinmetz H., Seitz R.J., 1991. Functional anatomy of language processing: Neuroimaging and the problem of individual variability. Neuropsychologia 29, 1149–1161. URL: https://www.sciencedirect.com/science/article/pii/002839329190030C, doi:doi: 10.1016/0028-3932(91)90030-C. [DOI] [PubMed] [Google Scholar]
  57. Tootell R.B.H., Hadjikhani N.K., Vanduffel W., Liu A.K., Mendola J.D., Sereno M.I., Dale A.M., 1998. Functional analysis of primary visual cortex (V1) in humans. Proceedings of the National Academy of Sciences 95, 811–817. URL: https://pnas.org/doi/full/10.1073/pnas.95.3.811, doi:doi: 10.1073/pnas.95.3.811. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. van Atteveldt N., Formisano E., Goebel R., Blomert L., 2004. Integration of letters and speech sounds in the human brain. Neuron 43, 271–282. URL: https://www.sciencedirect.com/science/article/pii/S0896627304003964, doi:doi: 10.1016/j.neuron.2004.06.025. [DOI] [PubMed] [Google Scholar]
  59. Van Essen D.C., Drury H.A., Joshi S., Miller M.I., 1998. Functional and structural mapping of human cerebral cortex: Solutions are in the surfaces. Proceedings of the National Academy of Sciences 95, 788–795. URL: https://pnas.org/doi/full/10.1073/pnas.95.3.788, doi:doi: 10.1073/pnas.95.3.788. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Vercauteren T., Pennec X., Perchant A., Ayache N., 2009. Diffeomorphic demons: Efficient non-parametric image registration. NeuroImage 45, S61–S72. URL: https://www.sciencedirect.com/science/article/pii/S1053811908011683, doi:doi: 10.1016/j.neuroimage.2008.10.040. [DOI] [PubMed] [Google Scholar]
  61. Wandell B.A., Dumoulin S.O., Brewer A.A., 2007. Visual field maps in human cortex. Neuron 56, 366–383. URL: https://linkinghub.elsevier.com/retrieve/pii/S089662730700774X, doi:doi: 10.1016/j.neuron.2007.10.012. [DOI] [PubMed] [Google Scholar]
  62. Yang X., Kwitt R., Niethammer M., 2016. Fast predictive image registration, in: Deep Learning and Data Labeling for Medical Applications, Cham. pp. 48–57. doi:doi: 10.1007/978-3-319-46976-8_6. [DOI] [Google Scholar]
  63. Yeo B., Sabuncu M., Vercauteren T., Ayache N., Fischl B., Golland P., 2010a. Spherical demons: Fast diffeomorphic landmark-free surface registration. IEEE Transactions on Medical Imaging 29, 650–668. URL: http://ieeexplore.ieee.org/document/5223581/, doi:doi: 10.1109/TMI.2009.2030797. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Yeo B.T.T., Sabuncu M.R., Vercauteren T., Holt D.J., Amunts K., Zilles K., Golland P., Fischl B., 2010b. Learning task-optimal registration cost functions for localizing cytoarchitecture and function in the cerebral cortex. IEEE Transactions on Medical Imaging 29, 1424–1441. URL: http://ieeexplore.ieee.org/document/5482176/, doi:doi: 10.1109/TMI.2010.2049497. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Zhao F., Wu Z., Wang F., Lin W., Xia S., Shen D., Wang L., Li G., 2021. S3Reg: Superfast spherical surface registration based on deep learning. IEEE Transactions on Medical Imaging 40, 1964–1976. doi:doi: 10.1109/TMI.2021.3069645. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

Articles from ArXiv are provided here courtesy of arXiv

RESOURCES