Revealing Developmental Regionalization of Infant Cerebral Cortex Based on Multiple Cortical Properties

Fan Wang; Chunfeng Lian; Zhengwang Wu; Li Wang; Weili Lin; John H Gilmore; Dinggang Shen; Gang Li

doi:10.1007/978-3-030-32245-8_93

. Author manuscript; available in PMC: 2020 Mar 18.

Published in final edited form as: Med Image Comput Comput Assist Interv. 2019 Oct 10;11765:841–849. doi: 10.1007/978-3-030-32245-8_93

Revealing Developmental Regionalization of Infant Cerebral Cortex Based on Multiple Cortical Properties

Fan Wang ¹, Chunfeng Lian ¹, Zhengwang Wu ¹, Li Wang ¹, Weili Lin ¹, John H Gilmore ², Dinggang Shen ¹, Gang Li ¹

PMCID: PMC7079741 NIHMSID: NIHMS1569925 PMID: 32190846

Abstract

The human brain develops dynamically and regionally heterogeneously during the first two postnatal years. Cortical developmental regionalization, i.e., the landscape of cortical heterogeneity in development, reflects the organization of underlying microstructures, which are closely related to the functional principles of the cortex. Therefore, prospecting early cortical developmental regionalization can provide neurobiologically meaningful units for precise region localization, which will advance our understanding on brain development in this critical period. However, due to the absence of dedicated computational tools and large-scale datasets, our knowledge on early cortical developmental regionalization still remains intact. To fill both the methodological and knowledge gaps, we propose to explore the cortical developmental regionalization using a novel method based on nonnegative matrix factorization (NMF), due to its ability in analyzing complex high-dimensional data by representing data using several bases in a data-driven way. Specifically, a novel multi-view NMF (MV-NMF) method is proposed, in which multiple distinct and complementary cortical properties (i.e., multiple views) are jointly considered to provide comprehensive observation of cortical regionalization process. To ensure the sparsity of the discovered regions, an orthogonal constraint defined in Stiefel manifold is imposed in our MV-NMF method. Meanwhile, a graph-induced constraint is also included to improve the compactness of the discovered regions. Capitalizing on an unprecedentedly large dataset with 1,560 longitudinal MRI scans from 887 infants, we delineate the first neurobiologically meaningful representation of early cortical regionalization, providing a valuable reference for brain development studies.

1. Introduction

The human brain undergoes a protracted course of development after birth, while maintains an exceptionally high rate of development in the first two years [1, 2]. There is a growing consensus that early development strongly affects brain cognition and function throughout the entire lifespan, suggesting that exploring the early brain developmental patterns is of great importance in understanding both ordered and disordered brains [1, 3]. Due to the non-uniformities in underlying cortical microstructures, which are closely related to functional principles of the cortex, the infant cerebral cortex grows in regionally heterogeneous manners, thus forming a unique landscape of cortical developmental regionalization. Prospecting such developmental regionalization could potentially provide neurobiologically meaningful units to facilitate precise region localization, inter-subject and inter-study comparison, and effective feature reduction for early brain disorder identification [4], thus advancing our understanding on brain development in this critical early period. However, due to the absence of dedicated computational tools and large-scale datasets, our knowledge of such infantile cortical developmental regionalization still remains intact.

To fill the methodological gap and hence to advance our knowledge towards this critical early postnatal period, we are motivated to explore cortical developmental regionalization in the first two postnatal years, by leveraging a novel multi-view nonnegative matrix factorization (MV-NMF) method and an unprecedentedly large-scale infant MRI dataset. The utilization of NMF to analyze complex high-dimensional cortical data is mainly motivated by three reasons. 1) It can naturally discover different groups of vertices varying in similar manners across different subjects and ages, thus providing a part-based cortical representation to facilitate the interpretability of the cortical regionalization. 2) By varying the desired number of output components/parts, NMF can reveal the regionalization at different resolutions in a hierarchical manner. 3) It is more flexible than other development-based analyses [5], as there is no strict requirement of longitudinal data with multiple time points. On the other hand, the motivation to use multiple views, i.e., multiple cortical attributes (e.g., cortical thickness and surface area), is to provide comprehensive and complementary characterizations of the cortical regionalization process, as different cortical attributes have distinct neurobiological mechanisms and thus reflect distinct aspects of the regionalization progress.

Therefore, in our proposed MV-NMF method, we minimize an objective function to jointly decompose multi-view data matrices (of cortical attributes) for grouping cortical vertices into hierarchical regions. Specifically, apart from the conventional reconstruction error in each view, a multi-view regularization is included in our objective function to iteratively fuse multi-view information to encourage the consistency of regionalization results between different views. To improve the sparsity of the discovered regions, the orthogonal constraint is effectively imposed during the alternative optimization procedure by computing the gradient in the Stiefel manifold [6]. Moreover, a graph-induced regularization is also incorporated in our objective function to further improve the compactness of the results. The proposed MV-NMF method has been applied to an unprecedentedly large dataset with 1,560 longitudinal MRI scans from 887 typically developing infants. The revealed landscape of early cortical regionalization provides a reliable reference for studying early postnatal cortical organization and progression.

2. Materials and Methods

2.1. Dataset and Image Processing

Totally 1,560 longitudinal MRI scans were collected from 887 typically developing infants (466 males / 421 females) with age at scan ranging from 0.2 months to 29.3 months. Using a Siemens 3T head-only scanner, T1w images with the resolution of 1 × 1 × 1 mm³ were acquired with the parameters: TR=1,900 ms, TE=4.38 ms, inversion time=1,100 ms, and flip angle= 7°. T2w images with the resolution of 1.25 × 1.25 × 1.95 mm³ were acquired with the parameters: TR=7,380 ms, TE=119 ms, and flip angle=150°.

All MR images were preprocessed following a standard procedure [7], including skull stripping, cerebellum and brain stem removal, intensity inhomogeneity correction, rigid alignment of each scan to the age-matched infant brain atlas, tissue segmentation, non-cortical structure masking and filling, and left-right hemisphere separation. Inner and outer cortical surfaces were reconstructed for each hemisphere and mapped onto a spherical space. To establish cortical correspondences across subjects and ages, we employed a longitudinally consistent registration strategy. Specifically, first, for each subject, longitudinal cortical correspondences were established by spherical registration of surfaces at the early time points to the corresponding surface at the last time point [8]. Then, inter-subject cortical correspondences were established by groupwise registration of all spherical cortical surfaces at the last time point. All cortical surfaces were finally resampled to a standard mesh tessellation, and computed with surface area and cortical thickness for each vertex [8], which will be used as cortical attributes for discovering cortical developmental regionalization.

2.2. Nonnegative Matrix Factorization (NMF)

In a data-driven way, the NMF method approximates complex high-dimensional data by representing them as the linear combinations of limited bases, with each basis corresponding to a part/component. Taking cortical thickness as an example, the data matrix $X \in ℝ^{M \times N}$ is a large nonnegative matrix constructed by cortical thickness values from all scans (data samples), where M and N are the numbers of vertices and scans, respectively. Each column of X includes cortical thickness values from one scan. The NMF method decomposes X into two smaller non-negative matrices, i.e., the bases matrix $W \in ℝ^{M \times K}$ and the coefficients matrix $H \in ℝ^{K \times N}$ , by satisfying X ≈ WH. The number of bases (components/parts) should be small, i.e., K ≪ M and K ≪ N. As a result, each column of W is a basis that represents one distinct region/part, which indicates a group of cortical vertices jointly developing across subjects and ages. Each data sample (i.e., each column in X) can be approximately reconstructed by a linear combination of bases from W, using the corresponding column from H as the coefficients. By minimizing the Frobenius norm of the reconstruction error $| | X - {W | H | |}_{F}^{2}$ , W and H can be found through an iterative multiplication [9].

2.3. Multi-View NMF

Multiple cortical attributes, e.g., cortical thickness, surface area and myelin content, have distinct biological mechanisms and provide rich complementary information on cortical development, thus improving the reliability and meaningfulness of discovered basis matrices. Hence, we propose a multi-view NMF (MV-NMF) framework, with each view corresponding to a cortical attribute, to discover early cortical developmental regionalization. Besides the multi-view regularization, our MV-NMF also imposes an orthogonal constraint by computing the true gradient in an orthogonal parameter space, namely Stiefel manifold. To further improve the compactness of the final solutions, a graph regularization is also included.

Objective Function.

Let $X_{v} \in ℝ^{M \times N_{v}}$ be the data matrix for the v-th view in total V views, while $W_{v} \in ℝ^{M \times K}$ and $H_{v} \in ℝ^{K \times N_{v}}$ denote, respectively, the basis and coefficient matrices decomposed from X_v. Scalar N_v is the total number of samples in the v-th view. The objective function of our MV-NMF method is defined as

J = \underset{W_{v}^{T} W_{v} = I}{min_{W_{v}, H_{v} \geq 0}} \frac{1}{2} \sum_{v = 1 \dots V} λ_{v} ‖ X_{v} - W_{v} H_{v} ‖_{F}^{2} + R_{M V} + α \cdot R_{G} .

(1)

Herein, alongside the reconstruction error in the first term, we introduce two additional terms, i.e., the multi-view regularization $R_{M V}$ and the graph-based regularization $R_{G}$ . Parameters λ_v and α control the influences of their corresponding terms.

The multi-view regularization is defined as

R_{M V} = \frac{1}{2} \sum_{u = 1 \dots V} \sum_{v = 1 \dots V} θ_{u v} ‖ W_{u} - W_{v} ‖_{F}^{2},

(2)

which aims to minimize the dissimilarity between the bases of different views. Parameter θ_uv controls the influence of the inconsistency between any two paired views.

The graph-based regularization $R_{G}$ is introduced to improve the compactness and remove small artifacts of discovered regions. Denoting matrix A as a symmetric adjacent matrix, which defines the one-ring connections of vertices on the cortical surface. Matrix D is a diagonal matrix, where each element D_ii represents the sum of the i-th row (or column) of A. Let Tr(·) denote the trace of a matrix, the graph-based regularization is defined in terms of the graph Laplacian [10], i.e.,

R_{G} = \frac{1}{2} \sum_{v = 1 \dots V} T r (W_{v}^{T} (A - D) W_{v})) .

(3)

Updating Rule.

First, the partial gradient of $J$ toward H_v is found to be

\nabla_{H_{v}} J = - λ_{v} W_{v}^{T} X_{v} + λ_{v} W_{v}^{T} W_{v} H_{v} .

(4)

Thus, it is straightforward to have the form of updating rule for H_v, with ⨀ representing the elementwise multiplication:

H_{v} \leftarrow H_{v} ⊙ \frac{W_{v}^{T} X_{v}}{W_{v}^{T} W_{v} H_{v}} .

(5)

To update W_v, we first compute the corresponding partial gradient:

\nabla_{W_{v}} J = - λ_{v} X_{v} H_{v}^{T} - \sum_{u = 1 \dots V} θ_{u v} W_{u} + \sum_{u = 1 \dots V} θ_{u v} W_{v} + λ_{v} W_{v} H_{v} H_{v}^{T} + α (A - D) W_{v} .

(6)

Then, in Stiefel manifold, which is a parameter space with the orthogonality constraint $W_{v}^{T} W_{v} = I$ , the true gradient ${\tilde{\nabla}}_{W_{v}} J$ (with orthogonality) [6] is computed by

{\tilde{\nabla}}_{W_{v}} J = \nabla_{W_{v}} J - W_{v} {[\nabla_{W_{v}} J]}^{T} W_{v} = [λ_{v} W_{v} H_{v} X_{v}^{T} W_{v} + \sum_{u = 1, \dots, V} θ_{u v} W_{v} W_{u}^{T} W_{v} + α D W_{v} + α W_{v} W_{v}^{T} A W_{v}] - [λ_{v} X_{v} H_{v}^{T} + \sum_{u = 1, v} θ_{u v} W_{u} + α A W_{v} + α W_{v} W_{v}^{T} D W_{v}] .

(7)

Accordingly, the multiplicative update rule for W_v is of the form

w_{v} \leftarrow w_{v} ⊙ \frac{λ_{v} X_{v} H_{v}^{T} + \sum_{u = 1}^{v} θ_{u v} W_{u} + α A W_{v} + α W_{v} W_{v}^{T} D W_{v}}{λ_{v} W_{v} H_{v} X_{v}^{T} W_{v} + \sum_{u = 1}^{v} θ_{u v} W_{v} W_{u}^{T} W_{v} + α D W_{v} + α W_{v} W_{v}^{T} A W_{v}} .

(8)

Iterative Optimization.

To generate meaningful solutions, an appropriate initialization for NMF-based methods is necessary. Many single-view NMF methods employ the nndsvd initialization method [11], which has shown faster reduction of the residual error than random initialization. Herein, we also apply nndsvd algorithm for initialization, but with some modifications to adapt our proposed multi-view scheme. Specifically, we first obtain initializations from each view separately, which encode the potential jointly developing vertices on each view. Second, we explicitly look for the intersections from the initializations of different views to generate new components in the initialization, since their intersections encode the potential jointly developing vertices on all views. Third, according to the desired number of initialization components, the smallest components are removed. In this way, the multi-view NMF scheme can be appropriately initialized. The optimal bases matrices W_v and coefficients matrices Hv of each view were obtained by iteratively calling Equations (8) and (5). The final regions discovered using MV-NMF were the average of regions from different views.

3. Experiments

In this paper, cortical thickness and surface area were used in our MV-NMF scheme to explore cortical regionalization, since they are two important cortical morphological attributes with distinct neurobiological/genetic mechanisms and growth patterns during infancy [4]. The data matrix of each view is normalized so that each column averages to 1. In our experiments, parameters of the proposed MV-NMF method were empirically set as: λ₁ = λ₂ 1, θ_uv 1000, and α = 500.

Effectiveness of MV-NMF.

To integrate information from multiple views, one straightforward solution is to concatenate the multiple data matrices into a single matrix. Specifically, assume $X_{1} \in ℝ^{M \times N_{1}}$ and $X_{2} \in ℝ^{M \times N_{2}}$ are two data matrices in term of cortical thickness and surface area, respectively. One can directly concatenate X₁ and X₂ (as [X₁, X₂]), and then apply the conventional NMF on the concatenated big matrix for exploring cortical regionalization. We refer this method as rig-NMF, since it rigidly imposes the bases from different views to be identical. Hence, we compared the rig-NMF with our MV-NMF using the same data and initialization. For a fair comparison, we employed an orthogonal projective version of NMF [9] for the rig-NMF method, given its state-of-the-art performance with high orthogonality and compactness [12].

We assume that the effectiveness of a method can be reflected by the border consistency of its discovered regionalization across different resolutions. That is, if a border is more consistently preserved along the change of K (i.e., the number of output components), the result is more likely to reflect the stable hierarchical regionalization of the underlying microstructures. Accordingly, by increasing K from 2 to 7, the border consistencies obtained by rig-NMF and our MV-NMF are compared in Fig. 1. Note that, each component highlighted by black block in a specific row (e.g., K = 2) means that it is hierarchically decomposed into two finer components in the next row (e.g., K = 3). Based on the inconsistent borders pointed by blue arrows, it can be observed that our MV-NMF discovers borders with higher consistency, indicating the effectiveness and rationality of our MV-NMF.

Fig. 1. — The layers of hierarchical structures obtained by MV-NMF and rig-NMF, by increasing the component number K from 2 to 7. The inconsistent borders across different layers are pointed out with blue arrows.

Robustness of MV-NMF.

The proposed MV-NMF does not require different views to have the same number of samples. To show the robustness of the MV-NMF, we designed an imbalanced testing dataset, where we randomly removed half of the cortical thickness samples to check if our method can still produce similar results as using all samples. This situation could happen when estimated cortical thickness values are not reliable for some subjects, since cortical thickness (between 1 and 5 mm) is much more sensitive to cortical surface reconstruction results, compared to surface area. This process was repeated five times. By calculating the Dice coefficient between each imbalanced test and the case of using all samples, we finally summarized the mean Dice coefficients at different numbers of components in Fig. 2 (a). It can be observed that, compared with rig-NMF, our MV-NMF yields similar performance when K is between 2 and 8, but much better performance when K is larger. Across all resolutions, the Dice coefficients of our MV-NMF are always larger than 0.9, demonstrating the robustness of our proposed method.

Fig. 2. — (a) Mean Dice coefficients by performing five times imbalanced sample test. (b) Silhouette coefficients in terms of different resolutions. K = 13, 25 are local peaks. K = 18 shows a start of a plateau (i.e., K = 18 to 22).

Determination of Resolution K.

Whereas MV-NMF provides hierarchical regions while increasing the resolution, we need to select some representative resolutions for the best representation of the studied data. To this end, a silhouette value was computed for each resolution K. It measures the intra-region similarity against inter-regions similarity. High silhouette values indicate better results. In each view, the similarity of two vertices i and j is evaluated using the correlation coefficient between X_v(i,·) and X_v(j,·). The multi-view silhouette value is the average of the values from all views, as shown in Fig. 2 (b). Naturally, higher resolutions lead to higher silhouette values. Yet we can still observe local peaks, such as for K = 13 and 25. Meanwhile, when resolution increases from 18 to 22, a plateau is shown. We thus chose 13, 18, and 25 as the most appropriate resolutions for the landscape of early cortical regionalization.

Discovered Regions.

Although 13, 18 and 25 are all shown to be appropriate resolutions, we prefer relatively finer ones when considering network/region-based analyses. Fig. 3 shows the discovered regions with K = 25 and their topography with sharp boundaries, which correspond well with existing neuroscience knowledge according to the name of each region given in the figure. Besides, these regions are also consistent with previous findings. For example, regions 2, 11, 13, 15, 17, 18, 21, 22 and 25 are found in the genetic-based cortical topography [4]. Regions 2, 6, 7, 11, 13, 14, 15, 18, 23, 24 are found in the cortical thickness based remodeling patterns in adolescents [12].

Fig. 3. — The 25 regions and their topography with sharp boundaries.

4. Conclusion

We have proposed a novel multi-view NMF scheme to jointly consider cortical thickness and surface area for exploring the early cortical developmental regionalization. Capitalizing on an unprecedentedly large infant MRI dataset, we have provided valuable references of region topography that encodes multi-view cortical developmental patterns. Our discovered regionalizatoin reflects co-developing vertices across ages and subjects on multi-views, which differs markedly from previous cortical atlases. These references will later be publicly released to facilitate other infant-related studies, thus boosting our knowledge in the early brain development. As an important future work, we will include more cortical attributes to discover regionalization patterns based on more than two views, which will provide more insights into early brain development.

5. Acknowledgement

This work was supported in part by NIH grants (MH107815, MH116225, and MH117943).

References

1.Fjell AM, Grydeland H, Krogsrud SK, et al. : Development and aging of cortical thickness correspond to genetic organization patterns. PNAS 112, 15462–15467 (2015) [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Li G, Wang L, Yap P-T, et al. : Computational neuroanatomy of baby brains: A review. NeuroImage 185, 906–925 (2019) [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Eickhoff SB, Constable RT, Yeo BT: Topographic organization of the cerebral cortex and brain cartography. NeuroImage 170, 332–347 (2018) [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Chen C-H, Fiecas M, Gutierrez E, et al. : Genetic topography of brain morphology. PNAS 110, 17089–17094 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Li G, Wang L, Shi F, et al. : Constructing 4D infant cortical surface atlases based on dynamic developmental trajectories of the cortex MICCAI, pp. 89–96. Springer; (2014) [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Choi S: Algorithms for orthogonal nonnegative matrix factorization IJCNN 2008., pp. 1828–1832. IEEE; (2008) [Google Scholar]
7.Li G, Nie J, Wang L, et al. : Mapping region-specific longitudinal cortical surface expansion from birth to 2 years of age. Cerebral cortex 23, 2724–2733 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Yeo BT, Sabuncu MR, Vercauteren T, et al. : Spherical demons: fast diffeomorphic landmark-free surface registration. IEEE TMI 29, 650–668 (2010) [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Yang Z, Oja E: Linear and nonlinear projective nonnegative matrix factorization. IEEE TNN 21, 734–749 (2010) [DOI] [PubMed] [Google Scholar]
10.Cai D, He X, Han J, et al. : Graph regularized nonnegative matrix factorization for data representation. IEEE TPAMI 33, 1548–1560 (2011) [DOI] [PubMed] [Google Scholar]
11.Boutsidis C, Gallopoulos E: SVD based initialization: A head start for nonnegative matrix factorization. Pattern Recognition 41, 1350–1362 (2008) [Google Scholar]
12.Sotiras A, Toledo JB, Gur RE, et al. : Patterns of coordinated cortical remodeling during adolescence and their associations with functional specialization and evolutionary expansion. PNAS 114, 3527–3532 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]

[R1] 1.Fjell AM, Grydeland H, Krogsrud SK, et al. : Development and aging of cortical thickness correspond to genetic organization patterns. PNAS 112, 15462–15467 (2015) [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] 2.Li G, Wang L, Yap P-T, et al. : Computational neuroanatomy of baby brains: A review. NeuroImage 185, 906–925 (2019) [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Eickhoff SB, Constable RT, Yeo BT: Topographic organization of the cerebral cortex and brain cartography. NeuroImage 170, 332–347 (2018) [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] 4.Chen C-H, Fiecas M, Gutierrez E, et al. : Genetic topography of brain morphology. PNAS 110, 17089–17094 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] 5.Li G, Wang L, Shi F, et al. : Constructing 4D infant cortical surface atlases based on dynamic developmental trajectories of the cortex MICCAI, pp. 89–96. Springer; (2014) [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Choi S: Algorithms for orthogonal nonnegative matrix factorization IJCNN 2008., pp. 1828–1832. IEEE; (2008) [Google Scholar]

[R7] 7.Li G, Nie J, Wang L, et al. : Mapping region-specific longitudinal cortical surface expansion from birth to 2 years of age. Cerebral cortex 23, 2724–2733 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Yeo BT, Sabuncu MR, Vercauteren T, et al. : Spherical demons: fast diffeomorphic landmark-free surface registration. IEEE TMI 29, 650–668 (2010) [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.Yang Z, Oja E: Linear and nonlinear projective nonnegative matrix factorization. IEEE TNN 21, 734–749 (2010) [DOI] [PubMed] [Google Scholar]

[R10] 10.Cai D, He X, Han J, et al. : Graph regularized nonnegative matrix factorization for data representation. IEEE TPAMI 33, 1548–1560 (2011) [DOI] [PubMed] [Google Scholar]

[R11] 11.Boutsidis C, Gallopoulos E: SVD based initialization: A head start for nonnegative matrix factorization. Pattern Recognition 41, 1350–1362 (2008) [Google Scholar]

[R12] 12.Sotiras A, Toledo JB, Gur RE, et al. : Patterns of coordinated cortical remodeling during adolescence and their associations with functional specialization and evolutionary expansion. PNAS 114, 3527–3532 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Revealing Developmental Regionalization of Infant Cerebral Cortex Based on Multiple Cortical Properties

Fan Wang

Chunfeng Lian

Zhengwang Wu

Li Wang

Weili Lin

John H Gilmore

Dinggang Shen

Gang Li

Abstract

1. Introduction