Variance Measures for Symmetric Positive (Semi-) Definite Tensors in Two Dimensions

Magnus Herberthson; Evren Özarslan; Carl-Fredrik Westin

doi:10.1007/978-3-030-56215-1_1

. Author manuscript; available in PMC: 2023 May 22.

Published in final edited form as: Math Vis. 2021 Feb 11;2021:3–22. doi: 10.1007/978-3-030-56215-1_1

Variance Measures for Symmetric Positive (Semi-) Definite Tensors in Two Dimensions

Magnus Herberthson ¹, Evren Özarslan ², Carl-Fredrik Westin ³

PMCID: PMC10201932 NIHMSID: NIHMS1735621 PMID: 37220520

Abstract

Calculating the variance of a family of tensors, each represented by a symmetric positive semi-definite second order tensor/matrix, involves the formation of a fourth order tensor R_abcd. To form this tensor, the tensor product of each second order tensor with itself is formed, and these products are then summed, giving the tensor R_abcd the same symmetry properties as the elasticity tensor in continuum mechanics. This tensor has been studied with respect to many properties: representations, invariants, decomposition, the equivalence problem et cetera. In this paper we focus on the two-dimensional case where we give a set of invariants which ensures equivalence of two such fourth order tensors R_abcd and ${\tilde{R}}_{a b c d}$ . In terms of components, such an equivalence means that components R_ijkl of the first tensor will transform into the components ${\tilde{R}}_{i j k l}$ of the second tensor for some change of the coordinate system.

1. Introduction

Positive semi-definite second order tensors arise in several applications. For instance, in image processing, a structure tensor is computed from greyscale images that captures the local orientation of the image intensity variations [10, 17] and is employed to address a broad range of challenges. Diffusion tensor magnetic resonance imaging (DT-MRI) [1, 5] characterizes anisotropic water diffusion by enabling the measurement of the apparent diffusion tensor, which makes it possible to delineate the fibrous structure of the tissue. Recent work has shown that diffusion MR measurements of restricted diffusion obscures the fine details of the pore shape under certain experimental conditions [11], and all remaining features can be encoded accurately by a confinement tensor [19].

All such second order tensors share the same mathematical properties, namely, they are real-valued, symmetric, and positive semi-definite. Moreover, in these disciplines, one encounters a collection of such tensors, e.g., at different locations of the image. Populations of such tensors have also been key to some studies aiming to model the underlying structure of the medium under investigation [8, 12, 18].

Irrespective of the particular application, let R_ab denote such tensors,¹ and we shall refer to the set of n tensors as ${R_{a b}^{(i)}}_{i}$ . Our desire is to find relevant descriptors or models of such a family. One relevant statistical measure of this family is the (population) variance

\frac{1}{n} \sum_{i = 1}^{n} (R_{a b}^{(i)} - {\hat{R}}_{a b}) (R_{c d}^{(i)} - {\hat{R}}_{c d}) = (\frac{1}{n} \sum_{i = 1}^{n} R_{a b}^{(i)} R_{c d}^{(i)}) - {\hat{R}}_{a b} {\hat{R}}_{c d},

where ${\hat{R}}_{a b} = \frac{1}{n} \sum_{i = 1}^{n} R_{a b}^{(i)}$ is the mean. (For another approach, see e.g., [8]). In this paper, we are interested in the first term, i.e., we study the fourth order tensor (skipping the normalization)

R_{a b c d} = \sum_{i = 1}^{n} R_{a b}^{(i)} R_{c d}^{(i)}, R_{a b}^{(i)} \geq 0,

(1)

where $R_{a b}^{(i)} \geq 0$ stands for $R_{a b}^{(i)}$ being positive semi-definite. It is obvious that R_abcd has the symmetries R_abcd = R_bacd = R_abdc and R_abcd = R_cdab, i.e., R_abcd has the same symmetries as the elasticity tensor [14] from continuum mechanics. The elasticity tensor is well studied [13], e.g. with respect to classification, decompositions, and invariants. In most cases this is done in three dimensions. The same (w.r.t. symmetries) tensor has also been studied in the context of diffusion MR [2].

In this paper we will focus on the corresponding tensor R_abcd in two dimensions. First, there are direct applications in image processing, and secondly, the problems posed will be more accessible in two dimensions than in three. In particular we study the equivalence problem, namely, we ask the question: given the components R_ijkl and ${\tilde{R}}_{i j k l}$ of two such tensors do they represent the same tensor in different coordinate systems (see Sects. 2.1.2 and 4)?

1.1. Outline

Section 2 contains tensorial matters. We will assume some basic knowledge of tensors, although some definitions are given for completeness. The notation(s) used is commented on and in particular the three-dimensional Euclidean vector space V_(ab) is introduced.

In Sect. 2.1.2, we make some general remarks concerning the tensor R_abcd and specify the problem we focus on. Section 2.1 is concluded with some remarks on the Voigt/Kelvin notation and the corresponding visualisation in $R^{3}$ .

Section 2.2 gives examples of invariants, especially invariants which are easily accessible from R_abcd. Also, more general invariant/canonical decompositions of R_abcd are given.

In Sect. 3, we discuss how the tensor R_abcd can (given a careful choice of basis) be expressed in terms of a 3 × 3 matrix, and how this matrix is affected by a rotation of the coordinate system in the underlying two-dimensional space on which R_abcd is defined.

In Sect. 4 we return to the equivalence problem and give the main result of this work. In Sect. 4.1.1 we provide a geometric condition for equivalence, while in Sect. 4.1.2, we present the equivalence in terms of a 3 × 3 matrix. Both these characterisations rely on the choice of particular basis elements for the vector spaces employed. In Sect. 4.1.3 the same equivalence conditions are given in a form which does not assume a particular basis.

2. Preliminaries

In this section we clarify the notation and some concepts which we need. Section 2.1 deals with the (alternatives of) tensor notation and some representations. The equivalence (and related) problems are also briefly addressed. Section 2.2 accounts for some natural invariants, traces and decompositions of R_abcd.

We will assume some familiarity with tensors, but to clarify the view on tensors we recall some facts. We start with a (finite dimensional) vector space V with dual V*. A tensor of order (p,q) is then a multi-linear mapping $\underset{q}{\underset{︸}{V \times V \dots \times V}} \times \underset{p}{\underset{︸}{V^{*} \times \dots \times V^{*}}} \to R$ . Moreover, a (non-degenerate) metric/scalar product $g : V \times V \to R$ gives an isomorphism from V to V* through v → g(v, ·), and it is this isomorphism which is used to ‘raise and lower indices’, see below. Indeed, for a fixed v ∈ V, g(v, ·) is a linear mapping $V \to R$ , i.e., an element of V*.

2.1. Tensor Notation and Representations

There is a plethora of notations for tensors. Here, we follow the well-adopted convention [16] that early lower case Latin letters (T^a_bc) refer to the tensor as a geometric object, its type being inferred from the indices and their positions (the abstract index notation). g_ab denotes the metric tensor. When the indices are lower case Latin letters from the middle of the alphabet, Tⁱ_jk, they refer to components of T^a_bc in a certain frame. The super-index i denotes a contravariant index while the sub-indices j, k are covariant. For instance, a typical vector (tensor of type (1, 0)) will be written v^a with components vⁱ, while the metric g_ab (tensor of type (0, 2)) has components g_ij. At a number of occasions, it will also be useful to express quantities in terms of components with respect to orthonormal frames, i.e., Cartesian coordinates. This is sometimes referred to as ‘Cartesian tensors’, and the distinction between contra- and covariant indices is obscured. In these situations, it is possible (but not necessary) to write all indices as sub-indices, and sometimes the symbol ≐ is used to indicate that an equation is only valid in Cartesian coordinates. For example T_i ≐ T_ijkδ_jk instead of Tⁱ = Tⁱ_jkg^jk = T^ik_k. Often this is clear form the context, but we will sometimes use ≐ to remind the reader that a Cartesian assumption is made. Here, the Einstein summation convention is implied, i.e., repeated indices are to be summed over, so that for instance $T^{i} = T_{j k}^{i} g^{j k} = T_{k}^{i k} = \sum_{j = 1}^{n} \sum_{k = 1}^{n} T_{j k}^{i} g^{j k} = \sum_{k = 1}^{n} T_{k}^{i k}$ if each index ranges from 1 to n. We have also used the metric g_ij and its inverse g^ij to raise and lower indices. For instance, since g_ijvⁱ is an element of V*, we write g_ijvⁱ = v_j.

We also remind of the notation for symmetrisation. For a two-tensor $T_{(a b)} = \frac{1}{2} (T_{a b} + T_{b a})$ , while more generally for a tensor T_{a₁a₂⋯a_n} of order (0, n) we have

T_{(a_{1} a_{2} \dots a_{n})} = \frac{1}{n!} \sum_{π} T_{a_{π (1)} a_{π (2)} \dots a_{π (n)}}

where the sum is taken over all permutations π of 1, 2, …, n. Naturally, this convention can also be applied to subsets of indices. For instance, $H_{a (b c)} = \frac{1}{2} (H_{a b c} + H_{a c b})$ .

2.1.1. The Vector Space of Symmetric Two-Tensors

In any coordinate frame a symmetric tensor R_ab (i.e., R_ab = R_ba) is represented by a symmetric matrix R_ij (2 × 2 or 3 × 3 depending on the dimension of the underlying space). In the two-dimensional case, with the underlying vector space $V^{a} \sim R^{2}$ , this means that R_ab lives in a three-dimensional vector space, which we denote by V_(ab). V_(ab) is equipped with a natural scalar product: < A_ab, B_ab >= A_abB^ab, making it into a three-dimensional Euclidean space. Here A_abB^ab = A_abB_cdg^acg^bd, i.e, the contraction of A_abB_cd over the indices a, c and b, d, and the tensor product A_abB_cd itself is the tensor of order (0, 4) given by (A_abB_cd)v^au^bw^cm^d = (A_abv^au^b)(B_cdw^cm^d) together with multi-linearity.

2.1.2. The Tensor R_abcd and the Equivalence Problem

As noted above, R_abcd given by (1) has the symmetries R_abcd = R_(ab)cd = R_ab(cd) and R_abcd = R_cdab, and it is not hard to see that this gives R_abcd six degrees of freedom in two dimensions. (See also Sect. 2.1.3.) It is also interesting to note that R_abcd provides a mapping V_(ab) → V_(ab) through

R_{a b} \mapsto R_{a b c d} R^{c d},

and that this mapping is symmetric (due to the symmetry R_abcd = R_cdab). Given R_abcd there are a number of questions one can ask, e.g.,

Feasibility—given a tensor R_abcd with the correct symmetries, can it be written in the form (1)?
Canonical decomposition—given R_abcd of the form (1), can you write R_abcd as a canonical sum of the form (1), but with a fixed number of terms (cf. eigenvector decomposition of symmetric matrices)?
Visualisation—since fourth order tensors are a bit involved, how can one visualise them in ordinary space?
Characterisation/relevant sets of invariants—what invariants are relevant from an application point of view?
The equivalence problem—in terms of components, how do we know if R_ijkl and ${\tilde{R}}_{i j k l}$ represent the same tensor when they are in different coordinate systems?

We will now focus on the equivalence problem in two dimensions. This problem can be formulated as above: given, in terms of components, two tensors (with the symmetries we consider) R_ijkl and ${\tilde{R}}_{i j k l}$ , do they represent the same tensor in the sense that there is a coordinate transformation taking the components R_ijkl into the components ${\tilde{R}}_{i j k l}$ ? In other words, does there exist an (invertible) matrix P^m_i so that

R_{i j k l} = {\tilde{R}}_{m n o p} {P^{m}}_{i} {P^{n}}_{j} {P^{o}}_{k} {P^{p}}_{l} ?

This problem can also be formulated when R_ijkl and ${\tilde{R}}_{i j k l}$ are expressed in Cartesian frames. Then the coordinate transformation must be a rotation, i.e., given by a rotation matrix Qⁱ j ∈ SO(2). Hence, the problem of (unitary) equivalence is: Given R_ijkl and ${\tilde{R}}_{i j k l}$ , both expressed in Cartesian frames, is there a matrix (applying the ‘Cartesian convention’) Q_ij ∈ SO(2) so that

R_{i j k l} = {\tilde{R}}_{m n o p} Q_{m i} Q_{n j} Q_{o k} Q_{p l} ?

2.1.3. The Voigt/Kelvin Notation

Since (in two dimensions) the space V_(ab) is three-dimensional, one can introduce coordinates, for example $R_{i j} = (\begin{matrix} x & y \\ y & z \end{matrix}) \sim (\begin{matrix} x \\ y \\ z \end{matrix})$ and use vector algebra on $R^{3}$ . This is used in the Voigt notation [15] and the related Kelvin notation [6]. As always, one must be careful to specify with respect to which basis in V_(ab) the coordinates $(\begin{matrix} x \\ y \\ z \end{matrix})$ are taken. For instance, in the correspondence $R_{i j} = (\begin{matrix} x & y \\ y & z \end{matrix}) \sim (\begin{matrix} x \\ y \\ z \end{matrix})$ , the understood basis for V_(ab) (in the understood/induced coordinate system) is ${(\begin{matrix} 1 & 0 \\ 0 & 0 \end{matrix}), (\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}), (\begin{matrix} 0 & 0 \\ 0 & 1 \end{matrix})}$ . These elements are orthogonal (viewed as vectors in V_(ab)) to each other, but not (all of them) of unit length.

Since the unit matrix plays a special role, we make the following choice. Starting with an orthonormal basis ${\hat{ξ}, \hat{η}}$ for V, (i.e., ${{\hat{ξ}}^{a}, {\hat{η}}^{a}}$ for V^a) a suitable orthonormal basis for V_(ab) is ${e_{a b}^{(1)}, e_{a b}^{(2)}, e_{a b}^{(3)}}$ where $e_{a b}^{(1)} = \frac{1}{\sqrt{2}} (ξ_{a} ξ_{b} - η_{a} η_{b})$ , $e_{a b}^{(2)} = \frac{1}{\sqrt{2}} (ξ_{a} η_{b} + η_{a} ξ_{b})$ , $e_{a b}^{(3)} = \frac{1}{\sqrt{2}} (ξ_{a} ξ_{b} + η_{a} η_{b})$ , i.e., in the induced basis we have

e_{i j}^{(1)} = \frac{1}{\sqrt{2}} (\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}) \sim \hat{x}, e_{i j}^{(2)} = \frac{1}{\sqrt{2}} (\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}) \sim \hat{y}, e_{i j}^{(3)} = \frac{1}{\sqrt{2}} (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}) \sim \hat{z} .

(2)

In this basis, we write an arbitrary element M_ab ∈ V_(ab) as $M_{i j} = (\begin{matrix} z + x & y \\ y & z - x \end{matrix})$ , which means that M_ab gets the coordinates ${\bar{M}}_{i} = \sqrt{2} (\begin{matrix} x \\ y \\ z \end{matrix})$ . Note that M_ij is positive definite if z² − x² − y² ≥ 0 and z ≥ 0. In terms of the coordinates of the Voigt notation, the tensor R_abcd corresponds to a symmetric mapping $R^{3} \to R^{3}$ , given by a symmetric 3 × 3 matrix, which also shows that the degrees of freedom for R_abcd is six.

2.1.4. Visualization in $R^{3}$

Through the Voigt notation, any symmetric two-tensor (in two dimensions) can be visualised as a vector in $R^{3}$ . Using the basis vector given by (2), we note that $e_{i j}^{(1)}$ and $e_{i j}^{(2)}$ correspond to indefinite quadratic forms, while $e_{i j}^{(3)}$ is positive definite. We also see that $e_{i j}^{(1)} + e_{i j}^{(3)}$ and $e_{i j}^{(2)} + e_{i j}^{(3)}$ are positive semi-definite.

In Fig. 1 (left) these matrices are illustrated as vectors in $R^{3}$ . The set of positive semi-definite matrices corresponds to a cone, cf. [4], indicated in blue. When the symmetric 2 × 2 matrices are viewed as vectors in $R^{3}$ , the outer product of such a vector with itself gives a symmetric 3 × 3 matrix. Hence we get a positive semi-definite quadratic form on $R^{3}$ , which can be illustrated by an (degenerate) ellipsoid in $R^{3}$ . In Fig. 1 (right) $(e_{a b}^{(1)} + e_{a b}^{(3)}) (e_{c d}^{(1)} + e_{c d}^{(3)})$ , $(e_{a b}^{(2)} + e_{a b}^{(3)}) (e_{c d}^{(2)} + e_{c d}^{(3)})$ and $e_{a b}^{(3)} e_{c d}^{(3)}$ are visualised in this manner. Note that all these quadratic forms correspond to matrices which are rank one. (Cf. the ellipsoids in Fig. 2.)

Fig. 2 — Three identical (truncated) ellipsoids in $R^{3}$ with different orientations. The two leftmost ellipsoids can be carried over to each other through a rotation around the (vertical in the figure) z-axis, which implies that they represent the same tensor R_abcd (up to the meaning discussed). The right ellipsoid, despite identical eigenvalues with the two others, represent a different tensor since the rotation which carries this ellipsoid to any of the others is not around the z-axis

2.2. Invariants, Traces and Decompositions

By an invariant, we mean a quantity that can be calculated from measurements, and which is independent of the frame/coordinate system with respect to which the measurements are performed, despite the fact that components, e.g., Tⁱ_jk themselves depend on the coordinate system. It is this property that makes invariants important, and typically they are formed via tensor products and contractions, e.g., Tⁱ_jkT^k_ilg^jl. Sometimes, the invariants have a direct geometrical meaning. For instance, for a vector vⁱ, the most natural invariant is its squared length vⁱv_i. For a tensor Tⁱ_j of order (1,1) in three dimensions, viewed as a linear mapping $R^{3} \to R^{3}$ , the most well known invariants are perhaps the trace Tⁱ_i and the determinant det(Tⁱ_j). The modulus of the determinant gives the volume scaling under the mapping given by Tⁱ_j, while the trace equals the sum of the eigenvalues. If Tⁱ_j represents a rotation matrix, then its trace is 1 + 2 cos ϕ, where ϕ is the rotation angle. In general, however, the interpretation of a given invariant may be obscure. (For an account relevant to image processing, see e.g., [9]. A different, but relevant, approach in the field of diffusion MRI is found in [20].)

2.2.1. Natural Traces and Invariants

From (1), and considering the symmetries of R_abcd, two (and only two) natural traces arise. For a tensor of order (1, 1), e.g., R_i ^j, it is natural to consider this as an ordinary matrix, and consequently use stem letters without any indices at all. To indicate this slight deviation from the standard tensor notation, we denote e.g., R_i ^j by $\bar{\bar{R}}$ . Using [·] for the trace, so that $[\bar{\bar{R}}] = Tr (\bar{\bar{R}}) = R_{a}^{a}$ , we then have

T_{a b} = R_{a b c}^{c} = \sum_{i = 1}^{n} R_{a b}^{(i)} R_{c}^{(i)^{c}} = \sum_{i = 1}^{n} R_{a b}^{(i)} [{\bar{\bar{R}}}^{(i)}],

(3)

and

S_{a b} = R_{a c b}^{c} = \sum_{i = 1}^{n} R_{a c}^{(i)} R_{b}^{(i)^{c}} .

(4)

Hence, in a Cartesian frame, where the index position is unimportant, we have for the matrices $\bar{\bar{T}} = T_{i j}$ , $\bar{\bar{S}} = S_{i j}$

\bar{\bar{T}} = \sum_{i = 1}^{n} {\bar{\bar{R}}}^{(i)} [{\bar{\bar{R}}}^{(i)}], \bar{\bar{S}} = \sum_{i = 1}^{n} {\bar{\bar{R}}}^{(i)} {\bar{\bar{R}}}^{(i)} .

To proceed there are two double traces (i.e., contracting R_abcd twice):

T = T_{a}^{a} = {R_{a}^{a}}_{c}^{c} = \sum_{i = 1}^{n} R_{a}^{(i)^{a}} R_{c}^{(i)^{c}} = \sum_{i = 1}^{n} [{\bar{\bar{R}}}^{(i)}]^{2}

(5)

and

S = S_{a}^{a} = R_{a c}^{a c} = \sum_{i = 1}^{n} R_{a c}^{(i)} R^{(i)^{a c}} = \sum_{i = 1}^{n} [({\bar{\bar{R}}}^{(i)})^{2}] .

(6)

In two dimensions, the difference T_ab−S_ab is proportional to the metric g_ab. Namely,

Lemma 1 With T_ab and S_ab given by (3) and (4), it holds that (in two dimensions)

T_{a b} - S_{a b} = \sum_{i = 1}^{n} det ({\bar{\bar{R}}}^{(i)}) g_{a b} .

Proof By linearity, it is enough to prove the statement when n = 1, i.e., when the sum has just one term. Raising the second index, and using components, the statement then is $T_{i}^{j} - S_{i}^{j} = det ({\bar{\bar{R}}}^{(1)}) δ_{i}^{j}$ . Putting ${\bar{\bar{R}}}^{(1)} = A$ , we see that T_i ^j − S_i ^j = A[A] − A² while $det ({\bar{\bar{R}}}^{(1)}) δ_{i}^{j} = det (A) I$ , and by the Cayley-Hamilton theorem in two dimensions, A[A] − A² is indeed det(A)I. □

From lemma 1, it follows that $T - S = 2 \sum_{i = 1}^{n} det ({\bar{\bar{R}}}^{(i)}) \geq 0$ . In fact the following inequalities hold.

Lemma 2 With T and S defined as above, it holds that S ≤ T ≤ 2S. If T = S, all tensors $R_{a b}^{(i)}$ have rank 1. If T = 2S, all tensors $R_{a b}^{(i)}$ are isotropic, i.e., proportional to the metric g_ab.

Proof Again, by linearity it is enough to consider one tensor ${\bar{\bar{R}}}^{(1)} = A$ . In an orthonormal frame which diagonalises A, we have $A = (\begin{matrix} a & 0 \\ 0 & c \end{matrix})$ (with a ≥ 0, c ≥ 0, a + c > 0). Hence

S = a^{2} + c^{2} \leq a^{2} + c^{2} + 2 a c = (a + c)^{2} = T = 2 (a^{2} + c^{2}) - (a - c)^{2} \leq 2 S .

The first inequality becomes equality when ac = 0, i.e., when A has rank one. The second inequality becomes equality when a = c, i.e., when A is isotropic. □

Definition 1 We define the mean rank, r_m, by r_m = T/S, with T and S as above. Hence, in two dimensions, 1 ≤ r_m ≤ 2.

2.2.2. A Canonical Decomposition

It is customary [3, 7] to decompose a tensor with the symmetries of R_abcd into a sum where one term is the completely symmetric part:

R_{a b c d} = H_{a b c d} + W_{a b c d}, where H_{a b c d} = R_{(a b c d)}, W_{a b c d} = R_{a b c d} - H_{a b c d} .

It is also customary to split H_abcd into a trace-free part and ‘trace part’. We start by defining H_ab = H_abc^c, H = H_a^a and then the trace-free part of $H_{a b} : {\overset{_{_{\circ}}}{H}}_{a b} = H_{a b} - \frac{1}{2} H g_{a b}$ so that $H_{a b} = {\overset{_{_{\circ}}}{H}}_{a b} + \frac{1}{2} H g_{a b}$ . (These decompositions can be made in any dimension, but the actual coefficients, e.g., $\frac{1}{2}$ above and $\frac{1}{8}$ and $\frac{3}{8}$ et cetera below depend on the underlying dimension.) It is straightforward to check that

{\overset{_{_{\circ}}}{H}}_{a b c d} = H_{a b c d} - g_{(a b} H_{c d)} + \frac{1}{8} H g_{(a b} g_{c d)} = H_{a b c d} - g_{(a b} {\overset{_{_{\circ}}}{H}}_{c d)} - \frac{3}{8} H g_{(a b} g_{c d)}

is also trace-free. Hence we have the decomposition

H_{a b c d} = {\overset{_{_{\circ}}}{H}}_{a b c d} + g_{(a b} H_{c d)} - \frac{1}{8} H g_{(a b} g_{c d)} = {\overset{_{_{\circ}}}{H}}_{a b c d} + g_{(a b} {\overset{_{_{\circ}}}{H}}_{c d)} + \frac{3}{8} H g_{(a b} g_{c d)} .

Moreover, due to the symmetry of R_abcd, we find that

H_{a b c d} = \frac{1}{3} (R_{a b c d} + R_{a c b d} + R_{a d b c})

and therefore that

W_{a b c d} = \frac{1}{3} (2 R_{a b c d} - R_{a c b d} - R_{a d b c})

(7)

which implies that $H_{a b} = H_{a b c}^{c} = \frac{1}{3} (T_{a b} + 2 S_{a b})$ and $W_{a b} = W_{a b c}^{c} = \frac{2}{3} (T_{a b} - S_{a b})$ .

The degres of freedom, which for R_abcd is six, is distributed, where $R_{a b c d} \sim {{\overset{_{_{\circ}}}{H}}_{a b c d}, H_{a b}, W_{a b c d}}$ , as

\underset{(6)}{R_{a b c d}} \sim {\underset{(2)}{{\overset{_{_{\circ}}}{H}}_{a b c d}}, \underset{(3)}{H_{a b}}, \underset{(1)}{W_{a b c d}}} \sim {\underset{(2)}{{\overset{_{_{\circ}}}{H}}_{a b c d}}, \underset{(2)}{{\overset{_{_{\circ}}}{H}}_{a b}}, \underset{(1)}{H}, \underset{(1)}{W_{a b c d}}} .

For H_ab (or the pair ${\overset{_{_{\circ}}}{H}}_{a b}$ , H) this is clear. The total symmetry of ${\overset{_{_{\circ}}}{H}}_{a b c d}$ leaves only five components (in a basis), ${\overset{_{_{\circ}}}{H}}_{1111}$ , ${\overset{_{_{\circ}}}{H}}_{1112}$ , ${\overset{_{_{\circ}}}{H}}_{1122}$ , ${\overset{_{_{\circ}}}{H}}_{1222}$ , ${\overset{_{_{\circ}}}{H}}_{2222}$ . However, the trace-free condition ${\overset{_{_{\circ}}}{H}}_{a b c d} g^{c d} = 0$ imposes three conditions. (In an orthonormal frame, ${\overset{_{_{\circ}}}{H}}_{1122} = - {\overset{_{_{\circ}}}{H}}_{1111}$ , ${\overset{_{_{\circ}}}{H}}_{2222} = - {\overset{_{_{\circ}}}{H}}_{1122}$ and ${\overset{_{_{\circ}}}{H}}_{1112} = - {\overset{_{_{\circ}}}{H}}_{1222}$ .) That W_abcd has only one degree of freedom follows from the following lemma.

Lemma 3 Suppose that W_abcd is given by (7), and put W_ab = W_abcdg^cd, W = W_abg^ab. Then (in two dimensions)

W_{a b c d} = \frac{W}{4} (2 g_{a b} g_{c d} - g_{a c} g_{b d} - g_{a d} g_{b c})

Proof By linearity, it is enough to consider the case when R_abcd = A_abA_cd for some (symmetric) A_ab. In terms of eigenvectors (to A^a_b) we can write A_ab = αx_ax_b + βy_ay_b, where x_ax^a = y_ay^a = 1, x_ay^a = 0. In particular g_ab = x_ax_b + y_ay_b. From (7) we then get

W_{a b c d} = \frac{1}{3} (2 R_{a b c d} - R_{a c b d} - R_{a d b c}) = \frac{1}{3} (2 A_{a b} A_{c d} - A_{a c} A_{b d} - A_{a d} A_{b c}) = \frac{1}{3} (2 (α x_{a} x_{b} + β y_{a} y_{b}) (α x_{c} x_{d} + β y_{c} y_{d}) - (α x_{a} x_{c} + β y_{a} y_{c}) (α x_{b} x_{d} + β y_{b} y_{d}) - (α x_{a} x_{d} + β y_{a} y_{d}) (α x_{b} x_{c} + β y_{b} y_{c})) .

(8)

Expanding the parentheses, the components x_ax_bx_cx_d and y_ay_by_cy_d vanish, leaving

\frac{α β}{3} (2 x_{a} x_{b} y_{c} y_{d} + 2 y_{a} y_{b} x_{c} x_{d} - x_{a} x_{c} y_{b} y_{d} - y_{a} y_{c} x_{b} x_{d} - x_{a} x_{d} y_{b} y_{c} - y_{a} y_{d} x_{b} x_{c}) = \frac{α β}{3} (2 g_{a b} g_{c d} - g_{a c} g_{b d} - g_{a d} g_{b c}),

(9)

where the last equality can be seen by inserting g_ab = x_ax_b + y_ay_b (for all indices) and expanding. Taking one trace, i.e., contracting with g^cd gives $W_{a b} = \frac{2 α β}{3} g_{a b}$ , and another trace gives $W = \frac{4 α β}{3}$ , which proves the lemma. □

3. R_abcd as a Quadratic Form on $R^{3}$

Through the orthonormal basis for the space of symmetric two-tensors (in two dimensions) given by (2), the tensor R_abcd viewed as a quadratic form can be represented by a 3 × 3-matrix. Here, we will restrict ourselves to an orthonormal basis for V_(ab), namely the basis ${e_{a b}^{(1)}, e_{a b}^{(2)}, e_{a b}^{(3)}}$ from Sect. 2.1.3, defined in terms of the orthonormal basis [ξ^a, η^a} for V^a. Thus, given R_abcd, we associate the symmetric matrix M_ij, where (the choice of an orthonormal basis justifies the mismatch of the indices i, j)

M_{i j} ≐ R_{c d}^{a b} e_{a b}^{(i)} (e^{(j)})^{c d}, 1 \leq i, j \leq 3 .

It is instructive to see how the various derived tensors show up in M_ij. In terms of the basis (2) it is natural to look at the various parts of M_ij as follows

M_{i j} ≐ (\begin{matrix} \times & \times & \times \\ \times & \times & \times \\ \times & \times & ✕ \end{matrix}) ≐ (\begin{matrix} A & \bar{v} \\ {\bar{v}}^{t} & a \end{matrix}) .

(10)

This splitting is natural for reasons which will become apparent in the next sections. Note, however, that with this representation it is tempting to consider coordinate changes in $R^{3}$ , which is not natural in this case. Rather, of interest is the change of basis in V^a and the related induced change of coordinates in the representation (10). See Sect. 3.2.

3.1. Representation of the Canonically Derived Parts of R_abcd

It is helpful to see how the components of the various tensors T_ab, S_ab, T, S, ${\overset{_{_{\circ}}}{H}}_{a b c d}$ , ${\overset{_{_{\circ}}}{H}}_{a b}$ , H and W show up as components of M_ij. As for ${\overset{_{_{\circ}}}{H}}_{a b}$ , e.g., ${\overset{_{_{\circ}}}{T}}_{a b}$ denotes the trace-free part of T_ab. Immediate is M₃₃:

M_{33} ≐ R_{c d}^{a b} e_{a b}^{(3)} (e^{(3)})^{c d} ≐ \frac{1}{2} R_{c d}^{a b} g_{a b} g^{c d} = \frac{1}{2} T_{c d} g^{c d} = \frac{1}{2} T .

(11)

Similarly, for i = 1, 2 we have

M_{i 3} ≐ \frac{1}{\sqrt{2}} R_{c d}^{a b} e_{a b}^{(i)} g^{c d} ≐ \frac{1}{\sqrt{2}} T^{a b} e_{a b}^{(i)} ≐ \frac{1}{\sqrt{2}} {\overset{_{_{\circ}}}{T}}^{a b} e_{a b}^{(i)},

(12)

where the last equality follows form the trace-freeness of $e_{a b}^{(1)}$ and $e_{a b}^{(2)}$ . This means that the components of ${\overset{_{_{\circ}}}{T}}_{a b}$ (properly rescaled) goes into M_ij as the components of $\bar{v}$ (and ${\bar{v}}^{t}$ ) in (10). The same holds for ${\overset{_{_{\circ}}}{S}}_{a b}$ and ${\overset{_{_{\circ}}}{H}}_{a b}$ , as ${\overset{_{_{\circ}}}{S}}_{a b} = {\overset{_{_{\circ}}}{T}}_{a b}$ by Lemma 1, which then implies that also ${\overset{_{_{\circ}}}{H}}_{a b} = {\overset{_{_{\circ}}}{T}}_{a b} = {\overset{_{_{\circ}}}{S}}_{a b}$ . This latter relation follows from the trace-free part of the relation $H_{a b} = \frac{1}{3} (T_{a b} + 2 S_{a b})$ . Hence

M_{i j} ≐ (\begin{matrix} A & \vec{\overset{_{_{\circ}}}{T}} \\ \vec{\overset{_{_{\circ}}}{T}}^{^{^{t}}} & \frac{1}{2} T \end{matrix}) ≐ (\begin{matrix} \frac{σ}{2} I + Å & \vec{\overset{_{_{\circ}}}{T}} \\ \vec{\overset{_{_{\circ}}}{T}}^{^{^{t}}} & \frac{1}{2} T \end{matrix}),

(13)

where $\vec{\overset{_{_{\circ}}}{T}} = \vec{\overset{_{_{\circ}}}{S}} = \vec{\overset{_{_{\circ}}}{H}}$ encodes the two degrees of freedom in ${\overset{_{_{\circ}}}{T}}_{a b} = {\overset{_{_{\circ}}}{S}}_{a b} = {\overset{_{_{\circ}}}{H}}_{a b}$ . The matrix A is decomposed as $A = \frac{σ}{2} I + Å$ where I is the (2 × 2) identity matrix and Å is trace-free part of A. In particular, [A] = σ.

To investigate [M_ij] = M₁₁ + M₂₂ + M₃₃, i.e., the trace of M_ij we note that for a general symmetric matrix $R_{i j} ≐ (\begin{matrix} a & b \\ b & c \end{matrix})$ we have $R_{i j} e_{i j}^{(1)} ≐ \frac{a - c}{\sqrt{2}}$ , $R_{i j} e_{i j}^{(2)} ≐ \frac{2 b}{\sqrt{2}}$ , $R_{i j} e_{i j}^{(3)} ≐ \frac{a + c}{\sqrt{2}}$ . When M_ij is constructed from R_abcd which is an outer product R_abR_cd the trace is given by $M_{11} + M_{22} + M_{33} = (\frac{a - c}{\sqrt{2}})^{2} + (\frac{2 b}{\sqrt{2}})^{2} + (\frac{a + c}{\sqrt{2}})^{2} = a^{2} + 2 b^{2} + c^{2}$ and from (6) this is S. Together with linearity, this shows that [M] = M₁₁ + M₂₂ + M₃₃ = S also when R_abcd is formed as in (1). Taking trace in (13), this gives

S = σ + \frac{1}{2} T, i.e., σ = S - \frac{1}{2} T .

In addition, the relations below Eq. (7) show that

{\begin{matrix} H = \frac{1}{3} (T + 2 S) \\ W = \frac{2}{3} (T - S) \end{matrix} i.e., {\begin{matrix} T = H + W \\ S = H - \frac{1}{2} W \end{matrix} so that σ = \frac{1}{2} H - W .

The two degres of freedom in Å corresponds to the two degrees of freedom in ${\overset{_{_{\circ}}}{H}}_{a b c d}$ .

3.2. The Behaviour of M_ij Under a Rotation of the Coordinate System in V^a

The components of M_ij are expressed in terms of the orthonormal basis tensors given by (2), and these in turn are based on the ON basis ${\hat{ξ}, \hat{η}}$ for V. Putting the basis vectors in a row matrix $(\hat{ξ} \hat{η})$ and the coordinates in a column matrix $(\begin{matrix} ξ \\ η \end{matrix})$ so that a vector $u = ξ \hat{ξ} + η \hat{η} = (\hat{ξ} \hat{η}) (\begin{matrix} ξ \\ η \end{matrix})$ , and considering only orthonormal frames, the relevant change of basis is given by a rotation matrix $Q (v) = Q_{v} = (\begin{matrix} \cos v & - \sin v \\ \sin v & \cos v \end{matrix})$ , i.e., we consider the change of basis

(\hat{ξ} \hat{η}) \to (\hat{\tilde{ξ}} \hat{\tilde{η}}) = (\hat{ξ} \hat{η}) (\begin{matrix} \cos v & - \sin v \\ \sin v & \cos v \end{matrix}) = (\hat{ξ} \hat{η}) Q (v) .

This means that for a vector $u = (\hat{\tilde{ξ}} \hat{\tilde{η}}) (\begin{matrix} \tilde{ξ} \\ \tilde{η} \end{matrix}) = (\hat{ξ} \hat{η}) (\begin{matrix} ξ \\ η \end{matrix})$ , the coordinates transform as

(\begin{matrix} ξ \\ η \end{matrix}) \to (\begin{matrix} \tilde{ξ} \\ \tilde{η} \end{matrix}) = Q^{- 1} (v) (\begin{matrix} ξ \\ η \end{matrix}) = Q^{t} (v) (\begin{matrix} ξ \\ η \end{matrix}) = Q (- v) (\begin{matrix} ξ \\ η \end{matrix}) .

For the components of the basis vectors $e_{a b}^{(1)}$ , $e_{a b}^{(2)}$ , $e_{a b}^{(3)}$ we find (omitting the factor $1 ∕ \sqrt{2}$ )

(\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}) \to (\begin{matrix} \cos v & \sin v \\ - \sin v & \cos v \end{matrix}) (\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}) (\begin{matrix} \cos v & - \sin v \\ \sin v & \cos v \end{matrix}) = (\begin{matrix} \cos 2 v & - \sin 2 v \\ - \sin 2 v & - \cos 2 v \end{matrix}) (\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}) \to (\begin{matrix} \cos v & \sin v \\ - \sin v & \cos v \end{matrix}) (\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}) (\begin{matrix} \cos v & - \sin v \\ \sin v & \cos v \end{matrix}) = (\begin{matrix} \sin 2 v & \cos 2 v \\ \cos 2 v & - \sin 2 v \end{matrix}) (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}) \to (\begin{matrix} \cos v & \sin v \\ - \sin v & \cos v \end{matrix}) (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}) (\begin{matrix} \cos v & - \sin v \\ \sin v & \cos v \end{matrix}) = (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}),

(14)

and this means that the components M_ij transform as

M_{i j} ≐ (\begin{matrix} A & \bar{v} \\ {\bar{v}}^{t} & a \end{matrix}) \to {\tilde{M}}_{i j} ≐ (\begin{matrix} Q_{2 v}^{t} A Q_{2 v} & Q_{2 v}^{t} \bar{v} \\ {\bar{v}}^{t} Q_{2 v} & a \end{matrix}) .

(15)

But this latter expression is just

(\begin{matrix} Q_{2 v}^{t} & \bar{0} \\ {\bar{0}}^{t} & 1 \end{matrix}) (\begin{matrix} A & \bar{v} \\ {\bar{v}}^{t} & a \end{matrix}) (\begin{matrix} Q_{2 v} & \bar{0} \\ {\bar{0}}^{t} & 1 \end{matrix}),

hence we have the following important remark/observation:

Remark 1 Viewing the matrix M_ij as an ellipsoid in $R^{3}$ , the effect of a rotation by an angle v in V^a corresponds to a rotation of the ellipsoid by an angle 2v around the z-axis in $R^{3}$ (where the z-axis corresponds to the ‘isotropic direction’ given by g_ab).

4. The Equivalence Problem for R_abcd

The equivalence problem for R_abcd can be formulated in different ways (for an account in three dimensions, we refer to [3]). Given two tensors R_abcd and ${\tilde{R}}_{a b c d}$ , both with the symmetries implied by (1), the question whether they are the same or not is straightforward as one can compare the components in any basis. However, R_abcd and ${\tilde{R}}_{a b c d}$ could live in different (but isomorphic) vector spaces, e.g. two tangent spaces at different points, and the concept of equality becomes less clear. On the other hand, in terms of components R_ijkl and ${\tilde{R}}_{i j k l}$ , one could ask whether there is a change of coordinates which takes one set of components into the other. If so, one can find a (invertible) matrix Pⁱ_j so that

R_{i j k l} = {\tilde{R}}_{m n o p} {P^{m}}_{i} {P^{n}}_{j} {P^{o}}_{k} {P^{p}}_{l},

and the tensors are then said to be equivalent. As already mentioned, it is convenient to restrict the coordinate systems to orthonormal coordinates. This means that two different coordinate systems differ only by their orientation, i.e., the change of coordinates are given by a rotation matrix Q ∈ SO(2). Under the ‘Cartesian convention’ that all indices are written as subscripts, R_abcd and ${\tilde{R}}_{a b c d}$ are equivalent if there is a matrix Q ∈ SO(2) so that (their Cartesian components satisfy)

R_{i j k l} = {\tilde{R}}_{m n o p} Q_{m i} Q_{n j} Q_{o k} Q_{p l} .

4.1. Different Ways to Characterize the Equivalence of R_abcd and ${\tilde{R}}_{a b c d}$

In this section, we will discuss three ways to determine whether two tensors R_abcd and ${\tilde{R}}_{a b c d}$ are equivalent or not. In Sects. 4.1.1 and 4.1.2 we present two such methods briefly, while Sect. 4.1.3, which is more complete, contains the main result of this work.

As mentioned in Sect. 1.1, the results of Sects. 4.1.1 and 4.1.2, which may be used in their own rights, rely on particular choices of basis matrices for V_(ab). The formulation in Sect. 4.1.3 on the other hand, is expressed in the components of R_abcd (in any coordinate system) directly.

4.1.1. Orientation of the Ellipsoid in $R^{3}$

A necessary condition for R_abcd and ${\tilde{R}}_{a b c d}$ to be equivalent is that their corresponding 3 × 3-matrices M_ij and ${\tilde{M}}_{i j}$ have the same eigenvalues. On the other hand, this is not sufficient since the representation in $R^{3}$ should reflect the freedom in rotating the coordinate system in $V^{a} \sim R^{2}$ . With the coordinates adopted, this corresponds to a rotation of the associated ellipsoid around the z-axis in $R^{3}$ (see Remark 1 in Sect. 3.2). This is illustrated in Fig. 2 where three ellipsoids, all representing positive definite symmetric mappings having identical eigenvalues, are shown. The two first ellipsoids can be rotated into each other by a rotation around the z-axis. This implies that the corresponding tensors R_abcd and ${\tilde{R}}_{a b c d}$ are equivalent. The third ellipsoid can also be rotated into the two others, but these rotations are around directions other than the z-axis, which means that this ellipsoid represents a different tensor.

In the generic case, with all eigenvalues different, it is easy to test whether two different ellipsoids can be transfered into each other through a rotation around the z-axis. This will be the case if the corresponding eigenvectors (of M_ij and ${\tilde{M}}_{i j}$ ) have the same angle with the z-axis. Hence it is just a matter of checking the z-components of the three normalized eigenvectors and see if they are equal up to sign.

4.1.2. Components in a Canonical Coordinate System

In a sense, this is the most straightforward method. In a coordinate system which respects $e_{a b}^{(3)}$ as the z-axis in $V_{(a b)} \sim R^{3}$ , two tensors R_abcd and ${\tilde{R}}_{a b c d}$ are equivalent if there is a rotation matrix (in two dimensions) Q such that

(\begin{matrix} A & \vec{\overset{_{_{\circ}}}{T}} \\ \vec{\overset{_{_{\circ}}}{T}}^{^{^{t}}} & \frac{1}{2} T \end{matrix}) = (\begin{matrix} Q^{t} \tilde{A} Q & Q^{t} \vec{\overset{_{_{\circ}}}{\tilde{T}}} \\ \vec{\overset{_{_{\circ}}}{\tilde{T}}}^{^{^{t}}} Q & \frac{1}{2} \tilde{T} \end{matrix}) .

(16)

Hence, equivalence can be easily tested by first checking that $T = \tilde{T}$ and that $‖ \vec{\overset{_{_{\circ}}}{T}} ‖ = ‖ \vec{\overset{_{_{\circ}}}{\tilde{T}}} ‖$ . If this is the case, (and if $‖ \vec{\overset{_{_{\circ}}}{T}} ‖ > 0$ ) one determines the rotation matrix Q which gives $\vec{\overset{_{_{\circ}}}{T}} = Q^{t} \vec{\overset{_{_{\circ}}}{\tilde{T}}}$ , and equivalence is then determined by if $A = Q^{t} \tilde{A} Q$ or not. If $‖ \vec{\overset{_{_{\circ}}}{T}} ‖ = ‖ \vec{\overset{_{_{\circ}}}{\tilde{T}}} ‖ = 0$ , the equivalence of A and $\tilde{A}$ can be determined directly, i.e., by checking whether $[A] = [\tilde{A}]$ and $[A^{2}] = [{\tilde{A}}^{2}]$ or not.

4.1.3. Equivalence Through (algebraic) Invariants of R_abcd

If a solution is found, this is perhaps the most satisfactory way to establish equivalence, in particular if the invariants are constructed by simple algebraic operations only. (For instance, to a symmetric 3 × 3-matrix A one can take the three eigenvalues as invariants or else for instance the traces of A, A² and A³. The former set requires some calculations, but the latter is immediate.)

Examples of invariants are T = R_abcdg^abg^cd, S = R_abcdg^acg^bd and the invariants H = H_abg^ab, W = W_abg^ab. To produce the invariants, we use the tensor R_abcd and the metric g_ab. However, if we regard $V^{a} \sim R^{2}$ as oriented, so that the orthonormal basis ${\hat{ξ}, \hat{η}}$ for V^a also is oriented, then invariants can also be formed in another way. Namely, since the space of symmetric 2 × 2 matrices is 3-dimensional, and since the metric g_ab singles out a 1-dimensional subspace, it also determines a 2-dimensional subspace L; all elements orthogonal to g_ab. This subspace is the set of all symmetric 2 × 2 matrices which are also trace-free. L can be given an orientation through an area form, which in turn inherits the orientation from V^a.

In general, with right-handed Cartesian coordinates x¹, x², the area form ϵ is given by ϵ = dx¹ ∧ dx² where (ω ∧ μ)_ab = ω_aμ_b − ω_bμ_a. With the orthonormal basis ${\hat{ξ}, \hat{η}}$ (for V^a ) also right handed, we define, cf. (2),

e_{a b}^{(1)} = \frac{1}{\sqrt{2}} ({\hat{ξ}}_{a} {\hat{ξ}}_{b} - {\hat{η}}_{a} {\hat{η}}_{b}), e_{a b}^{(2)} = \frac{1}{\sqrt{2}} ({\hat{ξ}}_{a} {\hat{η}}_{b} + {\hat{η}}_{a} {\hat{ξ}}_{b}) .

(17)

The area form on L is then ϵ ~ e⁽¹⁾ ∧ e⁽²⁾, or

ϵ \sim E_{a b c d} = e_{a b}^{(1)} e_{c d}^{(2)} - e_{a b}^{(2)} e_{c d}^{(1)} .

(18)

It is not hard to see that this definition is independent of the orientation of ${\hat{ξ}, \hat{η}}$ . We observe that $2 E_{a b c d} = ({\hat{ξ}}_{a} {\hat{ξ}}_{b} - {\hat{η}}_{a} {\hat{η}}_{b}) ({\hat{ξ}}_{c} {\hat{η}}_{d} + {\hat{η}}_{c} {\hat{ξ}}_{d}) - ({\hat{ξ}}_{a} {\hat{η}}_{b} + {\hat{η}}_{a} {\hat{ξ}}_{b}) ({\hat{ξ}}_{c} {\hat{ξ}}_{d} - {\hat{η}}_{c} {\hat{η}}_{d})$ . By replacing $\hat{ξ}$ by $\hat{ω} = \cos v \hat{ξ} + \sin v \hat{η}$ and $\hat{η}$ by $\hat{μ} = - \sin v \hat{ξ} + \cos v \hat{η}$ , i.e., a rotated orthonormal basis, it is straightforward to check that

({\hat{ω}}_{a} {\hat{ω}}_{b} - {\hat{μ}}_{a} {\hat{μ}}_{b}) ({\hat{ω}}_{c} {\hat{μ}}_{d} + {\hat{μ}}_{c} {\hat{ω}}_{d}) - ({\hat{ω}}_{a} {\hat{μ}}_{b} + {\hat{μ}}_{a} {\hat{ω}}_{b}) ({\hat{ω}}_{c} {\hat{ω}}_{d} - {\hat{μ}}_{c} {\hat{μ}}_{d}) = ({\hat{ξ}}_{a} {\hat{ξ}}_{b} - {\hat{η}}_{a} {\hat{η}}_{b}) ({\hat{ξ}}_{c} {\hat{η}}_{d} + {\hat{η}}_{c} {\hat{ξ}}_{d}) - ({\hat{ξ}}_{a} {\hat{η}}_{b} + {\hat{η}}_{a} {\hat{ξ}}_{b}) ({\hat{ξ}}_{c} {\hat{ξ}}_{d} - {\hat{η}}_{c} {\hat{η}}_{d})

(19)

so that E_abcd is well defined. We recollect that area form E_abcd is defined, through the induced metric, on the plane L (which in turn is also defined through the metric g_ab) and the orientation on V^a. Hence E_abcd can be used when forming invariants.

We will now state the result of this work, namely the existence of six invariants which can be used to investigate equivalence of two tensors R_abcd and ${\tilde{R}}_{a b c d}$ . We start by defining

S = R_{a b c d} g^{a c} g^{b d} T = R_{a b c d} g^{a b} g^{c d} J_{0} = R_{a b c d} R^{a b c d} J_{1} = T^{a b} T_{a b} J_{2} = R_{a b c d} T^{a b} T^{c d} J_{3} = T^{a b} R_{a b c d} E^{c d e f} T_{e f} .

(20)

where E_abcd is defined by (17) and (18). Similarly, we define $\tilde{S}$ , $\tilde{T}$ , ${\tilde{J}}_{0}$ , ${\tilde{J}}_{1}$ , ${\tilde{J}}_{2}$ and ${\tilde{J}}_{3}$ as the corresponding invariants formed from ${\tilde{R}}_{a b c d}$ . We make the remark that for most of these invariants, their immediate interpretations still remain to be found. Rather, their value lie in the fact that they form a set which can be used to establish the equivalence in Theorem 1 below. On the other hand, some interpretations are possible. In particular, the quotient T/S (see Definition 1) lies in the interval [1, 2] and has the meaning given by Lemma 2.

Theorem 1 Suppose that $R_{a b c d} = \sum_{i = 1}^{n} R_{a b}^{(i)} R_{c d}^{(i)}$ , with $R_{a b}^{(i)} \geq 0$ and that R_ijkl are the components of R_abcd in some basis. Suppose also that ${\tilde{R}}_{a b c d} = \sum_{i = 1}^{\tilde{n}} {\tilde{R}}_{a b}^{(i)} {\tilde{R}}_{c d}^{(i)}$ , with ${\tilde{R}}_{a b}^{(i)} \geq 0$ and that ${\tilde{R}}_{i j k l}$ are the components of ${\tilde{R}}_{a b c d}$ in some, possibly unrelated, basis. If (and only if) $S = \tilde{S}$ , $T = \tilde{T}$ , $J_{0} = {\tilde{J}}_{0}$ , $J_{1} = {\tilde{J}}_{1}$ , $J_{2} = {\tilde{J}}_{2}$ , $J_{3} = {\tilde{J}}_{3}$ , then there is a transformation matrix Pⁱ_j such that

R_{i j k l} = {\tilde{R}}_{m n o p} {P^{m}}_{i} {P^{n}}_{j} {P^{o}}_{k} {P^{p}}_{l} .

Proof Since the invariants are defined without reference to any basis, it is sufficient to consider the components expressed in an orthonormal frame, and in that case we must prove the existence of a rotation matrix Q ∈ SO(2) so that

R_{i j k l} = {\tilde{R}}_{m n o p} Q_{m i} Q_{n j} Q_{o k} Q_{p l} .

Since

R_{a b c d} = M_{i j} e_{a b}^{(i)} e_{c d}^{(j)},

(21)

we can consider the invariants formed from the components of

M_{i j} = (\begin{matrix} A & \bar{u} \\ {\bar{u}}^{t} & c \end{matrix}) and {\tilde{M}}_{i j} = (\begin{matrix} \tilde{A} & \tilde{\bar{u}} \\ {\tilde{\bar{u}}}^{t} & \tilde{c} \end{matrix})

(22)

and we must demonstrate the existence of a rotation matrix Q = Q_2v such that

\tilde{A} = Q_{2 v}^{t} A Q_{2 v}, \tilde{\bar{u}} = Q_{2 v}^{t} \bar{u}, \tilde{c} = c .

(23)

We make the ansatz

M_{i j} = (\begin{matrix} \begin{matrix} \frac{σ}{2} + a \\ b \end{matrix} \begin{matrix} b \\ \frac{σ}{2} - a \end{matrix} & \begin{matrix} x \\ y \end{matrix} \\ x y & c \end{matrix}), {\tilde{M}}_{i j} = (\begin{matrix} \begin{matrix} \frac{\tilde{σ}}{2} + \tilde{a} \\ \tilde{b} \end{matrix} \begin{matrix} \tilde{b} \\ \frac{\tilde{σ}}{2} - \tilde{a} \end{matrix} & \begin{matrix} \tilde{x} \\ \tilde{y} \end{matrix} \\ \tilde{x} \tilde{y} & \tilde{c} \end{matrix}) .

(24)

Through (21) it is straightforward to see that

S = σ + c, T = 2 c, J_{0} = 2 (a^{2} + b^{2}) + c^{2} + σ^{2} ∕ 2 + 2 (x^{2} + y^{2}), J_{1} = 2 (c^{2} + x^{2} + y^{2})

so if $S = \tilde{S}$ , $T = \tilde{T}$ , $J_{0} = {\tilde{J}}_{0}$ , $J_{1} = {\tilde{J}}_{1}$ , it follows that $σ = \tilde{σ}$ , $c = \tilde{c}$ , $a^{2} + b^{2} = {\tilde{a}}^{2} + {\tilde{b}}^{2}$ and $x^{2} + y^{2} = {\tilde{x}}^{2} + {\tilde{y}}^{2}$ . Since the isotropic part of A, i.e., $\frac{σ}{2} I$ is unaffected by a rotation of the coordinate system, we consider the traceless parts $Å = (\begin{matrix} a & b \\ b & - a \end{matrix})$ , $\overset{_{\circ}}{\tilde{A}} = (\begin{matrix} \tilde{a} & \tilde{b} \\ \tilde{b} & - \tilde{a} \end{matrix})$ , and the task is to assert a rotation matrix Q such that

(\begin{matrix} a & b \\ b & - a \end{matrix}) = Q^{t} (\begin{matrix} \tilde{a} & \tilde{b} \\ \tilde{b} & - \tilde{a} \end{matrix}) Q, (\begin{matrix} x \\ y \end{matrix}) = Q^{t} (\begin{matrix} \tilde{x} \\ \tilde{y} \end{matrix}),

if also $J_{2} = {\tilde{J}}_{2}$ , $J_{3} = {\tilde{J}}_{3}$ . Again it is straightforward to calculate the remaining invariants, and we find

J_{2} = 4 b x y + 2 a (x^{2} - y^{2}) + 2 c^{3} + (4 c + σ) (x^{2} + y^{2}) J_{3} = 4 a x y - 2 b (x^{2} - y^{2}) .

and similarly for ${\tilde{J}}_{2}$ , ${\tilde{J}}_{3}$ . Hence, (since $σ = \tilde{σ}$ , $c = \tilde{c}$ )

a^{2} + b^{2} = {\tilde{a}}^{2} + {\tilde{b}}^{2} x^{2} + y^{2} = {\tilde{x}}^{2} + {\tilde{y}}^{2} 2 b x y + a (x^{2} - y^{2}) = 2 \tilde{b} \tilde{x} \tilde{y} + \tilde{a} ({\tilde{x}}^{2} - {\tilde{y}}^{2}) 2 a x y - b (x^{2} - y^{2}) = 2 \tilde{a} \tilde{x} \tilde{y} - \tilde{b} ({\tilde{x}}^{2} - {\tilde{y}}^{2}) .

(25)

Suppose first that x² + y² > 0. The equality $x^{2} + y^{2} = {\tilde{x}}^{2} + {\tilde{y}}^{2}$ then guarantees the existence of the rotation matrix Q which is determined via the relation $(\begin{matrix} x \\ y \end{matrix}) = Q^{t} (\begin{matrix} \tilde{x} \\ \tilde{y} \end{matrix})$ . This can also be expressed as $Q_{1}^{t} (\begin{matrix} x \\ y \end{matrix}) = Q_{2}^{t} (\begin{matrix} \tilde{x} \\ \tilde{y} \end{matrix})$ for some rotation matrices Q₁, Q₂, where $Q = Q_{2} Q_{1}^{t}$ . We now choose the rotation matrix Q₁ so that in the untilded coordinates, y = 0. Similarly we choose Q₂ so that for the tilded coordinates, we get a frame where $\tilde{y} = 0$ . The equalities between the invariants in (25) then become

a^{2} + b^{2} = {\tilde{a}}^{2} + {\tilde{b}}^{2} x^{2} = {\tilde{x}}^{2} a x^{2} = \tilde{a} {\tilde{x}}^{2} - b x^{2} = - \tilde{b} {\tilde{x}}^{2},

so that $a = \tilde{a}$ , $b = \tilde{b}$ . This proves the theorem when x² + y² > 0. When $x^{2} + y^{2} = {\tilde{x}}^{2} + {\tilde{y}}^{2} = 0$ , i.e., $x = y = \tilde{x} = \tilde{y} = 0$ , the remaining equality $a^{2} + b^{2} = {\tilde{a}}^{2} + {\tilde{b}}^{2}$ is sufficient since we can again choose frames in which $b = \tilde{b} = 0$ and $a > 0, \tilde{a} > 0$ . It then follows that $a = \tilde{a}$ . □

5. Discussion

In this work, we started with a family of symmetric positive (semi-)definite tensors in two dimensions and considered its variance. This lead us to a fourth order tensor R_abcd with the same symmetries as the elasticity tensor in continuum mechanics. After listing a number of possible issues to address, we focused on the equivalence problem. Namely, given the components of two such tensors R_abcd and ${\tilde{R}}_{a b c d}$ , how can one determine if they represent the same tensor (but in different coordinate systems) or not? In Sect. 4, we saw that this could be investigated in different ways. The result of Theorem 1 is most satisfactory in the sense that it is expressible in terms of the components of the fourth order tensors directly.

There are two natural extensions and/or ways to continue this work. The first is to apply the result to realistic families of e.g., diffusion tensors in two dimensions. The objective is then, apart from establishing possible equivalences, to investigate the geometric meaning of the invariants. The other natural continuation is to investigate the corresponding problem in three dimensions. The degrees of freedom of R_abcd will then increase from 6 to 21, leaving us with a substantially harder, but also perhaps more interesting, problem.

Acknowledgements

The authors acknowledge the following sources for funding: Swedish Foundation for Strategic Research AM13-0090, the Swedish Research Council 2015-05356 and 2016-04482, Linköping University Center for Industrial Information Technology (CENIIT), VINNOVA/ITEA3 17021 IMPACT, Analytic Imaging Diagnostics Arena (AIDA), and National Institutes of Health P41EB015902.

Footnotes

For the notation of tensors used here, see Sect. 2.1.

Contributor Information

Magnus Herberthson, Department of Mathematics, Linköping University, Linköping, Sweden.

Evren Özarslan, Department of Biomedical Engineering, Linköping University, Linköping, Sweden.

Carl-Fredrik Westin, Department of Radiology Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA.

References

1.Basser PJ, Mattiello J, LeBihan D: MR diffusion tensor spectroscopy and imaging. Biophys. J 66(1), 259–267 (1994) [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Basser PJ, Pajevic S: A normal distribution for tensor-valued random variables: applications to diffusion tensor MRI. IEEE Trans. Med. Imaging 22(7), 785–94 (2003). 10.1109/TMI.2003.815059 [DOI] [PubMed] [Google Scholar]
3.Boehler JP, Kirillov AA Jr., Onat ET: On the polynomial invariants of the elasticity tensor. J. Elast 34(2), 97–110 (1994) [Google Scholar]
4.Burgeth B, Didas S, Florack L, Weickert J: A generic approach to diffusion filtering of matrix-fields. Computing 81, 179–197 (2007). 10.1007/s00607-007-0248-9 [DOI] [Google Scholar]
5.Callaghan PT: Translational Dynamics and Magnetic Resonance: Principles of Pulsed Gradient Spin Echo NMR. Oxford University Press, New York: (2011) [Google Scholar]
6.Helbig K: Review paper: What Kelvin might have written about elasticity. Geophys. Prospect 61, 1–20 (2013). 10.1111/j.1365-2478.2011.01049.x [DOI] [Google Scholar]
7.Itin Y, Hehl FW: Irreducible decompositions of the elasticity tensor under the linear and orthogonal groups and their physical consequences. J. Phys.: Conf. Ser 597, 012046 (2015) [Google Scholar]
8.Jian B, Vemuri BC, Özarslan E, Carney PR, Mareci TH: A novel tensor distribution model for the diffusion-weighted MR signal. NeuroImage 37(1), 164–176 (2007). 10.1016/j.neuroimage.2007.03.074 [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Kanatani K: Group-Theoretical Methods in Image Understanding. Springer, Berlin: (1990) [Google Scholar]
10.Knutsson H: Representing local structure using tensors. In: Proceedings of the 6th Scandinavian Conference on Image Analysis, pp. 244–251. Oulu University, Oulu: (1989) [Google Scholar]
11.Özarslan E, Yolcu C, Herberthson M, Westin CF, Knutsson H: Effective potential for magnetic resonance measurements of restricted diffusion. Front. Phys 5, 68 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Shakya S, Batool N, Özarslan E, Knutsson H: Multi-fiber reconstruction using probabilistic mixture models for diffusion MRI examinations of the brain. In: Schultz T, Özarslan E, Hotz I (eds.) Modeling, Analysis, and Visualization of Anisotropy, pp. 283–308. Springer International Publishing, Cham: (2017) [Google Scholar]
13.Slaughter WS: The Linearized Theory of Elasticity. Birkhäuser, Basel: (2002) [Google Scholar]
14.Thomson W: Xxi. elements of a mathematical theory of elasticity. Philso. Trans. R. Soc. Lond 146, 481–498 (1856) [Google Scholar]
15.Voigt W: Lehrbuch Der Kristallphysik. Vieweg + Teubner Verlag; (1928) [Google Scholar]
16.Wald RM: General Relativity. University of Chicago Press, Chicago: (1984) [Google Scholar]
17.Weickert J: Anisotropic Diffusion in Image Processing. Teubner-Verlag, Stuttgart: (1998) [Google Scholar]
18.Westin CF, Knutsson H, Pasternak O, Szczepankiewicz F, Özarslan E, van Westen D, Mattisson C, Bogren M, O’Donnell LJ, Kubicki M, Topgaard D, Nilsson M: Q-space trajectory imaging for multidimensional diffusion MRI of the human brain. NeuroImage 135, 345–62 (2016). 10.1016/j.neuroimage.2016.02.039 [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Yolcu C, Memiç M, Şimşek K, Westin CF, Özarslan E: NMR signal for particles diffusing under potentials: from path integrals and numerical methods to a model of diffusion anisotropy. Phys. Rev. E 93, 052602 (2016) [DOI] [PubMed] [Google Scholar]
20.Zucchelli M, Deslauriers-Gauthier S, Deriche R: A closed-form solution of rotation invariant spherical harmonic features in diffusion MRI, pp. 77–89. Springer, Cham: (2019) [Google Scholar]

[R1] 1.Basser PJ, Mattiello J, LeBihan D: MR diffusion tensor spectroscopy and imaging. Biophys. J 66(1), 259–267 (1994) [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] 2.Basser PJ, Pajevic S: A normal distribution for tensor-valued random variables: applications to diffusion tensor MRI. IEEE Trans. Med. Imaging 22(7), 785–94 (2003). 10.1109/TMI.2003.815059 [DOI] [PubMed] [Google Scholar]

[R3] 3.Boehler JP, Kirillov AA Jr., Onat ET: On the polynomial invariants of the elasticity tensor. J. Elast 34(2), 97–110 (1994) [Google Scholar]

[R4] 4.Burgeth B, Didas S, Florack L, Weickert J: A generic approach to diffusion filtering of matrix-fields. Computing 81, 179–197 (2007). 10.1007/s00607-007-0248-9 [DOI] [Google Scholar]

[R5] 5.Callaghan PT: Translational Dynamics and Magnetic Resonance: Principles of Pulsed Gradient Spin Echo NMR. Oxford University Press, New York: (2011) [Google Scholar]

[R6] 6.Helbig K: Review paper: What Kelvin might have written about elasticity. Geophys. Prospect 61, 1–20 (2013). 10.1111/j.1365-2478.2011.01049.x [DOI] [Google Scholar]

[R7] 7.Itin Y, Hehl FW: Irreducible decompositions of the elasticity tensor under the linear and orthogonal groups and their physical consequences. J. Phys.: Conf. Ser 597, 012046 (2015) [Google Scholar]

[R8] 8.Jian B, Vemuri BC, Özarslan E, Carney PR, Mareci TH: A novel tensor distribution model for the diffusion-weighted MR signal. NeuroImage 37(1), 164–176 (2007). 10.1016/j.neuroimage.2007.03.074 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.Kanatani K: Group-Theoretical Methods in Image Understanding. Springer, Berlin: (1990) [Google Scholar]

[R10] 10.Knutsson H: Representing local structure using tensors. In: Proceedings of the 6th Scandinavian Conference on Image Analysis, pp. 244–251. Oulu University, Oulu: (1989) [Google Scholar]

[R11] 11.Özarslan E, Yolcu C, Herberthson M, Westin CF, Knutsson H: Effective potential for magnetic resonance measurements of restricted diffusion. Front. Phys 5, 68 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Shakya S, Batool N, Özarslan E, Knutsson H: Multi-fiber reconstruction using probabilistic mixture models for diffusion MRI examinations of the brain. In: Schultz T, Özarslan E, Hotz I (eds.) Modeling, Analysis, and Visualization of Anisotropy, pp. 283–308. Springer International Publishing, Cham: (2017) [Google Scholar]

[R13] 13.Slaughter WS: The Linearized Theory of Elasticity. Birkhäuser, Basel: (2002) [Google Scholar]

[R14] 14.Thomson W: Xxi. elements of a mathematical theory of elasticity. Philso. Trans. R. Soc. Lond 146, 481–498 (1856) [Google Scholar]

[R15] 15.Voigt W: Lehrbuch Der Kristallphysik. Vieweg + Teubner Verlag; (1928) [Google Scholar]

[R16] 16.Wald RM: General Relativity. University of Chicago Press, Chicago: (1984) [Google Scholar]

[R17] 17.Weickert J: Anisotropic Diffusion in Image Processing. Teubner-Verlag, Stuttgart: (1998) [Google Scholar]

[R18] 18.Westin CF, Knutsson H, Pasternak O, Szczepankiewicz F, Özarslan E, van Westen D, Mattisson C, Bogren M, O’Donnell LJ, Kubicki M, Topgaard D, Nilsson M: Q-space trajectory imaging for multidimensional diffusion MRI of the human brain. NeuroImage 135, 345–62 (2016). 10.1016/j.neuroimage.2016.02.039 [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] 19.Yolcu C, Memiç M, Şimşek K, Westin CF, Özarslan E: NMR signal for particles diffusing under potentials: from path integrals and numerical methods to a model of diffusion anisotropy. Phys. Rev. E 93, 052602 (2016) [DOI] [PubMed] [Google Scholar]

[R20] 20.Zucchelli M, Deslauriers-Gauthier S, Deriche R: A closed-form solution of rotation invariant spherical harmonic features in diffusion MRI, pp. 77–89. Springer, Cham: (2019) [Google Scholar]

PERMALINK

Variance Measures for Symmetric Positive (Semi-) Definite Tensors in Two Dimensions

Magnus Herberthson

Evren Özarslan

Carl-Fredrik Westin

Abstract

1. Introduction

1.1. Outline

2. Preliminaries

2.1. Tensor Notation and Representations

2.1.1. The Vector Space of Symmetric Two-Tensors

2.1.2. The Tensor R_abcd and the Equivalence Problem

2.1.3. The Voigt/Kelvin Notation

2.1.4. Visualization in $R^{3}$

Fig. 1.

Fig. 2.

2.2. Invariants, Traces and Decompositions

2.2.1. Natural Traces and Invariants

2.2.2. A Canonical Decomposition

3. R_abcd as a Quadratic Form on $R^{3}$

3.1. Representation of the Canonically Derived Parts of R_abcd

3.2. The Behaviour of M_ij Under a Rotation of the Coordinate System in V^a

4. The Equivalence Problem for R_abcd

4.1. Different Ways to Characterize the Equivalence of R_abcd and ${\tilde{R}}_{a b c d}$

4.1.1. Orientation of the Ellipsoid in $R^{3}$

4.1.2. Components in a Canonical Coordinate System

4.1.3. Equivalence Through (algebraic) Invariants of R_abcd

5. Discussion

Acknowledgements

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Variance Measures for Symmetric Positive (Semi-) Definite Tensors in Two Dimensions

Magnus Herberthson

Evren Özarslan

Carl-Fredrik Westin

Abstract

1. Introduction

1.1. Outline

2. Preliminaries

2.1. Tensor Notation and Representations

2.1.1. The Vector Space of Symmetric Two-Tensors

2.1.2. The Tensor Rabcd and the Equivalence Problem

2.1.3. The Voigt/Kelvin Notation

2.1.4. Visualization in R3

Fig. 1.

Fig. 2.

2.2. Invariants, Traces and Decompositions

2.2.1. Natural Traces and Invariants

2.2.2. A Canonical Decomposition

3. Rabcd as a Quadratic Form on R3

3.1. Representation of the Canonically Derived Parts of Rabcd

3.2. The Behaviour of Mij Under a Rotation of the Coordinate System in Va

4. The Equivalence Problem for Rabcd

4.1. Different Ways to Characterize the Equivalence of Rabcd and R~abcd

4.1.1. Orientation of the Ellipsoid in R3

4.1.2. Components in a Canonical Coordinate System

4.1.3. Equivalence Through (algebraic) Invariants of Rabcd

5. Discussion

Acknowledgements

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

2.1.2. The Tensor R_abcd and the Equivalence Problem

2.1.4. Visualization in $R^{3}$

3. R_abcd as a Quadratic Form on $R^{3}$

3.1. Representation of the Canonically Derived Parts of R_abcd

3.2. The Behaviour of M_ij Under a Rotation of the Coordinate System in V^a

4. The Equivalence Problem for R_abcd

4.1. Different Ways to Characterize the Equivalence of R_abcd and ${\tilde{R}}_{a b c d}$

4.1.1. Orientation of the Ellipsoid in $R^{3}$

4.1.3. Equivalence Through (algebraic) Invariants of R_abcd