The minimum number of rotations about two axes for constructing an arbitrarily fixed rotation

Mitsuru Hamada

doi:10.1098/rsos.140145

. 2014 Nov 26;1(3):140145. doi: 10.1098/rsos.140145

The minimum number of rotations about two axes for constructing an arbitrarily fixed rotation

Mitsuru Hamada ^1,^✉

PMCID: PMC4448841 PMID: 26064554

Abstract

For any pair of three-dimensional real unit vectors $\hat{m}$ and $\hat{n}$ with $| {\hat{m}}^{T} \hat{n} | < 1$ and any rotation U, let $N_{\hat{m}, \hat{n}} (U)$ denote the least value of a positive integer k such that U can be decomposed into a product of k rotations about either $\hat{m}$ or $\hat{n}$ . This work gives the number $N_{\hat{m}, \hat{n}} (U)$ as a function of U. Here, a rotation means an element D of the special orthogonal group SO(3) or an element of the special unitary group SU(2) that corresponds to D. Decompositions of U attaining the minimum number $N_{\hat{m}, \hat{n}} (U)$ are also given explicitly.

Keywords: SU(2), SO(3), rotation

2. Introduction

In this work, an issue on optimal constructions of rotations in the Euclidean space $R^{3}$ , under some restriction, is addressed and solved. By a rotation or rotation matrix, we usually mean an element of the special orthogonal group SO(3). However, we follow the custom, in quantum physics, to call not only an element of SO(3) but also that of the special unitary group SU(2) a rotation. This is justified by the well-known homomorphism from SU(2) onto SO(3) (§2.4). Given a pair of three-dimensional real unit vectors $\hat{m}$ and $\hat{n}$ with $| {\hat{m}}^{T} \hat{n} | < 1$ , where ${\hat{m}}^{T}$ denotes the transpose of $\hat{m}$ , let $N_{\hat{m}, \hat{n}} (A)$ denote the least value of a positive integer k such that any rotation in $A$ can be decomposed into (constructed as) a product of k rotations about either $\hat{m}$ or $\hat{n}$ , where $A = SU (2), SO (3)$ . It is known that $N_{\hat{m}, \hat{n}} (SO (3)) = N_{\hat{m}, \hat{n}} (SU (2)) = ⌈ π / \arccos | {\hat{m}}^{T} \hat{n} | ⌉ + 1$ for any pair of three-dimensional real unit vectors $\hat{m}$ and $\hat{n}$ with $| {\hat{m}}^{T} \hat{n} | < 1$ [1,2].

Then, a natural question arises: What is the least value, $N_{\hat{m}, \hat{n}} (U)$ , of a positive integer k such that an arbitrarily fixed rotation U can be decomposed into a product of k rotations about either $\hat{m}$ or $\hat{n}$ ? In this work, the minimum number $N_{\hat{m}, \hat{n}} (U)$ is given as an explicit function of U, where U is expressed in terms of parameters known as Euler angles [3,4]. Moreover, optimal, that is minimum-achieving, decompositions (constructions) of any fixed element U∈SU(2) are presented explicitly.

In this work, not only explicit constructions but also simple inequalities on geometric quantities, which directly show lower bounds on the number of constituent rotations, will be presented. Remarkably, the proposed explicit constructions meet the obtained lower bounds, which shows both the optimality of the constructions and the tightness of the bounds.

The results in this work were obtained before the author came to know Lowenthal's formula on $N_{\hat{m}, \hat{n}} (SO (3))$ [1,2] and a related result [5]. Prior to this work, the work by D'Alessandro [5] has treated the issue of determining $N_{\hat{m}, \hat{n}} (D)$ , D∈SO(3). That interesting result [5], however, gave $N_{\hat{m}, \hat{n}} (D)$ , D∈SO(3), only algorithmically (with the largest index of a sequence of real numbers with some property). The distinctive features of this work include the following: $N_{\hat{m}, \hat{n}} (U)$ is given in terms of an explicit function of parameters of U∈SU(2); explicit optimal decompositions are presented; and this work's results on $N_{\hat{m}, \hat{n}} (U)$ imply Lowenthal's formula on $N_{\hat{m}, \hat{n}} (SO (3))$ in a consistent self-contained manner.¹

Regarding another direction of related research, we remark that $N_{\hat{m}, \hat{n}} (A)$ is known as the order of (uniform) generation of the Lie group $A$ , and this notion has been extended to other Lie groups. The interested reader is referred to relatively extensive treatments on uniform generation [6,7], where one would find that even determining the order $N_{\hat{m}, \hat{n}} (SO (3))$ needs a special proof (see [1,2] and [7, Appendix]).

Detailed elementary arguments below would help us dispel some confusions related to $N_{\hat{m}, \hat{n}} (SU (2))$ often found in textbooks on quantum computation. There, not to mention the ignorance of the fact $N_{\hat{m}, \hat{n}} (SU (2)) = ⌈ π / \arccos | {\hat{m}}^{T} \hat{n} | ⌉ + 1$ , a wrong statement equivalent to saying that $N_{\hat{m}, \hat{n}} (SU (2))$ were, at most, three, regardless of the choice of non-parallel vectors $\hat{m}$ and $\hat{n}$ , is observed.

Regarding physics, this work has been affected by the issue of constructing an arbitrary unitary operator on a Hilbert space discussed in quantum physics [8]. This is relevant to universal gates for quantum computation [9]. In this context, requiring the availability of rotations about a pair of exactly orthogonal axes seems too idealistic. For example, consider a Hamiltonian H of a quantum system represented by $C^{2}$ , and note that H determines the axis of the rotations ${[c (t)]}^{- 1} \exp (- i t H) \in SU (2)$ , $t \in R$ , where c(t) is a square root of $det \exp (- i t H)$ . (Often, although not always, differences of unitary matrices (evolutions) up to scalar multiples are ignorable.) Thus, explicit decompositions attaining the minimum $N_{\hat{m}, \hat{n}} (U)$ of an arbitrary rotation U for the generic vectors $\hat{m}$ and $\hat{n}$ will be useful. For applications to control, the reader is referred to D'Alessandro [5] and references therein.

This paper is organized as follows. After giving preliminaries in §2, the main theorem establishing $N_{\hat{m}, \hat{n}} (U)$ and explicit constructions of rotations are presented in §3. Then, inequalities that show limits on constructions are presented in §4. The proofs of the results of this work are presented in §5. Section 6 contains the conclusion. Several arguments are relegated to appendices.

3. Preliminaries and a known result

3.1. Definitions

The notation to be used includes the following: $N$ denotes the set of strictly positive integers; $S^{2} = {\hat{v} \in R^{3} ∣ ∥ \hat{v} ∥ = 1}$ , where $∥ \hat{v} ∥ = \sqrt{v_{x}^{2} + v_{y}^{2} + v_{z}^{2}}$ for $\hat{v} = {(v_{x}, v_{y}, v_{z})}^{T}$ ; ⌈x⌉ denotes the smallest integer not less than $x \in R$ . As usual, $\arccos x \in [0, π]$ and $\arcsin x \in [- π / 2, π / 2]$ for x∈[−1,1]. The Hermitian conjugate of a matrix U is denoted by U^†.

Throughout, I denotes the 2×2 identity matrix; X, Y and Z denote the following Pauli matrices:

X = (\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}), Y = (\begin{matrix} 0 & - i \\ i & 0 \end{matrix}) and Z = (\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}) .

We shall work with a matrix

R_{\hat{v}} (θ) := (\cos \frac{θ}{2}) I - i (\sin \frac{θ}{2}) (v_{x} X + v_{y} Y + v_{z} Z),

3.1

where $\hat{v} = {(v_{x}, v_{y}, v_{z})}^{T} \in S^{2}$ and $θ \in R$ . This represents the rotation about $\hat{v}$ by angle θ (through the homomorphism in §2.4). In particular, for $\hat{y} = {(0, 1, 0)}^{T}$ and $\hat{z} = {(0, 0, 1)}^{T}$ , we put

R_{y} (θ) := R_{\hat{y}} (θ) = (\begin{matrix} \cos \frac{θ}{2} & - \sin \frac{θ}{2} \\ \sin \frac{θ}{2} & \cos \frac{θ}{2} \end{matrix}) and R_{z} (θ) := R_{\hat{z}} (θ) = (\begin{matrix} e^{- i (θ / 2)} & 0 \\ 0 & e^{i (θ / 2)} \end{matrix}) .

For $\hat{m}, \hat{n} \in S^{2}$ with $| {\hat{m}}^{T} \hat{n} | < 1$ , we define the following:

N_{\hat{m}, \hat{n}} (U) := min {j \in N ∣ \exists V_{1}, \dots, V_{j} \in R_{\hat{m}} \cup R_{\hat{n}}, U = V_{1} \dots V_{j}}

3.2

for U∈SU(2), where $R_{\hat{v}} := {R_{\hat{v}} (θ) ∣ θ \in R}$ , and

N_{\hat{m}, \hat{n}} := N_{\hat{m}, \hat{n}} (SU (2)) := min {k \in N ∣ \forall U \in SU (2), N_{\hat{m}, \hat{n}} (U) \leq k} .

3.3

Using the homomorphism F from SU(2) onto SO(3) to be defined in §3.4, we put ${\hat{R}}_{\hat{v}} := {F (R_{\hat{v}} (θ)) ∣ θ \in R}$ . We extend the definition of $N_{\hat{m}, \hat{n}}$ to SO(3):

N_{\hat{m}, \hat{n}} (D) := min {j \in N ∣ \exists A_{1}, \dots, A_{j} \in {\hat{R}}_{\hat{m}} \cup {\hat{R}}_{\hat{n}}, D = A_{1} \dots A_{j}}

3.4

for D∈SO(3) and

N_{\hat{m}, \hat{n}} (SO (3)) := min {k \in N ∣ \forall D \in SO (3), N_{\hat{m}, \hat{n}} (D) \leq k} .

3.5

3.2. The maximum of the minimum number of constituent rotations over all target rotations

This work's results lead to an elementary self-contained proof of the following known theorem (appendix F).

Theorem 3.1 (Lowenthal [1,2]) —

For any $\hat{m}, \hat{n} \in S^{2}$ with $| {\hat{m}}^{T} \hat{n} | < 1,$

$N_{\hat{m}, \hat{n}} (SO (3)) = N_{\hat{m}, \hat{n}} (SU (2)) = ⌈ \frac{π}{\arccos | {\hat{m}}^{T} \hat{n} |} ⌉ + 1.$

3.3. Parametrizations of the elements in SU(2)

The following lemma presents a well-known parametrization of SU(2) elements.

Lemma 3.2 —

For any element U∈SU(2), there exist some $α, γ \in R$ and β∈[0,π] such that

$U = (\begin{matrix} e^{- i ((γ + α) / 2)} \cos \frac{β}{2} & - e^{i ((γ - α) / 2)} \sin \frac{β}{2} \\ e^{- i ((γ - α) / 2)} \sin \frac{β}{2} & e^{i ((γ + α) / 2)} \cos \frac{β}{2} \end{matrix}) = R_{z} (α) R_{y} (β) R_{z} (γ) .$ 3.6

The parameters α,β and γ in this lemma are often called Euler angles.² The lemma can be rephrased as follows: any matrix in SU(2) can be written as

(\begin{matrix} a & b \\ - b^{*} & a^{*} \end{matrix})

3.7

with some complex numbers a and b such that |a|²+|b|²=1 [3]. Hence, any matrix in SU(2) can be written as

(\begin{matrix} w + i z & y + i x \\ - y + i x & w - i z \end{matrix}) = w I + i (x X + y Y + z Z)

3.8

with some real numbers x,y,z and w such that w²+x²+y²+z²=1. Take a real number θ such that $\cos (θ / 2) = w$ and $\sin (θ / 2) = \sqrt{1 - w^{2}} = \sqrt{x^{2} + y^{2} + z^{2}}$ ; write x, y and z as $x = - v_{x} \sin (θ / 2), y = - v_{y} \sin (θ / 2)$ and $z = - v_{z} \sin (θ / 2)$ , where $v_{x}, v_{y}, v_{z} \in R$ and $v_{x}^{2} + v_{y}^{2} + v_{z}^{2} = 1$ . Thus, using real numbers $θ, v_{x}, v_{y}, v_{z} \in R$ with $v_{x}^{2} + v_{y}^{2} + v_{z}^{2} = 1$ , any matrix in SU(2) can be written as

(\cos \frac{θ}{2}) I - i (\sin \frac{θ}{2}) (v_{x} X + v_{y} Y + v_{z} Z),

which is nothing but $R_{\hat{v}} (θ)$ in (3.1).

3.4. Homomorphism from SU(2) onto SO(3)

For U∈SU(2), we denote by F(U) the matrix of the linear transformation on $R^{3}$ that sends (x,y,z)^T to (x′,y′,z′)^T through³

U (x X + y Y + z Z) U^{†} = x^{'} X + y^{'} Y + z^{'} Z .

3.9

Namely, for any ${(x, y, z)}^{T}, {(x^{'}, y^{'}, z^{'})}^{T} \in R^{3}$ with (3.9),

(\begin{matrix} x^{'} \\ y^{'} \\ z^{'} \end{matrix}) = F (U) (\begin{matrix} x \\ y \\ z \end{matrix}) .

We also define

{\hat{R}}_{\hat{v}} (θ) := F (R_{\hat{v}} (θ)) for \hat{v} \in S^{2}, θ \in R .

3.10

3.5. Generic orthogonal axes and coordinate axes

Lemma 3.2 can be generalized as follows.

Lemma 3.3 —

Let $\hat{l}, \hat{m} \in S^{2}$ be vectors with $\hat{l}^{T} \hat{m} = 0$ . Then, for any V ∈SU(2), there exist some $α, γ \in R$ and β∈[0,π] such that

$V = R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (γ) .$ 3.11

Proof —

As F is onto SO(3), there exists an element U∈SU(2) such that $\hat{l} = F (U) {(0, 1, 0)}^{T}$ and $\hat{m} = F (U) {(0, 0, 1)}^{T}$ .⁴ With this element U, some $α, γ \in R$ and some β∈[0,π], write U^†V U=R_z(α)R_y(β)R_z(γ) in terms of the parametrization (3.6). Then, since $U R_{z} (α) U^{†} = R_{\hat{m}} (α)$ , $U R_{y} (β) U^{†} = R_{\hat{l}} (β)$ and $U R_{z} (γ) U^{†} = R_{\hat{m}} (γ)$ , we obtain (3.11). ▪

We also have the following lemma, which is easy but worth recognizing.

Lemma 3.4 —

Let arbitrary $κ, ν \in N,$ ${\hat{u}}_{1}, \dots, {\hat{u}}_{κ}, {\hat{v}}_{1}, \dots, {\hat{v}}_{ν} \in S^{2}$ and U∈SU(2) be given. Put ${\hat{u}}_{1}^{'} = F (U) {\hat{u}}_{1}, \dots, {\hat{u}}_{κ}^{'} = F (U) {\hat{u}}_{κ}, {\hat{v}}_{1}^{'} = F (U) {\hat{v}}_{1}, \dots$ and ${\hat{v}}_{ν}^{'} = F (U) {\hat{v}}_{ν}$ . Then, for any $θ_{1}, \dots, θ_{κ}, ϕ_{1}, \dots ϕ_{ν} \in R,$

$R_{{\hat{u}}_{1}} (θ_{1}) \dots R_{{\hat{u}}_{κ}} (θ_{κ}) = R_{{\hat{v}}_{1}} (ϕ_{1}) \dots R_{{\hat{v}}_{ν}} (ϕ_{ν})$

if and only if (iff)

$R_{{\hat{u}}_{1}^{'}} (θ_{1}) \dots R_{{\hat{u}}_{κ}^{'}} (θ_{κ}) = R_{{\hat{v}}_{1}^{'}} (ϕ_{1}) \dots R_{{\hat{v}}_{ν}^{'}} (ϕ_{ν}) .$

Proof —

This readily follows from $U R_{{\hat{u}}_{j}} (θ_{j}) U^{†} = R_{{\hat{u}}_{j}^{'}} (θ_{j})$ and $U R_{{\hat{v}}_{j}} (ϕ_{j}) U^{†} = R_{{\hat{v}}_{j}^{'}} (ϕ_{j})$ . ▪

4. The minimum numbers of constituent rotations and optimal constructions of an arbitrary rotation

Here, we present the result establishing $N_{\hat{m}, \hat{n}} (U)$ with needed definitions.

Definition 4.1 —

For $\hat{v} \in S^{2}$ and

$U = (\begin{matrix} w + i z & y + i x \\ - y + i x & w - i z \end{matrix}) = w I + i (x X + y Y + z Z) \in SU (2),$ 4.1

where $w, x, y, z \in R$ are parameters to express U uniquely, $b (\hat{v}, U)$ is defined by

$b (\hat{v}, U) := | (x, y, z) \hat{v} | .$ 4.2

Definition 4.2 —

Functions $f : R^{3} \to [0, π]$ and $g : R^{2} \times (0, π / 2] \to N$ are defined by

$f (α, β, δ) := 2 \arccos \sqrt{\cos^{2} \frac{β}{2} \cos^{2} \frac{δ}{2} + \sin^{2} \frac{β}{2} \sin^{2} \frac{δ}{2} + 2 \cos α \sin \frac{β}{2} \sin \frac{δ}{2} \cos \frac{β}{2} \cos \frac{δ}{2}}$

and

$g (α, β, δ) := {\begin{cases} 2 ⌈ \frac{f (α, β, δ)}{2 δ} + \frac{1}{2} ⌉ & if f (α, β, δ) \geq δ \\ 4 & otherwise. \end{cases}$

Theorem 4.3 —

For any $\hat{m}, \hat{n} \in S^{2}$ with ${\hat{m}}^{T} \hat{n} \in [0, 1),$ $α, γ \in R$ and β∈[0,π], if

$b (\hat{m}, U_{α, β, γ}^{\hat{m}, \hat{l}}) \geq b (\hat{n}, U_{α, β, γ}^{\hat{m}, \hat{l}}),$

then

$N_{\hat{m}, \hat{n}} (F (U_{α, β, γ}^{\hat{m}, \hat{l}})) = N_{\hat{m}, \hat{n}} (U_{α, β, γ}^{\hat{m}, \hat{l}}) = min {2 ⌈ \frac{β}{2 δ} ⌉ + 1, g (α, β, δ), g (γ, - β, δ)},$

where $δ = \arccos {\hat{m}}^{T} \hat{n} \in (0, π / 2],$ $\hat{l} = ∥ \hat{m} \times \hat{n} ∥^{- 1} \hat{m} \times \hat{n}$ and

$U_{α, β, γ}^{\hat{m}, \hat{l}} := R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (γ) .$

Note that there is no loss of generality in assuming $b (\hat{m}, U_{α, β, γ}^{\hat{m}, \hat{l}}) \geq$ $b (\hat{n}, U_{α, β, γ}^{\hat{m}, \hat{l}})$ , but also note that α,β and γ vary, in general, if $\hat{m}$ and $\hat{n}$ are interchanged.

We give two constructions or decompositions, which will turn out to attain the minimum number $N_{\hat{m}, \hat{n}} (U_{α, β, γ}^{\hat{m}, \hat{l}})$ in the theorem.

Proposition 4.4 —

Given arbitrary $\hat{m}, \hat{n} \in S^{2}$ with ${\hat{m}}^{T} \hat{n} \in [0, 1),$ $α, γ \in R$ and β∈[0,π], put

$δ = \arccos {\hat{m}}^{T} \hat{n} \in (0, \frac{π}{2}]$ 4.3

and

$\hat{l} = ∥ \hat{m} \times \hat{n} ∥^{- 1} \hat{m} \times \hat{n} .$

Then, for any $k \in N$ and β₁,…,β_k∈(0,2δ] satisfying

$β = β_{1} + \dots + β_{k},$ 4.4

there exist some $α_{j}, γ_{j}, θ_{j} \in R$ such that

$R_{\hat{l}} (β_{j}) = R_{\hat{m}} (- α_{j}) R_{\hat{n}} (θ_{j}) R_{\hat{m}} (- γ_{j})$ 4.5

for j=1,…,k. For these parameters, it holds that

$\begin{aligned} R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (γ) \\ = R_{\hat{m}} (α - α_{1}) R_{\hat{n}} (θ_{1}) R_{\hat{m}} (- γ_{1} - α_{2}) R_{\hat{n}} (θ_{2}) R_{\hat{m}} (- γ_{2} - α_{3}) R_{\hat{n}} (θ_{3}) \dots \\ \cdot R_{\hat{m}} (- γ_{k - 1} - α_{k}) R_{\hat{n}} (θ_{k}) R_{\hat{m}} (- γ_{k} + γ) . \end{aligned}$ 4.6

Remark 4.5 —

The least value of k such that (4.4) holds for some β₁,…,β_k∈ (0,2δ] is ⌈β/(2δ)⌉.⁵ Hence, this proposition gives a decomposition of an arbitrary element $U = R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (γ) \in SU (2)$ into the product of 2⌈β/(2δ)⌉+1 rotations.⁶

Remark 4.6 —

For $β, δ \in R$ with 0≤β/2≤δ≤π/2, δ≠0, and $t \in R$ , let

$H_{t} (β, δ) := {\begin{cases} 0 & if \frac{β}{2} < δ = \frac{π}{2} \\ t & if \frac{β}{2} = δ = \frac{π}{2} \\ \arcsin \frac{\tan (β / 2)}{\tan δ} & otherwise. \end{cases}$

Then, an explicit instance of the set of parameters α_j, γ_j and θ_j for which (4.5) holds is given by (α_j,γ_j,θ_j)^T=σ_{t_j}(β_j,δ), where

$σ_{t} (β, δ) := (\begin{matrix} H_{t} (β, δ) - \frac{π}{2} \\ H_{t} (β, δ) + \frac{π}{2} \\ 2 \arcsin \frac{\sin (β / 2)}{\sin δ} \end{matrix})$ 4.7

and $t_{j} \in R$ can be chosen arbitrarily, j=1,…,k. (These make (4.6) hold.)

Proposition 4.7 —

Given any $\hat{m}, \hat{n} \in S^{2}$ with ${\hat{m}}^{T} \hat{n} \in [0, 1),$ put $δ = \arccos {\hat{m}}^{T} \hat{n} \in (0, π / 2]$ and $\hat{l} = ∥ \hat{m} \times \hat{n} ∥^{- 1} \hat{m} \times \hat{n}$ . For an arbitrary U∈SU(2), choose parameters $α^{'}, γ^{'} \in R$ and β′∈[0,π] such that

$R_{\hat{l}} (- δ) U = R_{\hat{m}} (α^{'}) R_{\hat{l}} (β^{'}) R_{\hat{m}} (γ^{'}) .$ 4.8

Then,

$U = R_{\hat{n}} (α^{'}) R_{\hat{l}} (β^{'} + δ) R_{\hat{m}} (γ^{'}) .$ 4.9

Furthermore, for any $k^{'} \in N$ and β′₁,…,β′_k′∈(0,2δ] satisfying

$β^{'} + δ = β_{1}^{'} + \dots + β_{k^{'}}^{'},$ 4.10

there exist some $α_{j}^{'}, γ_{j}^{'}, θ_{j}^{'} \in R$ such that

$R_{\hat{l}} (β_{j}^{'}) = R_{\hat{m}} (- α_{j}^{'}) R_{\hat{n}} (θ_{j}^{'}) R_{\hat{m}} (- γ_{j}^{'})$ 4.11

for j=1,…,k′. For these parameters, it holds that

$\begin{aligned} U & = R_{\hat{n}} (α^{'}) R_{\hat{m}} (- α_{1}^{'}) R_{\hat{n}} (θ_{1}^{'}) R_{\hat{m}} (- γ_{1}^{'} - α_{2}^{'}) R_{\hat{n}} (θ_{2}^{'}) R_{\hat{m}} (- γ_{2}^{'} - α_{3}^{'}) R_{\hat{n}} (θ_{3}^{'}) \dots \\ \cdot R_{\hat{m}} (- γ_{k^{'} - 1}^{'} - α_{k^{'}}^{'}) R_{\hat{n}} (θ_{k^{'}}^{'}) R_{\hat{m}} (- γ_{k^{'}}^{'} + γ^{'}) . \end{aligned}$ 4.12

Remark 4.8 —

The least value of k′ such that (4.10) holds for some β′₁,…,β′_k′∈ (0,2δ] is ⌈(β′+δ)/(2δ)⌉=⌈β′/(2δ)+1/2⌉. Moreover, if β′≥δ and k′=⌈β′/(2δ)+1/2⌉, the parameter α′₁ can be chosen so that it satisfies α′₁=0 as well as (4.11) and (4.12). Hence, when β′≥δ, this proposition and the fact just mentioned give a decomposition of an arbitrary element $U = R_{\hat{n}} (α^{'}) R_{\hat{l}} (β^{'} + δ) R_{\hat{m}} (γ^{'}) \in SU (2)$ into the product of $2 ⌈ β^{'} / (2 δ) + \frac{1}{2} ⌉$ rotations, and when β′<δ, a decomposition of U into the product of four rotations.

Remark 4.9 —

An explicit instance of the set of parameters α′_j,γ′_j and θ′_j, j=1,…,k′, for which (4.11) and (4.12) hold is given by (α′_j,γ′_j,θ′_j)^T= σ_{t_j}(β′_j,δ), where $t_{j} \in R$ can be chosen arbitrarily, j=1,…,k′.

5. Limits on constructions

In order to bound $N_{\hat{m}, \hat{n}} (D)$ , etc., from below, we use the geodesic metric on the unit sphere S², which is denoted by d. Specifically,

d (\hat{u}, \hat{v}) := \arccos {\hat{u}}^{T} \hat{v} \in [0, π]

5.1

for $\hat{u}, \hat{v} \in S^{2}$ . This is the length of the geodesic connecting $\hat{u}$ and $\hat{v}$ on S². We have the following lemma. (Recall we have put ${\hat{R}}_{\hat{v}} (θ) = F (R_{\hat{v}} (θ))$ .)

Lemma 5.1 —

Let $\hat{n}, \hat{m}$ be arbitrary vectors in S² with $δ = d (\hat{m}, \hat{n}) = \arccos {\hat{m}}^{T} \hat{n} \in (0, π]$ . Then, for any $k \in N$ and $ϕ_{1}, \dots, ϕ_{2 k} \in R,$ the following inequalities hold:

$\begin{aligned} d ({\hat{R}}_{\hat{m}} (ϕ_{2 k - 1}) {\hat{R}}_{\hat{n}} (ϕ_{2 k - 2}) \dots {\hat{R}}_{\hat{m}} (ϕ_{3}) {\hat{R}}_{\hat{n}} (ϕ_{2}) {\hat{R}}_{\hat{m}} (ϕ_{1}) \hat{m}, \hat{m}) \leq 2 (k - 1) δ, \end{aligned}$ 5.2

$\begin{aligned} d ({\hat{R}}_{\hat{m}} (ϕ_{2 k - 1}) {\hat{R}}_{\hat{n}} (ϕ_{2 k - 2}) \dots {\hat{R}}_{\hat{m}} (ϕ_{3}) {\hat{R}}_{\hat{n}} (ϕ_{2}) {\hat{R}}_{\hat{m}} (ϕ_{1}) \hat{m}, \hat{n}) \leq (2 k - 1) δ, \end{aligned}$ 5.3

$\begin{aligned} d ({\hat{R}}_{\hat{n}} (ϕ_{2 k}) {\hat{R}}_{\hat{m}} (ϕ_{2 k - 1}) \dots {\hat{R}}_{\hat{m}} (ϕ_{3}) {\hat{R}}_{\hat{n}} (ϕ_{2}) {\hat{R}}_{\hat{m}} (ϕ_{1}) \hat{m}, \hat{n}) \leq (2 k - 1) δ \end{aligned}$ 5.4

$\begin{aligned} and d ({\hat{R}}_{\hat{n}} (ϕ_{2 k}) {\hat{R}}_{\hat{m}} (ϕ_{2 k - 1}) \dots {\hat{R}}_{\hat{m}} (ϕ_{3}) {\hat{R}}_{\hat{n}} (ϕ_{2}) {\hat{R}}_{\hat{m}} (ϕ_{1}) \hat{m}, \hat{m}) \leq 2 k δ . \end{aligned}$ 5.5

This can be shown easily by induction on k using the triangle inequality for d. In what follows, (5.2) and (5.4) will be used in the following forms:

2 ⌈ \frac{d (D \hat{m}, \hat{m})}{2 δ} ⌉ + 1 \leq 2 k - 1 and 2 ⌈ \frac{d (D^{'} \hat{m}, \hat{n})}{2 δ} + \frac{1}{2} ⌉ \leq 2 k .

5.6

These bounds hold when D and D′∈SO(3) equal the product of 2k−1 rotations and that of 2k rotations, respectively, in lemma 5.1 (since k is an integer). It will turn out that these bounds are tight.

6. Proof of the results

6.1. Structure of the proof

Here, the structure of the whole proof of the results in this work is described. Theorem 4.3 is obtained as a consequence of lemma 6.2 to be presented. The constructive half of lemma 6.2 is due to propositions 4.4 and 4.7. The other half of lemma 6.2, related to limits on constructions, is due to lemma 5.1. Theorem 3.1 is derived from theorem 4.3 in appendix F.

6.2. Proof of propositions 4.4 and 4.7

The following lemma is fundamental to the results in this work.

Lemma 6.1 —

For any $β, θ \in R$ and for any $\hat{u}, \hat{l}, \hat{m} \in S^{2}$ such that $\hat{l}^{T} \hat{m} = 0,$ the following two conditions are equivalent.

I. There exist some $α, γ \in R$ such that
$R_{\hat{u}} (θ) = R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (γ) .$ 6.1

II. $\sqrt{1 - {({\hat{m}}^{T} \hat{u})}^{2}} | \sin (θ / 2) | = | \sin (β / 2) |$ .

Proof —

(1) Take an element U∈SU(2) such that

$\hat{l} = F (U) {(0, 1, 0)}^{T} and \hat{m} = F (U) {(0, 0, 1)}^{T},$ 6.2

and put $\hat{v} = {(v_{x}, v_{y}, v_{z})}^{T}$ for the parameters v_x,v_y and v_z such that

$\hat{u} = v_{x} \hat{l} \times \hat{m} + v_{y} \hat{l} + v_{z} \hat{m} .$ 6.3

Then, owing to lemma 3.4, (6.1) holds iff

$R_{\hat{v}} (θ) = R_{z} (α) R_{y} (β) R_{z} (γ) .$ 6.4

(2) A direct calculation shows

$\begin{aligned} R_{z} (α) R_{y} (β) R_{z} (γ) & = \cos \frac{β}{2} \cos \frac{γ + α}{2} I - i \sin \frac{β}{2} \sin \frac{γ - α}{2} X \\ - i \sin \frac{β}{2} \cos \frac{γ - α}{2} Y - i \cos \frac{β}{2} \sin \frac{γ + α}{2} Z . \end{aligned}$ 6.5

Hence, (6.4) is equivalent to

$\begin{aligned} \cos \frac{θ}{2} & = \cos \frac{β}{2} \cos \frac{γ + α}{2}, \end{aligned}$ 6.6

$\begin{aligned} v_{x} \sin \frac{θ}{2} & = \sin \frac{β}{2} \sin \frac{γ - α}{2}, \end{aligned}$ 6.7

$\begin{aligned} v_{y} \sin \frac{θ}{2} & = \sin \frac{β}{2} \cos \frac{γ - α}{2} \end{aligned}$ 6.8

$\begin{aligned} and v_{z} \sin \frac{θ}{2} & = \cos \frac{β}{2} \sin \frac{γ + α}{2} . \end{aligned}$ 6.9

(3) We shall prove I ⇒ II. On each side of (6.7) and (6.8), squaring and summing the resultant pair, we have

$\sqrt{1 - v_{z}^{2}} | \sin \frac{θ}{2} | = | \sin \frac{β}{2} | .$ 6.10

(Equations (6.6) and (6.9) also imply (6.10) similarly.) But (6.10) implies II in view of (6.3).

(4) Next, we shall prove II ⇒ I.

Transforming (α,β) into (η,ζ), where the two pairs are related by

$η = \frac{γ + α}{2} and ζ = \frac{γ - α}{2},$ 6.11

we see, from paragraphs (1) and (2), that I is equivalent to the following condition: There exist some $η, ζ \in R$ such that

$\begin{aligned} \cos \frac{θ}{2} & = \cos \frac{β}{2} \cos η, \end{aligned}$ 6.12

$\begin{aligned} v_{x} \sin \frac{θ}{2} & = \sin \frac{β}{2} \sin ζ, \end{aligned}$ 6.13

$\begin{aligned} v_{y} \sin \frac{θ}{2} & = \sin \frac{β}{2} \cos ζ \end{aligned}$ 6.14

$\begin{aligned} and v_{z} \sin \frac{θ}{2} & = \cos \frac{β}{2} \sin η . \end{aligned}$ 6.15

Hence, it is enough to show that II implies the existence of some $η, ζ \in R$ satisfying (6.12)–(6.15).

Now suppose $\cos (β / 2) \neq 0$ . Then, if we show

$\frac{\cos^{2} (θ / 2)}{\cos^{2} (β / 2)} + \frac{v_{z}^{2} \sin^{2} (θ / 2)}{\cos^{2} (β / 2)} = 1,$ 6.16

it will immediately imply the existence of η satisfying (6.12) and (6.15). From II, however, we have (6.10), and hence, $(1 - v_{z}^{2}) \sin^{2} (θ / 2) = \sin^{2} (β / 2)$ , i.e. $1 - (1 - v_{z}^{2}) \sin^{2} (θ / 2) = \cos^{2} (β / 2)$ , which is equivalent to (6.16) by the assumption $\cos (β / 2) \neq 0$ . If $\cos (β / 2) = 0$ , then $| \sin (β / 2) | = 1$ . This and (6.10) imply $1 - v_{z}^{2} = | \sin (θ / 2) | = 1$ , and hence, $v_{z} = \cos (θ / 2) = 0$ . Then, (6.12) and (6.15) hold for any choice of η.

In a similar way, if $\sin (β / 2) \neq 0$ ,

$\frac{v_{x}^{2} \sin^{2} (θ / 2)}{\sin^{2} (β / 2)} + \frac{v_{y}^{2} \sin^{2} (θ / 2)}{\sin^{2} (β / 2)} = 1$ 6.17

will immediately imply the existence of ζ satisfying (6.13) and (6.14). But (6.17) follows again from II or (6.10) since $1 - v_{z}^{2} = v_{x}^{2} + v_{y}^{2}$ . If $\sin (β / 2) = 0$ , both (6.13) and (6.14) hold for any choice of ζ similarly. ▪

Proof of proposition 4.4 —

Choose a parameter θ_j such that $| \sin (θ_{j} / 2) | = \sin (β_{j} / 2) / \sin δ$ , which is possible by the assumption β_j∈(0,2δ]; then, it follows from lemma 6.1 that there exist some $α_{j}, γ_{j} \in R$ such that (4.5), i.e. $R_{\hat{l}} (β_{j}) = R_{\hat{m}} (- α_{j}) R_{\hat{n}} (θ_{j}) R_{\hat{m}} (- γ_{j})$ holds, j=1,…,k. Inserting these into

$R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (γ) = R_{\hat{m}} (α) R_{\hat{l}} (β_{1}) \dots R_{\hat{l}} (β_{k}) R_{\hat{m}} (γ),$

we obtain (4.6). ▪

Proof of proposition 4.7 —

Note $R_{\hat{l}} (δ) R_{\hat{m}} (α^{'}) R_{\hat{l}} (- δ) = R_{\hat{n}} (α^{'})$ , which is equivalent to R_y(δ)R_z(α′)R_y(−δ)=R_v(α′), where $\hat{v} = {(\sin δ, 0, \cos δ)}^{T}$ , by lemma 3.4 (figure 1) and therefore, can be checked easily by a direct calculation. Using this equation, we can rewrite (4.8) as $U = R_{\hat{n}} (α^{'}) R_{\hat{l}} (β^{'} + δ) R_{\hat{m}} (γ^{'})$ , which is (4.9). Then, applying to $R_{\hat{l}} (β^{'} + δ) R_{\hat{m}} (γ^{'})$ , the decomposition in proposition 4.4 with (α,β,γ) replaced by (0,β′+δ,γ′), it readily follows that there exist some α′_j,γ′_j and $θ_{j}^{'} \in R$ , j=1,…,k′, that satisfy the following: $| \sin (θ_{j}^{'} / 2) | = \sin (β_{j}^{'} / 2) / \sin δ$ and (4.11) for j=1,…,k′, and

$\begin{aligned} R_{\hat{l}} (β^{'} + δ) R_{\hat{m}} (γ^{'}) \\ = R_{\hat{m}} (- α_{1}^{'}) R_{\hat{n}} (θ_{1}^{'}) R_{\hat{m}} (- γ_{1}^{'} - α_{2}^{'}) R_{\hat{n}} (θ_{2}^{'}) R_{\hat{m}} (- γ_{2}^{'} - α_{3}^{'}) R_{\hat{n}} (θ_{3}^{'}) \dots \\ \cdot R_{\hat{m}} (- γ_{k^{'} - 1}^{'} - α_{k^{'}}^{'}) R_{\hat{n}} (θ_{k^{'}}^{'}) R_{\hat{m}} (- γ_{k^{'}}^{'} + γ^{'}) . \end{aligned}$ 6.18

Thus, we obtain the proposition. ▪

Figure 1. — Configuration of $\hat{l}, \hat{m}$ and $\hat{n}$ in propositions 4.4 and 4.7, and configuration of $\hat{y} = {(0, 1, 0)}^{T}$ , $\hat{z} = {(0, 0, 1)}^{T}$ and $\hat{v}$ in arguments around these propositions.

Remarks 4.6 and 4.9 to these propositions are proved in appendix B. The statement on α′₁ in remark 4.8 follows from remark 4.9 (put β′₁=2δ and t₁=π/2) or, more directly, from an equation $R_{\hat{l}} (2 δ) = R_{\hat{n}} (π) R_{\hat{m}} (- π)$ , which is equivalent to R_y(2δ)=R_v(π)R_z(−π), where $\hat{v} = {(\sin δ, 0, \cos δ)}^{T}$ , by lemma 3.4.

6.3. Proof of theorem 4.3

Let $2 N - 1$ and $2 N$ denote the set of odd numbers in $N$ and that of even numbers in $N$ , respectively. We define the following for $\hat{m}, \hat{n} \in S^{2}$ with $| {\hat{m}}^{T} \hat{n} | < 1$ :

\begin{aligned} M_{\hat{m}, \hat{n}}^{odd} (U) := min {j \in 2 N - 1 ∣ & \exists V_{1}, V_{3}, \dots, V_{j} \in R_{\hat{m}}, \\ \exists V_{2}, V_{4}, \dots, V_{j - 1} \in R_{\hat{n}}, U = V_{j} V_{j - 1} \dots V_{1}}, \\ M_{\hat{m}, \hat{n}}^{even} (U) := min {j \in 2 N ∣ & \exists V_{1}, V_{3}, \dots, V_{j - 1} \in R_{\hat{m}}, \\ \exists V_{2}, V_{4}, \dots, V_{j} \in R_{\hat{n}}, U = V_{j} V_{j - 1} \dots V_{1}} \\ and M_{\hat{m}, \hat{n}} (U) & := min {M_{\hat{m}, \hat{n}}^{odd} (U), M_{\hat{m}, \hat{n}}^{even} (U)} \end{aligned}

for U∈SU(2);

\begin{aligned} M_{\hat{m}, \hat{n}}^{odd} (D) := min {j \in 2 N - 1 ∣ & \exists A_{1}, A_{3}, \dots, A_{j} \in {\hat{R}}_{\hat{m}}, \\ \exists A_{2}, A_{4}, \dots, A_{j - 1} \in {\hat{R}}_{\hat{n}}, D = A_{j} A_{j - 1} \dots A_{1}}, \\ M_{\hat{m}, \hat{n}}^{even} (D) := min {j \in 2 N ∣ & \exists A_{1}, A_{3}, \dots, A_{j - 1} \in {\hat{R}}_{\hat{m}}, \\ \exists A_{2}, A_{4}, \dots, A_{j} \in {\hat{R}}_{\hat{n}}, D = A_{j} A_{j - 1} \dots A_{1}}, \\ and M_{\hat{m}, \hat{n}} (D) & := min {M_{\hat{m}, \hat{n}}^{odd} (D), M_{\hat{m}, \hat{n}}^{even} (D)} \end{aligned}

for D∈SO(3). The following lemma largely solves the issue of determining the optimal number $N_{\hat{m}, \hat{n}} (U)$ .

Lemma 6.2 —

Let $\hat{m}, \hat{n},$ $\hat{l}$ and δ be as in theorem 4.3. Then, for any $α, γ \in R$ and β∈[0,π],

$M_{\hat{m}, \hat{n}}^{odd} (F (U_{α, β, γ}^{\hat{m}, \hat{l}})) = M_{\hat{m}, \hat{n}}^{odd} (U_{α, β, γ}^{\hat{m}, \hat{l}}) = 2 ⌈ \frac{β}{2 δ} ⌉ + 1$ 6.19

and

$M_{\hat{m}, \hat{n}}^{even} (F (U_{α, β, γ}^{\hat{m}, \hat{l}})) = M_{\hat{m}, \hat{n}}^{even} (U_{α, β, γ}^{\hat{m}, \hat{l}}) = g (α, β, δ),$ 6.20

where $U_{α, β, γ}^{\hat{m}, \hat{l}}$ is as defined in theorem 4.3.

Corollary 6.3 —

Let $\hat{m}, \hat{n},$ $\hat{l}$ and δ be as in theorem 4.3. Then, for any $α, γ \in R$ and β∈[0,π],

$M_{\hat{m}, \hat{n}} (F (U_{α, β, γ}^{\hat{m}, \hat{l}})) = M_{\hat{m}, \hat{n}} (U_{α, β, γ}^{\hat{m}, \hat{l}}) = min {2 ⌈ \frac{β}{2 δ} ⌉ + 1, g (α, β, δ)} .$ 6.21

Proof —

In the case where β=0, since $M_{\hat{m}, \hat{n}}^{odd} (U_{α, β, γ}^{\hat{m}, \hat{l}}) = 1$ and $M_{\hat{m}, \hat{n}}^{even} (U_{α, β, γ}^{\hat{m}, \hat{l}}) = 2$ , (6.19) and (6.20) are trivially true. We shall prove the statement for β>0.

To establish (6.19), we shall show the first and third inequalities in

$2 ⌈ \frac{β}{2 δ} ⌉ + 1 \leq M_{\hat{m}, \hat{n}}^{odd} (F (U_{α, β, γ}^{\hat{m}, \hat{l}})) \leq M_{\hat{m}, \hat{n}}^{odd} (U_{α, β, γ}^{\hat{m}, \hat{l}}) \leq 2 ⌈ \frac{β}{2 δ} ⌉ + 1$ 6.22

while the second inequality trivially follows from the definition of $M_{\hat{m}, \hat{n}}^{odd}$ .

Note first that remark 4.5 to proposition 4.4 immediately implies the third inequality in (6.22). To prove the first inequality, assume

$F (U_{α, β, γ}^{\hat{m}, \hat{l}}) = A_{j} A_{j - 1} \dots A_{1}$ 6.23

for some j=2k−1 with $k \in N$ , where $A_{ν} \in {\hat{R}}_{\hat{m}}$ if ν is odd and $A_{ν} \in {\hat{R}}_{\hat{n}}$ otherwise.

We shall evaluate $d (F (U_{α, β, γ}^{\hat{m}, \hat{l}}) \hat{m}, \hat{m}) = d (A_{j} A_{j - 1} \dots A_{1} \hat{m}, \hat{m})$ . Noting that $d (F (U_{α, β, γ}^{\hat{m}, \hat{l}}) \hat{m}, \hat{m}) = β$ , we have β≤2(k−1)δ by (5.2) of lemma 4.1. This implies ⌈β/(2δ)⌉≤k−1, and therefore,

$2 ⌈ \frac{β}{2 δ} ⌉ + 1 \leq 2 k - 1 = j .$ 6.24

From this bound, we have the first inequality in (6.22), and hence (6.19).

To establish (6.20), we shall first treat the major case where f(α,β,δ)≥δ. Recalling that $g (α, β, δ) = 2 ⌈ f (α, β, δ) / (2 δ) + \frac{1}{2} ⌉$ in this case, we shall show the first and third inequalities in

$2 ⌈ \frac{f (α, β, δ)}{2 δ} + \frac{1}{2} ⌉ \leq M_{\hat{m}, \hat{n}}^{even} (F (U_{α, β, γ}^{\hat{m}, \hat{l}})) \leq M_{\hat{m}, \hat{n}}^{even} (U_{α, β, γ}^{\hat{m}, \hat{l}}) \leq 2 ⌈ \frac{f (α, β, δ)}{2 δ} + \frac{1}{2} ⌉$ 6.25

while the second inequality holds trivially.

Note that remark 4.8 to proposition 4.7 will imply the third inequality upon showing that β′ in proposition 4.7 satisfies β′=f(α,β,δ) when $U = U_{α, β, γ}^{\hat{m}, \hat{l}}$ . To see β′=f(α,β,δ), rewrite (4.8), using lemma 3.4, as

$R_{y} (- δ) R_{z} (α) R_{y} (β) R_{z} (γ) = R_{z} (α^{'}) R_{y} (β^{'}) R_{z} (γ^{'}) .$ 6.26

Then, a direct calculation shows the absolute value of the (1,1)-entry of the left-hand side equals

$\sqrt{\cos^{2} \frac{β}{2} \cos^{2} \frac{δ}{2} + \sin^{2} \frac{β}{2} \sin^{2} \frac{δ}{2} + 2 \cos α \sin \frac{β}{2} \sin \frac{δ}{2} \cos \frac{β}{2} \cos \frac{δ}{2}} .$

This shows β′=f(α,β,δ) in view of (3.6).

To prove the first inequality in (6.25), assume (6.23) holds for some j=2k with $k \in N$ , where $A_{ν} \in {\hat{R}}_{\hat{m}}$ if ν is odd and $A_{ν} \in {\hat{R}}_{\hat{n}}$ otherwise. Note that $\hat{n} = {\hat{R}}_{\hat{l}} (δ) \hat{m}$ and hence, for $U = R_{\hat{n}} (α^{'}) R_{\hat{l}} (β^{'} + δ) R_{\hat{m}} (γ^{'})$ in proposition 3.7,

$d (F (U) \hat{m}, \hat{n}) = d ({\hat{R}}_{\hat{l}} (β^{'} + δ) \hat{m}, \hat{n}) = d ({\hat{R}}_{\hat{l}} (β^{'} + δ) \hat{m}, {\hat{R}}_{\hat{l}} (δ) \hat{m}) = (β^{'} + δ) - δ = β^{'} .$

Then, we have β′≤(2k−1)δ by (5.4) of lemma 5.1. This implies ⌈(β′+δ)/(2δ)⌉≤k, and, therefore,

$2 ⌈ \frac{β^{'} + δ}{2 δ} ⌉ \leq 2 k = j .$ 6.27

From this bound, we have the first inequality in (6.25) and, hence, the equality among all sides of (6.25). This shows (6.20) in the case where f(α,β,δ)≥δ. The proof of (6.20) in the other case is given in appendix C. This completes the proof of the lemma. The proved lemma immediately implies the corollary. ▪

Proof of theorem 4.3 —

Note that for any U∈SU(2),

$N_{\hat{m}, \hat{n}} (U) = min {M_{\hat{m}, \hat{n}}^{odd} (U), M_{\hat{m}, \hat{n}}^{even} (U), M_{\hat{n}, \hat{m}}^{odd} (U), M_{\hat{n}, \hat{m}}^{even} (U)},$

and we can write U in terms of three parametric expressions:

$U = R_{\hat{u}} (θ) = U_{α, β, γ}^{\hat{m}, \hat{l}} = U_{\tilde{α}, \tilde{β}, \tilde{γ}}^{\hat{n}, - \hat{l}},$

where $β, \tilde{β} \in [0, π]$ , $α, γ, \tilde{α}, \tilde{γ}, θ \in R$ and $\hat{u} \in S^{2}$ . Then, we have

$\frac{β}{2} = \arcsin [\sqrt{1 - {({\hat{m}}^{T} \hat{u})}^{2}} | \sin \frac{θ}{2} |] and \frac{\tilde{β}}{2} = \arcsin [\sqrt{1 - {({\hat{n}}^{T} \hat{u})}^{2}} | \sin \frac{θ}{2} |]$

owing to lemma 5.1, and, hence,

$M_{\hat{m}, \hat{n}}^{odd} (U) = 2 ⌈ \frac{\arcsin \sqrt{1 - {({\hat{m}}^{T} \hat{u})}^{2}} | \sin (θ / 2) |}{δ} ⌉ + 1$

and

$M_{\hat{n}, \hat{m}}^{odd} (U) = 2 ⌈ \frac{\arcsin \sqrt{1 - {({\hat{n}}^{T} \hat{u})}^{2}} | \sin (θ / 2) |}{δ} ⌉ + 1$

owing to lemma 5.2. Then, if $| {\hat{m}}^{T} \hat{u} | \geq | {\hat{n}}^{T} \hat{u} |$ whenever $\sin (θ / 2) \neq 0$ , which implies $M_{\hat{m}, \hat{n}}^{odd} (U) \leq M_{\hat{n}, \hat{m}}^{odd} (U)$ , we shall have

$\begin{aligned} N_{\hat{m}, \hat{n}} (U) & = min {M_{\hat{m}, \hat{n}}^{odd} (U), M_{\hat{m}, \hat{n}}^{even} (U), M_{\hat{n}, \hat{m}}^{even} (U)} \\ = min {2 ⌈ \frac{β}{2 δ} ⌉ + 1, g (α, β, δ), M_{\hat{n}, \hat{m}}^{even} (U)} \end{aligned}$ 6.28

for $U = U_{α, β, γ}^{\hat{m}, \hat{l}}$ . But $[\sin (θ / 2) \neq 0 \to | {\hat{m}}^{T} \hat{u} | \geq | {\hat{n}}^{T} \hat{u} |]$ follows from $b (\hat{m}, U_{α, β, γ}^{\hat{m}, \hat{l}}) \geq b (\hat{n}, U_{α, β, γ}^{\hat{m}, \hat{l}})$ by the definition of b. (This is because writing U in (4.1) as $U = R_{\hat{u}} (θ)$ , $θ \in R$ , $\hat{u} \in S^{2}$ , results in $- \sin (θ / 2) \hat{u} = {(x, y, z)}^{T}$ as in §2.3, whereby $b (\hat{v}, U) = | \sin (θ / 2) | | {\hat{u}}^{T} \hat{v} |$ .) Hence, we have (6.28).

A short additional argument (appendix D) shows

$M_{\hat{n}, \hat{m}}^{even} (U_{α, β, γ}^{\hat{m}, \hat{l}}) = g (γ, - β, δ),$ 6.29

and, therefore,

$N_{\hat{m}, \hat{n}} (U_{α, β, γ}^{\hat{m}, \hat{l}}) = min {2 ⌈ \frac{β}{2 δ} ⌉ + 1, g (α, β, δ), g (γ, - β, δ)} .$

Finally, from corollary 6.3 or from the argument in appendix E, it readily follows that $N_{\hat{m}, \hat{n}} (F (U_{α, β, γ}^{\hat{m}, \hat{l}})) = N_{\hat{m}, \hat{n}} (U_{α, β, γ}^{\hat{m}, \hat{l}})$ . Hence, we obtain the theorem. ▪

From the viewpoint of construction, we summarize the (most directly) suggested way to obtain an optimal construction of a given element U∈SU(2), where we assume $δ = \arccos {\hat{m}}^{T} \hat{n} \in (0, π / 2]$ without loss of generality. If $b (\hat{m}, U)$ $\geq b (\hat{n}, U)$ , choose a construction that attains the minimum in (6.28). The construction is among that of proposition 4.4, that of proposition 4.7 and that of proposition 4.7 applied to U^† in place of U [note $U^{†} = R_{{\hat{u}}_{1}} (ϕ_{1}) \dots R_{{\hat{u}}_{j}} (ϕ_{j})$ implies $U = R_{{\hat{u}}_{j}} (- ϕ_{j}) \dots R_{{\hat{u}}_{1}} (- ϕ_{1})$ ]. If $b (\hat{m}, U) < b (\hat{n}, U)$ , interchanging $\hat{m}$ and $\hat{n}$ , apply the construction just described.⁷ See appendix G for a detailed description of the above construction method.

7. Conclusion

This work has established the least value $N_{\hat{m}, \hat{n}} (U)$ of a positive integer k such that U can be decomposed into the product of k rotations about either $\hat{m}$ or $\hat{n}$ for an arbitrarily fixed element U in SU(2), or in SO(3), where $\hat{m}, \hat{n} \in S^{2}$ are arbitrary real unit vectors with $| {\hat{m}}^{T} \hat{n} | < 1$ . Decompositions of U attaining the minimum number $N_{\hat{m}, \hat{n}} (U)$ have also been given explicitly.

8. Comments on Brezov et al. [10–12]

In this paper, an algorithm for solving the following unusual optimization problem was presented:

\begin{aligned} minimize & length (τ_{1}, \dots, τ_{ν}, {\hat{m}}_{1}, \dots, {\hat{m}}_{ν}) \\ subject to & R_{{\hat{m}}_{1}} (τ_{1}) R_{{\hat{m}}_{2}} (τ_{2}) \dots R_{{\hat{m}}_{ν}} (τ_{ν}) = U, \\ ν \in N; τ_{j} \in R, {\hat{m}}_{j} \in A for j = 1, \dots, ν \end{aligned}

where $length (τ_{1}, \dots, τ_{ν}, {\hat{m}}_{1}, \dots, {\hat{m}}_{ν}) := ν$ , U is an arbitrary fixed rotation and A⊂S² with |A|=2 (the minimum of ‘length’, the primary part of an optimal solution, has been denoted by $N_{\hat{m}, \hat{n}} (U)$ ). To this author's knowledge, only the work by D'Alessandro [5] and this paper have discussed this optimization problem.

Naturally, the present author could not find any (explicit or implicit) indication that Brezov et al. [10–12] suggest considering the quantity $N_{\hat{m}, \hat{n}} (U)$ or analogues. A difference in background between this paper and Brezov et al. [10–12] may be understood as follows. While the situation assumed in this paper is that only two axes are available in constructing an arbitrary rotation, assuming a different situation results in problem formulations different from ours. For example, in Leite [7, Lemma 4.2] (attributed to Davenport), a situation where three axes are available but the number of factors in a decomposition is limited to three or less (in words, an equation $R_{{\hat{m}}_{1}} (τ_{1}) R_{{\hat{m}}_{2}} (τ_{2}) R_{{\hat{m}}_{3}} (τ_{3}) = U$ , i.e. the above equation with ν=3) is considered. In the series of Brezov et al. [10–12], they investigated such decompositions of the Davenport type, seemingly with emphasis on physical aspects. Note that $N_{\hat{m}, \hat{n}} (SU (2)) = max_{U} N_{\hat{m}, \hat{n}} (U) = ⌈ π / \arccos | {\hat{m}}^{T} \hat{n} | ⌉ + 1$ , $\hat{m} \neq \pm \hat{n}$ , is greater than three except in the classical case, where $\hat{m}$ and $\hat{n}$ are orthogonal to each other.

Despite such differences in essence and background, note in the proof of this paper's formula (6.20) for the minimum even number of factors in lemma 6.2, on which the main theorem (theorem 4.3) relies, the case where the minimum even number is 2 or 4 needs an exceptional treatment (appendix C). This exceptionality would motivate one to read treatments on decompositions into two factors, and such can be found in Brezov et al. [10–12].

Appendix A. Element in SU(2) associated with $\hat{l}$ and $\hat{m}$

Our goal here is to prove (in a constructive manner) that for any pair of vectors $\hat{l}, \hat{m} \in S^{2}$ with $\hat{l}^{T} \hat{m} = 0$ , there exists some element U∈SU(2) such that $\hat{l} = F (U) {(0, 1, 0)}^{T}$ and $\hat{m} = F (U) {(0, 0, 1)}^{T}$ . Expressing U as $U = R_{z} (\tilde{α}) R_{y} (\tilde{β}) R_{z} (\tilde{γ})$ , we shall specify desired $\tilde{α}, \tilde{β}$ and $\tilde{γ}$ . By a direct calculation with

{\hat{R}}_{y} (θ) = (\begin{matrix} \cos θ & 0 & \sin θ \\ 0 & 1 & 0 \\ - \sin θ & 0 & \cos θ \end{matrix}) and {\hat{R}}_{z} (θ) = (\begin{matrix} \cos θ & - \sin θ & 0 \\ \sin θ & \cos θ & 0 \\ 0 & 0 & 1 \end{matrix}),

where ${\hat{R}}_{y} (θ) := F (R_{y} (θ))$ and ${\hat{R}}_{z} (θ) := F (R_{z} (θ))$ , we have $F (U) {(0, 0, 1)}^{T} = {(\cos \tilde{α} \sin \tilde{β}, \sin \tilde{α} \sin \tilde{β}, \cos \tilde{β})}^{T}$ . On the other hand, the condition $\hat{l} = F (U) {(0, 1, 0)}^{T}$ is equivalent to ${\hat{R}}_{y} (- \tilde{β}) {\hat{R}}_{z} (- \tilde{α}) \hat{l} = {\hat{R}}_{z} (\tilde{γ}) {(0, 1, 0)}^{T}$ , i.e.

(\begin{matrix} \cos \tilde{β} \cos \tilde{α} & \cos \tilde{β} \sin \tilde{α} & - \sin \tilde{β} \\ - \sin \tilde{α} & \cos \tilde{α} & 0 \\ \cos \tilde{α} \sin \tilde{β} & \sin \tilde{α} \sin \tilde{β} & \cos \tilde{β} \end{matrix}) \hat{l} = (\begin{matrix} - \sin \tilde{γ} \\ \cos \tilde{γ} \\ 0 \end{matrix}) .

A 1

Hence, choosing parameters $\tilde{α}$ and $\tilde{β}$ such that ${(\cos \tilde{α} \sin \tilde{β}, \sin \tilde{α} \sin \tilde{β}, \cos \tilde{β})}^{T} = \hat{m}$ , cf. spherical coordinates, and $\tilde{γ}$ that satisfies (A.1), we have a desired element $U = R_{z} (\tilde{α}) R_{y} (\tilde{β}) R_{z} (\tilde{γ})$ such that $\hat{l} = F (U) {(0, 1, 0)}^{T}$ and $\hat{m} = F (U) {(0, 0, 1)}^{T}$ .

Appendix B. Details on angles in propositions 4.4 and 4.7

Examining the proof of lemma 6.1, we can be specific about α and γ to have the following lemma and corollary. In particular, the corollary gives a sufficient condition, (i), and two necessary conditions, (ii) and (iii), for $R_{\hat{n}} (θ) = R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (γ)$ , where $\hat{l}, \hat{m}$ and $\hat{n}$ are set as in propositions 4.4 and 4.7. Remarks 4.6 and 4.9 will be clear from (i). Later, (ii) and (iii) will be used in appendices C and D, respectively, though the use of them is not mandatory.

Lemma B.1 —

For any $θ, α, β, γ \in R,$ and $\hat{n}, \hat{l}, \hat{m} \in S^{2}$ such that $\hat{l}^{T} \hat{m} = 0,$

$R_{\hat{n}} (θ) = R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (γ)$ B 1

holds iff the following conditions hold:

$\cos \frac{γ + α}{2} = \frac{\cos (θ / 2)}{\cos (β / 2)} and \sin \frac{γ + α}{2} = \frac{{\hat{m}}^{T} \hat{n} \sin (θ / 2)}{\cos (β / 2)}$ B 2

whenever $\cos (β / 2) \neq 0,$

$\sin \frac{γ - α}{2} = \frac{{(\hat{l} \times \hat{m})}^{T} \hat{n} \sin (θ / 2)}{\sin (β / 2)} and \cos \frac{γ - α}{2} = \frac{\hat{l}^{T} \hat{n} \sin (θ / 2)}{\sin (β / 2)}$ B 3

whenever $\sin (β / 2) \neq 0,$ and

$\sqrt{1 - {({\hat{m}}^{T} \hat{n})}^{2}} | \sin \frac{θ}{2} | = | \sin \frac{β}{2} | .$ B 4

Corollary B.2 —

Given any δ∈(0,π/2] and $\hat{l}, \hat{m} \in S^{2}$ such that $\hat{l}^{T} \hat{m} = 0,$ put

$\hat{n} = (\sin δ) \hat{l} \times \hat{m} + (\cos δ) \hat{m} .$ B 5

Then, (i) for any $θ, α, γ \in R$ and β∈[0,π], (B 1) holds if

$β \leq 2 δ$

and there exists some $t \in R$ such that (recall H_t is defined in remark 4.6)

$(\begin{matrix} α \\ γ \\ θ \end{matrix}) = \pm (\begin{matrix} H_{t} (β, δ) - \frac{π}{2} \\ H_{t} (β, δ) + \frac{π}{2} \\ 2 \arcsin \frac{\sin (β / 2)}{\sin δ} \end{matrix}) or (\begin{matrix} α \\ γ \\ θ \end{matrix}) = \pm (\begin{matrix} - H_{t} (β, δ) + \frac{π}{2} \\ - H_{t} (β, δ) + \frac{3 π}{2} \\ 2 π - 2 \arcsin \frac{\sin (β / 2)}{\sin δ} \end{matrix});$ B 6

(ii) for any $α \in R$ and β∈(0,π], if (B 1) holds for some $θ, γ \in R,$ then β≤2δ and there exist some $j \in Z$ and $t \in R$ such that⁸

$α = \pm H_{t} (β, δ) \pm \frac{π}{2} + π j;$

(iii) for any $γ \in R$ and β∈(0,π], if (B 1) holds for some $θ, α \in R,$ then β≤2δ and there exist some $j \in Z$ and $t \in R$ such that

$γ = \pm H_{t} (β, δ) \pm \frac{π}{2} + π j .$

Proof —

Set $\hat{v} = {(v_{x}, v_{y}, v_{z})}^{T}$ with

$v_{x} = {(\hat{l} \times \hat{m})}^{T} \hat{n}, v_{y} = \hat{l}^{T} \hat{n} and v_{z} = {\hat{m}}^{T} \hat{n} .$

Then, according to paragraphs (1) and (2) in the proof of lemma 6.1, for any $θ, α, β, γ \in R$ , (B 1) holds iff (6.6)–(6.9) hold. But (6.6)–(6.9) hold iff (B 4), [ $\cos (β / 2) \neq 0 \to$ ] and [ $\sin (β / 2) \neq 0 \to$ ] hold. This completes the proof of the lemma.

To see the corollary (recall figure 1 and), note

${(\hat{l} \times \hat{m})}^{T} \hat{n} = \sin δ, \hat{l}^{T} \hat{n} = 0 and {\hat{m}}^{T} \hat{n} = \cos δ .$

Then, (B 4), [ $\cos (β / 2) \neq 0 \to$ ] and [ $\sin (β / 2) \neq 0 \to$ ] hold if the following two conditions are satisfied: (a) β≤2δ and (b)

${\begin{cases} \frac{γ + α}{2} = \arcsin \frac{\tan (β / 2)}{\tan δ} \\ \frac{γ - α}{2} = \frac{π}{2} \\ θ = 2 \arcsin \frac{\sin (β / 2)}{\sin δ} \end{cases} or {\begin{cases} \frac{γ + α}{2} = π - \arcsin \frac{\tan (β / 2)}{\tan δ} \\ \frac{γ - α}{2} = \frac{π}{2} \\ θ = 2 π - 2 \arcsin \frac{\sin (β / 2)}{\sin δ} \end{cases}$

unless β/2=δ=π/2,⁹ and

${\begin{aligned} \frac{γ + α}{2} = s \\ \frac{γ - α}{2} = \frac{π}{2} \\ θ = β \end{aligned}$

for some $s \in R$ if β/2=δ=π/2. This readily gives two solutions for (B 1). Rewriting these solutions with H_t and checking that flipping the signs of the solutions gives other solutions, we obtain (i). Showing (ii) and (iii) is as easy as showing (i). ▪

Appendix C. Proofs of (6.20) in the case f(α,β,δ)<δ

Proof 1 —

Proposition 4.7 and remark 4.8 show $M_{\hat{m}, \hat{n}}^{even} (U_{α, β, γ}^{\hat{m}, \hat{l}}) \leq 4$ , i.e. either $M_{\hat{m}, \hat{n}}^{even} (U_{α, β, γ}^{\hat{m}, \hat{l}}) = 2$ or $M_{\hat{m}, \hat{n}}^{even} (U_{α, β, γ}^{\hat{m}, \hat{l}}) = 4$ . We also have $M_{\hat{m}, \hat{n}}^{even} (F (U)) = M_{\hat{m}, \hat{n}}^{even} (U)$ for any U∈SU(2) (appendix E). Hence, all we need to show is that

$\exists θ, ϕ \in R, R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (γ) = R_{\hat{n}} (θ) R_{\hat{m}} (ϕ)$ C 1

implies f(α,β,δ)≥δ. This can be shown easily with corollary B.2, (ii). ▪

Proof 2 —

We shall show that (C 1), i.e.

$\exists θ, \tilde{γ} \in R, R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (\tilde{γ}) = R_{\hat{n}} (θ),$ C 2

implies f(α,β,δ)=δ, which is enough. Note that f(α,β,δ)=β′ for the angle β′∈[0,π] such that

$\exists α^{'}, γ^{'} \in R, R_{\hat{l}} (- δ) R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (γ) = R_{\hat{m}} (α^{'}) R_{\hat{l}} (β^{'}) R_{\hat{m}} (γ^{'})$ C 3

(Proof of lemma 6.2 in §6.3). From (C 2) and (C 3), we have

$\exists α^{'}, γ^{'}, \tilde{γ}, θ \in R, R_{\hat{m}} (α^{'}) R_{\hat{l}} (β^{'}) R_{\hat{m}} (γ^{'} - γ + \tilde{γ}) = R_{\hat{l}} (- δ) R_{\hat{n}} (θ),$

which is, by lemma 3.4, equivalent to

$\exists α^{'}, γ^{'}, \tilde{γ}, θ \in R, R_{z} (α^{'}) R_{y} (β^{'}) R_{z} (γ^{'} - γ + \tilde{γ}) = R_{y} (- δ) R_{\hat{v}} (θ),$ C 4

where $\hat{v} = {(\sin δ, 0, \cos δ)}^{T}$ . The absolute value of the (1,1)-entry of the right-hand side in (C 4) equals $\cos (δ / 2)$ since R_y(−δ)R_v(θ)=R_z(θ)R_y(−δ), which is equivalent to the equation R_v(θ)=R_y(δ)R_z(θ)R_y(−δ) used before. In view of (3.6), this implies β′=δ, i.e. f(α,β,δ)=δ as desired. ▪

Appendix D. Proof of (6.29)

Observe that $M_{\hat{n}, \hat{m}}^{even} (U) = M_{\hat{m}, \hat{n}}^{even} (U^{†})$ for any U∈SU(2), by definition, and also that ${(U_{α, β, γ}^{\hat{m}, \hat{l}})}^{†} = U_{- γ, - β, - α}^{\hat{m}, \hat{l}} = U_{- γ - π, β, - α + π}^{\hat{m}, \hat{l}}$ for any α,γ and β∈[0,π], cf. footnote 2. These facts give $M_{\hat{n}, \hat{m}}^{even} (U_{α, β, γ}^{\hat{m}, \hat{l}}) = g (- γ - π, β, δ) = g (γ, - β, δ)$ as desired.¹⁰

Appendix E. Proof that $M_{\hat{m}, \hat{n}}^{even} (F (U)) = M_{\hat{m}, \hat{n}}^{even} (U)$ and $N_{\hat{m}, \hat{n}} (F (U)) = N_{\hat{m}, \hat{n}} (U)$

Let any $\hat{m}, \hat{n} \in S^{2}$ with $| {\hat{m}}^{T} \hat{n} | < 1$ and U∈SU(2) be given. By definition, $M_{\hat{m}, \hat{n}}^{even} (F (U)) \leq M_{\hat{m}, \hat{n}}^{even} (U)$ . We shall show the inequality in the other direction using the following lemma.

Lemma E.1 —

For any U,V ∈SU(2), F(U)=F(V) iff U=±V .

Proof —

This directly follows from the well-known fact that the kernel of F is {I,−I}, which can be checked with (3.1). ▪

From this lemma, it readily follows that if there exist some $j \in N$ , ${\hat{v}}_{1}, \dots, {\hat{v}}_{j} \in S^{2}$ and ϕ₁,…,ϕ_j $\in R$ such that $F (U) = F (R_{{\hat{v}}_{1}} (ϕ_{1})) \dots F (R_{{\hat{v}}_{j}} (ϕ_{j}))$ , then $U = \pm R_{{\hat{v}}_{1}} (ϕ_{1}) \dots R_{{\hat{v}}_{j}} (ϕ_{j})$ . But $- R_{{\hat{v}}_{1}} (ϕ_{1}) \dots R_{{\hat{v}}_{j}} (ϕ_{j}) = R_{{\hat{v}}_{1}} (ϕ_{1} + 2 π) R_{{\hat{v}}_{2}} (ϕ_{2}) \dots R_{{\hat{v}}_{j}} (ϕ_{j})$ . This implies $M_{\hat{m}, \hat{n}}^{even} (F (U)) \geq M_{\hat{m}, \hat{n}}^{even} (U)$ , and hence, $M_{\hat{m}, \hat{n}}^{even} (F (U)) = M_{\hat{m}, \hat{n}}^{even} (U)$ . We also have $N_{\hat{m}, \hat{n}} (F (U)) = N_{\hat{m}, \hat{n}} (U)$ , etc., similarly.

Appendix F. Proof of theorem 3.1

Put

δ = \arccos | {\hat{m}}^{T} \hat{n} | \in (0, \frac{π}{2}] .

Note $N_{- \hat{m}, \hat{n}} (U) = N_{\hat{m}, \hat{n}} (U)$ by definition. Hence, we shall prove the statement assuming ${\hat{m}}^{T} \hat{n} \geq 0$ , which is enough.

First, we give another corollary to lemma 6.2.

Corollary F.1 —

For any $α, γ \in R,$ and for any β∈[0,π],

$N_{\hat{m}, \hat{n}} (U_{α, β, γ}^{\hat{m}, \hat{l}}) \leq M_{\hat{m}, \hat{n}} (U_{α, β, γ}^{\hat{m}, \hat{l}}) \leq min {2 ⌈ \frac{β}{2 δ} ⌉ + 1, max_{α^{'} \in R} g (α^{'}, β, δ)} .$

Proof —

The first inequality follows from the definitions of $N_{\hat{m}, \hat{n}}$ and $M_{\hat{m}, \hat{n}}$ . The second inequality immediately follows from corollary 6.3. ▪

It is easy to show, using corollary F.1, that

N_{\hat{m}, \hat{n}} (F (U)) \leq N_{\hat{m}, \hat{n}} (U) \leq ν + 1

F 1

for any U∈SU(2), where ν:=⌈π/δ⌉. But we have $ν + 1 \leq N_{\hat{m}, \hat{n}} (F (U))$ and, therefore, the equality among all sides of (F 1) for

U = {\begin{cases} R_{\hat{m}} (π) R_{\hat{l}} (π - δ) & if ν is even \\ R_{\hat{l}} (π) & if ν is odd. \end{cases}

Thus, we have proved theorem 3.1 elementarily.

Appendix G. Procedure for obtaining an optimal decomposition

In proposition 4.4 and remark 4.6, setting k=⌈β/(2δ)⌉, β_j=2δ for j≠k and t_j=π/2 (j=1,…,k), we have the following special form of (4.6):

R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (γ) = R_{\hat{m}} (α) {[R_{\hat{n}} (π) R_{\hat{m}} (- π)]}^{k - 1} R_{\hat{m}} (- α_{k}) R_{\hat{n}} (θ_{k}) R_{\hat{m}} (- γ_{k} + γ),

G 1

where

β_{k} = β - 2 (k - 1) δ

since β_j=2δ for j<k.

The analogous special case of (4.12) with k′=⌈β′/(2δ)+1/2⌉, β′_j=2δ for j≠k′ and t_j=π/2 (j=1,…,k′) is

U = R_{\hat{n}} (α^{'}) {[R_{\hat{n}} (π) R_{\hat{m}} (- π)]}^{k^{'} - 1} R_{\hat{m}} (- α_{k^{'}}^{'}) R_{\hat{n}} (θ_{k^{'}}^{'}) R_{\hat{m}} (- γ_{k^{'}}^{'} + γ^{'}),

G 2

where

β_{k}^{'} = β^{'} + δ - 2 (k^{'} - 1) δ .

In particular, if β′≤δ, this equation becomes

U = R_{\hat{n}} (α^{'}) R_{\hat{m}} (- α_{k^{'}}^{'}) R_{\hat{n}} (θ_{k^{'}}^{'}) R_{\hat{m}} (- γ_{k^{'}}^{'} + γ^{'}) .

G 3

The aim of this appendix is to present a procedure to produce the parameters (angles) of the optimal decomposition having the form of (G 1), (G 2) or (G 3), where interchange of $\hat{m}$ and $\hat{n}$ is allowed.

First, we describe the data format of output decompositions. We shall use a label taking values in {0,1}, where the label 0 indicates that the rightmost factor in the output decomposition is a rotation about $\hat{m}$ , and the label 1 indicates the other case. To express sequences of angles efficiently, we introduce the following notation:

\land \land j stands for π, - π, \dots, π, - π,

G 4

where the pattern ‘π,−π’ is repeated j times, and

- \land \land j stands for - π, π, \dots, - π, π,

G 5

where the pattern ‘−π,π’ is repeated j times ( $j \in Z, j \geq 0$ ). We put $Π = {\land \land j ∣ j \in Z, j \geq 0} \cup {- \land \land j ∣ j \in Z, j \geq 0}$ .

A decomposition is represented as a list of the form

[r_{0}, r_{1}, \dots, r_{N}],

G 6

where r₀∈{0,1} and $r_{j} \in R \cup Π$ for j=1,…,N. The first entry r₀ denotes the label. The part r₁,…,r_N lists the angles of all factors in a decomposition, where the order is preserved in listing the angles. For example, if the optimal decomposition is $R_{\hat{n}} (π / 3) R_{\hat{m}} (- π / 4)$ , the output expressing this is [0,π/3,−π/4]; if the optimal one is $R_{\hat{m}} (π / 8) R_{\hat{n}} (π) R_{\hat{m}} (- π)$ , the output is [0,π/8,π,−π] (or [0,π/8,∧∧1]).

To proceed, we need some definitions. The symbol ⊕ denotes the exclusive or operation (addition in $Z / 2 Z$ ). We define a function reverse as follows: reverse(s)=−s for $s \in R$ , reverse(s)=s for s∈Π and

reverse (r) = [r_{0} \oplus 1, reverse (r_{N}), \dots, reverse (r_{1})]

for a list r of the form (G 6).

We use functions a'(α,β,γ,δ) and c'(α,β,γ,δ) that return α′ and γ′, respectively, such that

R_{y} (- δ) R_{z} (α) R_{y} (β) R_{z} (γ) = R_{z} (α^{'}) R_{y} (β^{'}) R_{z} (γ^{'}) .

We shall not write down algorithms for these functions as it is as trivial as writing down the standard functions a and c in what follows. Below, the functions b, f, g and σ_t defined in §3 will be used.

The core of the procedure consists of the following two functions to represent the above two decompositions, where interchange∈{0,1} is an external variable to be defined outside the functions.

DecompositionOdd (α,β,γ,δ,N){

k:=(N−1)/2; If k=0, then return [interchange, α+γ]

β_last:=β−2(k−1)δ;

(α_last,γ_last,θ_last)^T:=σ_π/2(β_last,δ); /* σ_t is defined in (4.7) */

If k>1, then

return [interchange,α,∧∧k−2,π,−π−α_last,θ_last,−γ_last+γ];

else

return [interchange,α−α_last,θ_last,−γ_last+γ]; }

DecompositionEven (α,β,γ,δ,N,β′){

k′:=N/2;

α′:=a'(α,β,γ,δ);

γ′:=c'(α,β,γ,δ);

β′_last:=β′+δ−2(k′−1)δ;

(α′_last,γ′_last,θ′_last)^T:=σ_π/2(β′_last,δ);

If β′>δ, then

return [interchange,α′+π, −∧∧k′−2, −π−α′_last, θ′_last, −γ′_last+γ′];

else {

If β′=δ, then

return [interchange,α′+θ′_last, −γ′_last+γ′];

else

return [interchange, α′, −α′_last, θ′_last, −γ′_last+γ′]; } }

In what follows, w,x,y and z are the parameters to specify

U (w, x, y, z) = (\begin{matrix} w + i z & y + i x \\ - y + i x & w - i z \end{matrix}) \in SU (2)

G 7

as in definition 4.1. Throughout, relations

\hat{m} = {(m_{x}, m_{y}, m_{z})}^{T} and \hat{n} = {(n_{x}, n_{y}, n_{z})}^{T}

G 8

should be understood.

The following standard functions for converting (w,x,y,z) into the Euler angles would not need to be described: a(w,x,y,z), b(w,x,y,z) and c(w,x,y,z), which return $α \in R$ , β∈[0,π] and $γ \in R$ , respectively, such that

\begin{aligned} \sqrt{w^{2} + z^{2}} \cos \frac{γ + α}{2} & = w, \end{aligned}

G 9

\begin{aligned} \sqrt{x^{2} + y^{2}} \sin \frac{γ - α}{2} & = - x, \end{aligned}

G 10

\begin{aligned} \sqrt{x^{2} + y^{2}} \cos \frac{γ - α}{2} & = - y \end{aligned}

G 11

and

\begin{aligned} \sqrt{w^{2} + z^{2}} \sin \frac{γ + α}{2} & = - z \end{aligned}

G 12

and $\cos (β / 2) = \sqrt{w^{2} + z^{2}}$ , i.e. such that

(\begin{matrix} e^{- i ((γ + α) / 2)} \cos \frac{β}{2} & - e^{i ((γ - α) / 2)} \sin \frac{β}{2} \\ e^{- i ((γ - α) / 2)} \sin \frac{β}{2} & e^{i ((γ + α) / 2)} \cos \frac{β}{2} \end{matrix}) = R_{z} (α) R_{y} (β) R_{z} (γ) = (\begin{matrix} w + i z & y + i x \\ - y + i x & w - i z \end{matrix}) .

Similarly, functions $\tilde{a} (m_{x}, m_{y}, m_{z})$ and $\tilde{b} (m_{x}, m_{y}, m_{z})$ that return spherical coordinates $\tilde{α}$ and $\tilde{β}$ , respectively, such that $(\cos \tilde{α} \sin \tilde{β}, \sin \tilde{α} \sin \tilde{β}, \cos \tilde{β}) = (m_{x}, m_{y}, m_{z})$ will be used freely. We also use

sign (x) = {\begin{cases} 1 & if x \geq 0 \\ - 1 & if x < 0, \end{cases}

and a function normalised_vprod(m_x,m_y,m_z,n_x,n_y,n_z) that returns $∥ \hat{m} \times \hat{n} ∥^{- 1} {(\hat{m} \times \hat{n})}^{T}$ , recall (G 8).

The following function represents the main step (for obtaining $\tilde{γ}$ ) of the calculation of the SU(2) element associated with $\hat{l} = {(l_{x}, l_{y}, l_{z})}^{T}$ and $\hat{m}$ that has been described in appendix A:

\begin{aligned} \tilde{c} (\tilde{α}, \tilde{β}, l_{x}, l_{y}, l_{z}) \\ = sign (- l_{x} \cos \tilde{β} \cos \tilde{α} - l_{y} \cos \tilde{β} \sin \tilde{α} + l_{z} \sin \tilde{β}) \arccos (- l_{x} \sin \tilde{α} + l_{y} \cos \tilde{α}) . \end{aligned}

G 13

Now we present the procedure, where w,x,y and z are the parameters of U(w,x,y,z) as in (G 7) to be decomposed.

Procedure for obtaining an optimal decomposition.

Inputs: $w, x, y, z \in R$ with w²+x²+y²+z²=1; $m_{x}, m_{y}, m_{z}, n_{x}, n_{y}, n_{z} \in R$ with $m_{x}^{2} + m_{y}^{2} + m_{z}^{2} = 1$ , $n_{x}^{2} + n_{y}^{2} + n_{z}^{2} = 1$ and m_xn_x+m_yn_y+m_zn_z≥0.

Output: a list consisting of a label ∈{0,1}, and the angles of all factors in an optimal decomposition.

interchange:=0;

$δ := \arccos {\hat{m}}^{T} \hat{n}$ ;

If $b (\hat{m}, U (w, x, y, z)) < b (\hat{n}, U (w, x, y, z))$ , then {

(t_x,t_y,t_z):=(m_x,m_y,m_z);

(m_x,m_y,m_z):=(n_x,n_y,n_z);

(n_x,n_y,n_z):=(t_x,t_y,t_z);

interchange:=1; }

(l_x,l_y,l_z):=normalised_vprod(m_x,m_y,m_z,n_x,n_y,n_z);

/* Euler angles of SU(2) element associated with $\hat{l}$ and $\hat{m}$ in appendix A */

$\tilde{α} := \tilde{a} (m_{x}, m_{y}, m_{z})$ ;

$\tilde{β} := \tilde{b} (m_{x}, m_{y}, m_{z})$ ;

$\tilde{γ} := \tilde{c} (\tilde{α}, \tilde{β}, l_{x}, l_{y}, l_{z})$ ;

/* Main step */

1. Set $V = R_{z} (\tilde{α}) R_{y} (\tilde{β}) R_{z} (\tilde{γ})$ and calculate parameters w′,x′,y′,z′ such that

U (w^{'}, x^{'}, y^{'}, z^{'}) = V^{†} U (w, x, y, z) V .

2. Obtain α:=a(w′,x′,y′,z′), β:=b(w′,x′,y′,z′), and γ:=c(w′,x′,y′,z′).

3. Put β′:=f(α,β,δ), β′′:=f(γ,−β,δ), and

N := min {2 ⌈ \frac{β}{2 δ} ⌉ + 1, g (α, β, δ), g (γ, - β, δ)} .

4. Do one of the following three processes according to the case:

Case 1 [ N=2⌈β/(2δ)⌉+1 ]

return DecompositionOdd (α,β,γ,δ,N);

Case 2 [ N=g(α,β,δ) ]

return DecompositionEven (α,β,γ,δ,N,β′);

Case 3 [ N=g(γ,−β,δ) ]

return reverse(DecompositionEven (−γ−π,β,−α+π,δ,N,β′′));

End of the procedure.

Footnotes

Here, the crux of the difficulty in obtaining this work's results will be explained. Finding the minimum odd number of factors needed for decomposing U, which is expressed with a standard parameter β of U, together with minimum-achieving decompositions, was relatively easy. The crux lay in obtaining a solution to attain the minimum even number of factors, which was found to be expressed with another parameter β′ eventually.

The restriction of β to [0,π] does not seem common. However, in a straightforward proof of this lemma, β∈[0,π] can be chosen so that $\cos (β / 2) = | a |$ and $\sin (β / 2) = | b |$ when the first row of U is (a,b). Also any R_z(α′)R_y(β′)R_z(γ′) without this restriction can be written as R_z(α)R_y(β)R_z(γ) with some β∈[0,π] and $α, γ \in R$ . This readily follows from equations $R_{\hat{v}} (θ + 2 π) = - R_{\hat{v}} (θ)$ , $\hat{v} \in S^{2}$ , $θ \in R$ , and R_z(−π)R_y(β′)R_z(π)=R_y(−β′), $β^{'} \in R$ .

The objects treated in this subsection and previous one can be found in Wigner [3, ch. 15], where −Y and −Z have been used instead of our Y and Z in defining the homomorphism. Owing to this difference, the homomorphism in Wigner [3] is TF(U)T, where T is the diagonal matrix with diagonal entries 1 (leftmost), −1 and −1. (For example, TF(R_y(θ))T and TF(R_z(θ))T have appeared in Wigner [3, ch. 15] while we shall use F(R_y(θ)) and F(R_z(θ)) in appendix A. The general form $R_{\hat{v}} (θ)$ can be derived in a natural manner, but one may consult Biedenharn & Louck [4, ch. 2] for it.)

⁴

For the sake of constructiveness, such an element U is constructed in appendix A.

⁵

To make the construction explicit, one can set β_j=2δ for j≠k. The analogous comment applies to the division of β′+δ in proposition 4.7.

⁶

All remarks except remark 4.5, which needs no proof, will be proved in what follows.

⁷

One (seemingly difficult) issue arises: determine all optimal decompositions of an arbitrarily fixed rotation. Note that in propositions 4.4 and 4.7 and their proofs, any solution for $R_{\hat{n}} (θ) = R_{\hat{m}} (α) R_{\hat{l}} (β) R_{\hat{m}} (γ)$ can be used (see corollary B.2 in appendix B for explicit solutions, among which one is chosen to be used in remarks 4.6 and 4.9).

⁸

Here, w=±x±y+z means w∈{x+y+z,x−y+z,−x+y+z,−x−y+z}.

⁹

$\tan (β / 2) / \tan δ$ should be understood as 0 if β/2<δ=π/2.

¹⁰

As a check, one can show, using corollary B.2, (iii), that $M_{\hat{n}, \hat{m}}^{even} (U_{α, β, γ}^{\hat{m}, \hat{l}}) = 4$ if f(γ,−β,δ)<δ in the same way as in appendix C.

Funding statement

This work was supported by SCOPE (Ministry of Internal Affairs and Communications) and by Japan Society for the Promotion of Science KAKENHI grant nos. 22540150 and 21244007.

References

1.Lowenthal F. 1971. Uniform finite generation of the rotation group. Rocky Mt. J. Math. 1, 575–586. (doi:10.1216/RMJ-1971-1-4-575) [Google Scholar]
2.Lowenthal F. 1972. Uniform finite generation of the SU(2) and SL(2,R). Can. J. Math. 24, 713–727. (doi:10.4153/CJM-1972-067-x) [Google Scholar]
3.Wigner EP. 1959. Group theory and its application to the quantum mechanics of atomic spectra. New York, NY: Academic Press. [Google Scholar]
4.Biedenharn LC, Louck JD. 1985. Angular momentum in quantum physics: theory and application. New York, NY: Cambridge University Press. [Google Scholar]
5.D'Alessandro D. 2004. Optimal evaluation of generalized Euler angles with applications to control. Automatica 40, 1997–2002. (doi:10.1016/j.automatica.2004.06.006) [Google Scholar]
6.Koch RM, Lowenthal F. 1975. Uniform finite generation of three-dimensional linear Lie groups. Can. J. Math. 27, 396–417. (doi:10.4153/CJM-1975-048-0) [Google Scholar]
7.Leite FS. 1991. Bounds on the order of generation of SO(n,R) by one-parameter subgroups. Rocky Mt. J. Math. 21, 879–911. (doi:10.1216/rmjm/1181072975) [Google Scholar]
8.Reck M, Zeilinger A, Bernstein HJ, Bertani P. 1994. Experimental realization of any discrete unitary operator. Phys. Rev. Lett. 73, 58–61. (doi:10.1103/PhysRevLett.73.58) [DOI] [PubMed] [Google Scholar]
9.Boykin PO, Mor T, Pulver M, Roychowdhury V, Vatan F. 1999. On universal and fault-tolerant quantum computing: a novel basis and new constructive proof of universality for Shor's basis. In 40th Annu. Symp. on Foundations of Computer Science, 17–19 October 1999, New York, NY, pp. 486–494. IEEE. [Google Scholar]
10.Brezov D, Mladenova C, Mladenov I. 2012. Vector decompositions of rotations. J. Geom. Symmetry Phys. 28, 67–103. [Google Scholar]
11.Brezov D, Mladenova C, Mladenov I. 2013. Vector parameters in classical hyperbolic geometry. J. Geom. Symmetry Phys. 30, 19–48. [Google Scholar]
12.Brezov D, Mladenova C, Mladenov I. 2014. A decoupled solution to the generalized Euler decomposition problem in ℝ³ and ℝ^2,1. J. Geom. Symmetry Phys. 33, 47–78. [Google Scholar]

[RSOS140145C1] 1.Lowenthal F. 1971. Uniform finite generation of the rotation group. Rocky Mt. J. Math. 1, 575–586. (doi:10.1216/RMJ-1971-1-4-575) [Google Scholar]

[RSOS140145C2] 2.Lowenthal F. 1972. Uniform finite generation of the SU(2) and SL(2,R). Can. J. Math. 24, 713–727. (doi:10.4153/CJM-1972-067-x) [Google Scholar]

[RSOS140145C3] 3.Wigner EP. 1959. Group theory and its application to the quantum mechanics of atomic spectra. New York, NY: Academic Press. [Google Scholar]

[RSOS140145C4] 4.Biedenharn LC, Louck JD. 1985. Angular momentum in quantum physics: theory and application. New York, NY: Cambridge University Press. [Google Scholar]

[RSOS140145C5] 5.D'Alessandro D. 2004. Optimal evaluation of generalized Euler angles with applications to control. Automatica 40, 1997–2002. (doi:10.1016/j.automatica.2004.06.006) [Google Scholar]

[RSOS140145C6] 6.Koch RM, Lowenthal F. 1975. Uniform finite generation of three-dimensional linear Lie groups. Can. J. Math. 27, 396–417. (doi:10.4153/CJM-1975-048-0) [Google Scholar]

[RSOS140145C7] 7.Leite FS. 1991. Bounds on the order of generation of SO(n,R) by one-parameter subgroups. Rocky Mt. J. Math. 21, 879–911. (doi:10.1216/rmjm/1181072975) [Google Scholar]

[RSOS140145C8] 8.Reck M, Zeilinger A, Bernstein HJ, Bertani P. 1994. Experimental realization of any discrete unitary operator. Phys. Rev. Lett. 73, 58–61. (doi:10.1103/PhysRevLett.73.58) [DOI] [PubMed] [Google Scholar]

[RSOS140145C9] 9.Boykin PO, Mor T, Pulver M, Roychowdhury V, Vatan F. 1999. On universal and fault-tolerant quantum computing: a novel basis and new constructive proof of universality for Shor's basis. In 40th Annu. Symp. on Foundations of Computer Science, 17–19 October 1999, New York, NY, pp. 486–494. IEEE. [Google Scholar]

[RSOS140145C10] 10.Brezov D, Mladenova C, Mladenov I. 2012. Vector decompositions of rotations. J. Geom. Symmetry Phys. 28, 67–103. [Google Scholar]

[RSOS140145C11] 11.Brezov D, Mladenova C, Mladenov I. 2013. Vector parameters in classical hyperbolic geometry. J. Geom. Symmetry Phys. 30, 19–48. [Google Scholar]

[RSOS140145C12] 12.Brezov D, Mladenova C, Mladenov I. 2014. A decoupled solution to the generalized Euler decomposition problem in ℝ³ and ℝ^2,1. J. Geom. Symmetry Phys. 33, 47–78. [Google Scholar]

PERMALINK

The minimum number of rotations about two axes for constructing an arbitrarily fixed rotation

Mitsuru Hamada

Abstract

2. Introduction

3. Preliminaries and a known result

3.1. Definitions

3.2. The maximum of the minimum number of constituent rotations over all target rotations

Theorem 3.1 (Lowenthal [1,2]) —

3.3. Parametrizations of the elements in SU(2)

Lemma 3.2 —

3.4. Homomorphism from SU(2) onto SO(3)

3.5. Generic orthogonal axes and coordinate axes

Lemma 3.3 —

Proof —

Lemma 3.4 —

Proof —

4. The minimum numbers of constituent rotations and optimal constructions of an arbitrary rotation

Definition 4.1 —

Definition 4.2 —

Theorem 4.3 —

Proposition 4.4 —

Remark 4.5 —

Remark 4.6 —

Proposition 4.7 —

Remark 4.8 —

Remark 4.9 —

5. Limits on constructions

Lemma 5.1 —

6. Proof of the results

6.1. Structure of the proof

6.2. Proof of propositions 4.4 and 4.7

Lemma 6.1 —

Proof —

Proof of proposition 4.4 —

Proof of proposition 4.7 —

Figure 1.

6.3. Proof of theorem 4.3

Lemma 6.2 —

Corollary 6.3 —

Proof —

Proof of theorem 4.3 —

7. Conclusion

8. Comments on Brezov et al. [10–12]

Appendix A. Element in SU(2) associated with l^ and m^

Appendix B. Details on angles in propositions 4.4 and 4.7

Lemma B.1 —

Corollary B.2 —

Proof —

Appendix C. Proofs of (6.20) in the case f(α,β,δ)<δ

Proof 1 —

Proof 2 —

Appendix D. Proof of (6.29)

Appendix E. Proof that Mm^,n^even(F(U))=Mm^,n^even(U) and Nm^,n^(F(U))=Nm^,n^(U)

Lemma E.1 —

Proof —

Appendix F. Proof of theorem 3.1

Corollary F.1 —

Proof —

Appendix G. Procedure for obtaining an optimal decomposition

Footnotes

Funding statement

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Appendix A. Element in SU(2) associated with $\hat{l}$ and $\hat{m}$

Appendix E. Proof that $M_{\hat{m}, \hat{n}}^{even} (F (U)) = M_{\hat{m}, \hat{n}}^{even} (U)$ and $N_{\hat{m}, \hat{n}} (F (U)) = N_{\hat{m}, \hat{n}} (U)$