Quantizing Euclidean Motions via Double-Coset Decomposition

Christian Wülker; Sipu Ruan; Gregory S Chirikjian

doi:10.34133/2019/1608396

. 2019 Sep 15;2019:1608396. doi: 10.34133/2019/1608396

Quantizing Euclidean Motions via Double-Coset Decomposition

Christian Wülker ¹, Sipu Ruan ¹, Gregory S Chirikjian ^1,^2,^✉

PMCID: PMC7006946 PMID: 32043079

Abstract

Concepts from mathematical crystallography and group theory are used here to quantize the group of rigid-body motions, resulting in a “motion alphabet” with which robot motion primitives are expressed. From these primitives it is possible to develop a dictionary of physical actions. Equipped with an alphabet of the sort developed here, intelligent actions of robots in the world can be approximated with finite sequences of characters, thereby forming the foundation of a language in which robot motion is articulated. In particular, we use the discrete handedness-preserving symmetries of macromolecular crystals (known in mathematical crystallography as Sohncke space groups) to form a coarse discretization of the space SE(3) of rigid-body motions. This discretization is made finer by subdividing using the concept of double-coset decomposition. More specifically, a very efficient, equivolumetric quantization of spatial motion can be defined using the group-theoretic concept of a double-coset decomposition of the form Γ\SE(3)/Δ, where Γ is a Sohncke space group and Δ is a finite group of rotational symmetries such as those of the icosahedron. The resulting discrete alphabet is based on a very uniform sampling of SE(3) and is a tool for describing the continuous trajectories of robots and humans. An efficient coarse-to-fine search algorithm is presented to round off any motion sampled from the continuous group of motions to the nearest element of our alphabet. It is shown that our alphabet and this efficient rounding algorithm can be used as a geometric data structure to accelerate the performance of other sampling schemes designed for desirable dispersion or discrepancy properties. Moreover, the general “signals to symbols” problem in artificial intelligence is cast in this framework for robots moving continuously in the world.

1. Introduction

The aim of this paper is to develop a “motion alphabet” with which robot motion primitives are expressed. From these primitives one can develop a dictionary of physical actions. Two main themes from the theory of Lie groups are used to construct this alphabet (or quantization) and to efficiently solve the “signals to symbols” problem in this context: (1) the decomposition of a Lie group into cosets, double cosets, and corresponding fundamental domains; (2) the possibility of constructing such fundamental domains as Voronoi or Voronoi-like cells.

Equipped with an alphabet of the sort developed in this paper and the associated algorithms for efficiently rounding off continuous motions to nearby discrete representatives, intelligent actions of robots in the world can be approximated with finite sequences of characters, thereby forming the foundation of a language in which robot motion is articulated.

At the macroscopic scale, the world can be thought of as continuous. Coarse descriptions of this continuous world are used by intelligent systems (e.g., humans and computers) to classify objects, actions, and scenarios. The classical “signals to symbols” problem in artificial intelligence (AI) seeks to bin everything in the continuous world into countable classes characterized by strings of discrete symbols, such as the letters in an alphabet. In a sense, this is the inverse problem of what the genetic code does, since the finite alphabet {A, C, G, T} encodes the morphology and metabolism of every living creature that moves in the continuous world and processes information with an analog brain.

Discrete alphabets (including the Roman alphabet, the radicals that form characters in Asian languages, sign language, etc.) form the basis for all human languages [1]. All discrete characters can be reduced to a binary code, e.g., ASCII or Morse code in the case of western letters. The efficiency of an alphabet (or a code) depends on how much information can be conveyed with a given number of symbols, and how difficult it is to convey those symbols. For example, since the letter “e” is the most often used symbol in the English language, it is represented by a single “·” in Morse code. Perhaps the simplest and most widely occurring motion of a robot is the “nil motion”, or identity element of the motion group, describing the “home” pose (position and orientation), which coincidentally can be denoted simply as “e”. All other elements of our motion alphabet will be written as a product of the form γδ where γ ∈ Γ, a crystallographic space group, and δ ∈ Δ, a finite group of rotational symmetries.

As an example of how discrete symbols classify the world, consider how each of the following sentences describes a class of situations in which there is continuous freedom which becomes more restricted as the number of discrete symbols increases:

The cat is on the table.
The black cat is sitting on the table.
The black cat is sitting on the table and looking at you.

In (1), the cat could be of any color and lying down, sitting, or standing in an infinite variety of postures. The color and posture of the cat get somewhat restricted as the sentences get longer, but we still do not know how the cat is sitting, what its tail is doing, how big it is, its weight, the color of its eyes, etc. Indeed, the sentence would need to be quite long to hone in on what is actually going on, hence the old saying “one picture is worth a thousand words.”

The basic problem that must be reconciled by intelligent robots is to approximate, or round off, continuous objects and actions within a discrete descriptive framework and then truncate the discrete description at some finite level given limitations on computational and sensing resources. This is true in both machine-learning (ML) frameworks such as deep learning and classical AI. A specific kind of rounding off (of Euclidean motions) is the main subject of this paper, which is about making precise the round-off statement

\begin{matrix} d i s c r e t e d e s c r i p t i o n = [c o n t i n u o u s d e s c r i p t i o n] \end{matrix}

(1)

in the context of motions of objects and intelligent agents in the world.

A real-world problem that can be addressed using this discrete description of motions is robot motion planning, which is one of the fundamental topics in the field of intelligent robotics. It has a wide range of applications such as autonomous vehicles [2–4], mechanical parts assembly [5, 6], space on-orbit manipulations [7], and protein folding [8]. In general, motion planning seeks to answer the query: “how to plan a path that guides the robot from a given start pose to an end pose subject to some geometric or dynamical constraints”. And a popular way that has been applied for decades is to build a “roadmap” [9], which is basically a graph structure consisting of valid vertices and edges. A large number of efficient algorithms have been proposed such as visibility graph [10], Voronoi graph [11], cell decomposition [12], probabilistic roadmap [13], and their variants. For a given environment and constraints (obstacles, nonholonomic constraints, etc.), a roadmap provides information with qualified vertices and connectivity among nearby vertices. And once a query is submitted, graph searching algorithms give a valid optimal sequence of motions from the roadmap. Although a roadmap method is able to answer multiple searching queries, when the environment is changing, those vertices or edges information needs to be updated. To some extent, quantizing a continuous motion of the robot from a predefined motion alphabet is closely related to the roadmap concept. However, the advantage lies in the storage and representation of the motion sequences—once a rich library of alphabets is built a priori, representing different motions in different environment subject to different constraints is just a matter of combinations and ordering of the alphabet indices.

The major contributions of this paper are listed as follows:

(1) A “signals to symbols” framework for Euclidean motions based on double-coset decomposition is proposed and its properties are analyzed.

(2) Concrete formulations of the motion alphabet are constructed through crystallographic symmetry.

(3) Fast decoding algorithms via coarse-to-fine double-coset decomposition are proposed and numerically verified.

(4) Comparisons with existing methods for discretizing the Euclidean group and decoding random motions are performed.

(5) A fast hybrid search method that incorporates some existing sampling methods for rotation group with good dispersion or discrepancy properties has been proposed and verified.

The remainder of this paper is structured as follows: In Section 2, a review of the immense literature on machine intelligence as it applies to intelligent robot action in the world is provided. This is a rapidly changing field and impossible to capture in full detail, but some classical highlights are covered. Section 3 reviews some relevant aspects of abstract group theory. This is made concrete in Section 4 which focuses on the group of rigid-body motions. Section 5 reviews crystallographic symmetry, which is a source of discrete symbols from which a motion alphabet is constructed. In Section 6, we develop a motion alphabet by “dividing up” the group SE(3) of rigid-body motions via fine double-coset decomposition based on a crystallographic Sohncke space group and the subgroup of rotational symmetries of the icosahedron. Section 7 presents other choices for motion alphabets and solves the decoding problem efficiently by introducing a coarse-to-fine search scheme. In Section 8, comparisons with existing methods for sampling rotations and Euclidean motions are performed, and it is shown how our alphabet can be used as a geometric data structure to enhance the speed of these other motion-approximation methods.

2. Literature Review

This section reviews two largely disjoint areas of the literature: (1) the interface between group theory and machine learning and AI; (2) sampling methods and measures of their quality.

2.1. Lie Groups in Machine Learning and AI

The recognition of human (and humanoid-robot) actions has been studied from many different perspectives including [14–19]. Probabilistic graphical models [20], generative models [21], and recently “SE3-nets” [22] have been used to describe motion uncertainty in the context of learning. Works on vision and reasoning use concepts of quotient operations to mod out irrelevant information [23–25].

Group-theoretic methods (and abstract algebra more generally) can be found sprinkled throughout the AI literature [26–32]. Although group theory has long history [33–35], it is still a useful and popular tool for solving problems related to motions [36–40].

Of particular importance in the current context is the relationship between artificial intelligence and machine learning. AI arose as a branch of cybernetics, focusing on artificial aspects of reasoning and cognition, thereby leading to a redefinition of cybernetics to focus on information and control. Machine learning (and particularly deep neural networks) led by Hinton, LeCun, Bengio, and others can be viewed as an alternative to classical AI [41–45]. Recently, geometric and algebraic methods are being explored in some forms of machine learning [46–48].

The goal of this paper is to develop an alphabet of basic motions from which discrete words that capture the essence of a continuous motion/action are constructed. Throughout the literature, discretization of motions has attracted significant interest, where the Euclidean group is one of the popular ones [49]. In particular, the uniform sampling of rotations, either random or deterministic, has a wide range of applications [50–56]. This is also one of the applications that this work will address. The discretization of the continuous motions will generate part of a dictionary. This dictionary will, in the future, serve as the knowledge base for an expert system that will enable the robot to function at first use “right out of the box.”

2.2. Measurements of Sample Quality: Discrepancy, Dispersion, Consistency, and Uniformity

Sampling rotations in an efficient way play important roles in several fields including computer graphics [50, 57], protein crystallography [58], molecular physics [59], materials science [60], and robot motion planning [53, 55].

Several measures of the quality of a finite sampled set of rotations have been proposed in the literature. One is discrepancy as defined and used in [53, 61, 62]:

\begin{matrix} D (P, R) ≔ \underset{R \in R}{s u p} |\frac{|P \cap R|}{|P|} - \frac{V o l (R)}{V o l (S O (3))}|, \end{matrix}

(2)

where ℛ is a collection of measurable subsets of SO(3), P is a finite set of sample points in SO(3), and |S| is the number of elements in the finite set S. This concept was updated recently to include products of motion groups [63].

Natural measures of distance between rotations, or between rigid-body motions, can be computed in a variety of different ways, as reviewed in detail in [64].

Given any such metric, ρ : SO(3) × SO(3) → ℝ_≥0, the dispersion of the points can be computed as

\begin{matrix} D (S, ρ) ≔ \underset{R \in S O (3)}{m a x} [\underset{R_{S} \in S}{m i n} ρ (R, R_{S})], \end{matrix}

(3)

where S is a set of the sampled elements in SO(3).

In addition to discrepancy and dispersion, many other measures of sample quality can be defined. For example, a sampling can be called consistent if the distribution of the round-off error for any random rotation is concentrated. Therefore, we define the consistency of a set of samples on SO(3) as

\begin{matrix} C (S, ρ) ≔ σ_{R \in S} \{m (R)\}, \end{matrix}

(4)

where

\begin{matrix} m (R) ≔ \underset{R_{S} \in S - \{R\}}{m i n} ρ (R, R_{S}) \end{matrix}

(5)

and σ is the standard deviation of the set of distances between each sample point and its nearest neighbor. Here a low value of ℭ(S, ρ) indicates high consistency. For example, if ℭ(S, ρ) is zero, then every point in the set has nearest neighbors of the same distance. Moreover, if each point in the set has many neighbors that achieve the minimal value of distance, then the set S has a high level of uniformity, which can be quantified as follows. For each R ∈ S compute the subset A_R ⊂ S as

\begin{matrix} A_{R} ≔ \{Q \in S : ρ (R, Q) = m (R)\} . \end{matrix}

(6)

Then the cardinality of this set measures the number of equally close nearest neighbors to R, and uniformity weights this by the spread of this number:

\begin{matrix} U (S, ρ) ≔ \frac{\min_{R \in S} |A_{R}|}{1 + \max_{R \in S} |A_{R}| - \min_{R \in S} |A_{R}|}, \end{matrix}

(7)

where a high value indicates high uniformity. The numerator reflects the number of nearest neighbors for the worst sample, and the denominator reflects the spread (with 1 included since the difference between max and min can be zero).

For example, an integer lattice in 3D Euclidean space has a high level of uniformity with regard to the Euclidean metric, with each point having six nearest neighbors, each with the same minimized value of distance, and so ℭ(S, ρ) = 0 and 𝔘(S, ρ) = 6. A spherical close-packing can have an even higher value of 𝔘(S, ρ).

3. Some Relevant Aspects of Group Theory

The alphabets constructed in this paper consist of carefully chosen elements of the group of rigid-body motions, drawn from fundamental domains of double-coset spaces. Since this terminology might not be familiar to some readers with an interest in the topic, the relevant concepts from group theory are reviewed here. The concept of a group itself is assumed to be known.

3.1. Definitions and Properties

Let a group G and a subgroup H < G be given. For an element g ∈ G, the corresponding left coset is defined as

\begin{matrix} g H ≔ \{g h : h \in H\} . \end{matrix}

(8)

Analogously, the respective right coset is defined as

\begin{matrix} H g ≔ \{h g : h \in H\} . \end{matrix}

(9)

A subgroup N < G is called a normal subgroup of G if it is conjugated to itself; i.e., gNg⁻¹ = N for all g ∈ G. If N is both a normal and a proper subgroup of G, then it is denoted as N◁G.

A group can be divided into cosets (left or right), all with the same number of elements, and the set of all these cosets is called a coset space. In general, the left-coset space

\begin{matrix} G / H ≔ \{g H : g \in G\} \end{matrix}

(10)

and the right-coset space

\begin{matrix} H \ G ≔ \{H g : g \in G\} \end{matrix}

(11)

are different from each other, except when H is normal in G (H◁G), in which case gH = Hg for all g ∈ G. In this special case G/H = H\G≕H/G is a factor (or quotient) group.

Given two subgroups H, K < G, it is also possible to define double cosets

\begin{matrix} H g K ≔ \{h g k : h \in H, k \in K\}, g \in G, \end{matrix}

(12)

and the corresponding double-coset space,

\begin{matrix} H \ G / K ≔ \{H g K : g \in G\} . \end{matrix}

(13)

Given a left-coset decomposition with respect to a subgroup H, it is possible to define a (nonunique) fundamental domain

\begin{matrix} F_{G / H} \subset G \end{matrix}

(14)

consisting of exactly one element per left coset. Since a group can be partitioned into disjoint cosets, it is

\begin{matrix} G = ⋃_{g \in F_{G / H}} g H \end{matrix}

(15)

and

\begin{matrix} G = ⋃_{h \in H} F_{G / H} h \end{matrix}

(16)

(and analogously in the right-coset case [65]). It is important to note that we are generally dealing with unions of disjoint sets in this paper. When H◁G, such fundamental domain is a group with respect to the original group operation modulo H, and this group is isomorphic to G/H.

Given two subgroups H, K < G, the corresponding double-coset decomposition is

\begin{matrix} G = ⋃_{g \in F_{H \ G / K}} H g K \end{matrix}

(17)

and we further have that

\begin{matrix} G = ⋃_{h \in H} ⋃_{k \in K} h F_{H \ G / K} k, \end{matrix}

(18)

where F_H\G/K is a fundamental domain for the double-coset space H\G/K consisting of exactly one element per double coset. When G is a Lie group (i.e., when G has the structure of a differentiable manifold, and when further the group operation and inversion are compatible with this smooth structure) and H, K are discrete subgroups, then such fundamental domains F_G/H and F_H\G/K will have the same dimensionality as G, but smaller volume.

When G is a Lie group, one way to construct fundamental domains is as Voronoi-like cells: Since G is a smooth manifold, a proper distance function (metric) ρ exists, and we can define

\begin{matrix} F_{G / H}^{\circ} ≔ \{g \in G : ρ (e, g) < ρ (e, g h) for  all h \in H \ \{e\}\}, \\ F_{H \ G}^{\circ} ≔ \{g \in G : ρ (e, g) < ρ (e, h g) for  all h \in H \ \{e\}\}, \end{matrix}

(19)

where e is the identity element of G, and when H∩K = {e},

\begin{matrix} F_{H \ G / K}^{\circ} ≔ \{g \in G : ρ (e, g) < ρ (e, h g k) for  all (h, k) \in H \times K \ \{(e, e)\}\} . \end{matrix}

(20)

Above we have defined the interior of fundamental domains. The union of the corresponding shifts of these open sets will not in general completely exhaust G (cf. (16) and (18)). However, a set of measure zero will at most be missing, which is not relevant for our practical purposes.

A unimodular Lie group G is one for which there exists an integration measure, μ, or equivalently a volume element dg, such that for every integrable function f : G → ℝ,

\begin{matrix} μ (f) ≔ \int_{G} f (g) d g = \int_{G} f (g_{0} g) d g = \int_{G} f (g g_{0}) d g = \int_{G} f (g^{- 1}) d g . \end{matrix}

(21)

In the case of a compact Lie group such as G = SO(3), μ(1) = Vol(G).

In terms of ZXZ Euler angles (α, β, γ), it is well known that the volume element for SO(3) is [64]

\begin{matrix} d g = s i n β d α d β d γ . \end{matrix}

(22)

Consequently, sampling uniformly in Euler angles leads to clumping of samples around β = 0 and π, and under sampling near β = π/2.

Some authors therefore sample β nonuniformly, by making a change of coordinates as β = cos⁻¹(x). Then $s i n β = \sqrt{1 - x^{2}}$ , and $|d β / d x| = 1 / \sqrt{1 - x^{2}}$ . This results in equivolumetric portioning of SO(3) in the coordinates (α, x, γ) with volume element dg = dα dx dγ.

However, equipartitioning into units of equal volume is not the same as equipartitioning into units of equal shape. One attempt to partition based on shape arose nearly 50 years ago in the context of protein crystallography, where Lattman [58] realized that the metric tensor for SO(3) as expressed in Euler angles (α, β, γ) could be diagonalized by changing to (α′, β′, γ′)≔(α + γ, β, α − γ). This diagonalization does not change the volume element, which remains dg = sin β′ dα′ dβ′ dγ′.

Group theory has been a cornerstone in all areas of the physical sciences including crystallography, classical mechanics, quantum mechanics, chemistry, and particle physics. Moreover, in classical works on finite automata theory, attempts were made to incorporate the theory of finite groups [26–28]. Many roboticists and computer vision researchers know about the special Euclidean group SE(3), which is a Lie group that describes rigid-body motions. This will be discussed later, after first reviewing the group of pure rotations.

3.2. A Concrete Example: The Rotation Group

In the following, the abstract definitions are illustrated with a concrete example. The set of 3 × 3 rotation matrices is called the special orthogonal group and is denoted as SO(3). That is,

\begin{matrix} S O (3) ≔ \{R \in R^{3 \times 3} : R^{T} R = I a n d d e t R = 1\}, \end{matrix}

(23)

where 𝕀 is the identity matrix. Here the group operation is simply matrix multiplication. It is not difficult to show that 𝕀 is the group identity, given any two R₁, R₂ ∈ SO(3) that the matrix product R₁R₂ ∈ SO(3), and that R⁻¹ = R^T is the inverse of R ∈ SO(3).

Explicitly for SO(3), elements of the associated Lie algebra 𝔰𝔬(3), which correspond to infinitesimal rotations, are skew-symmetric matrices

\begin{matrix} X = [\begin{matrix} \begin{matrix} 0 & - x_{3} & x_{2} \\ x_{3} & 0 & - x_{1} \\ - x_{2} & x_{1} & 0 \end{matrix} \end{matrix}], \end{matrix}

(24)

and the matrix exponential (or exponential) map gives

\begin{matrix} R (x) = e x p (X) = I + \frac{s i n {‖x‖}_{2}}{{‖x‖}_{2}} X + \frac{1 - c o s {‖x‖}_{2}}{{‖x‖}_{2}^{2}} X^{2}, \end{matrix}

(25)

where x = (x₁, x₂, x₃)^T≕X^∨ is the dual vector corresponding to X. The opposite operation gives $\hat{x} ≔ X$ . The parameters x₁, x₂, and x₃ can be thought of as Cartesian coordinates in the Lie algebra 𝔰𝔬(3), and when these coordinates are restricted to the range ‖x‖₂ ≤ π, they can be used to parameterize all of SO(3) through the exponential map. When ‖x‖₂ = π the point is at the boundary. In such a case x and −x describe the same rotation, and so one model for SO(3) is that of a solid ball of radius π in Euclidean space, with antipodal points identified as being equivalent, or “glued.”

The inverse map for each is the matrix logarithm. This degenerates when the rotation angle θ≔‖x‖₂ is π. By restricting the discussion to the case when θ < π, the logarithm is uniquely defined on a subset of SO(3) depleted by the set of measure zero defined by θ = π. This depletion will have no effect on our formulation. Indeed, we can define the metric

\begin{matrix} ρ (R_{1}, R_{2}) ≔ {‖{l o g}^{\lor} (R_{1}^{T} R_{2})‖}_{2} \end{matrix}

(26)

when R₁^TR₂ is not a rotation by π, and otherwise ρ(R₁, R₂)≔π.

It is interesting to note that the above distance function ρ for SO(3) is bi-invariant; i.e.,

\begin{matrix} ρ (R_{1}, R_{2}) = ρ (R R_{1}, R R_{2}) = ρ (R_{1} R, R_{2} R), R \in S O (3) . \end{matrix}

(27)

Using this particular metric ρ it is possible to construct Voronoi cells (in the classical sense) in SO(3) for fundamental domains F_H\SO(3) and F_H\SO(3)/K, because then (19) and (20) become

\begin{matrix} F_{S O (3) / H}^{\circ} = F_{H \ S O (3)}^{\circ} = \{R \in S O (3) : ρ (R, I) < ρ (R, h) for  all h \in H \ \{I\}\} \end{matrix}

(28)

and

\begin{matrix} F_{H \ S O (3) / K}^{\circ} = \{R \in S O (3) : ρ (R, I) < ρ (R, h k) for  all (h, k) \in H \times K \ \{(I, I)\}\}, \end{matrix}

(29)

respectively.

Of particular interest to us are the cases where H is one of the finite groups of rotational symmetries of the Platonic solids. This is shown in Figure 1 (see also [54]). In this figure the fundamental domains F_H\SO(3) are depicted in exponential coordinates in 𝔰𝔬(3) (identified with a ball of radius π, as explained above). Note that this is a conceptual plot, since actually the edges and faces of these Voronoi cells are slightly bent. The number |H| of elements in H is 12 for the group of tetrahedral, 24 for the group of octahedral, and 60 for the group of icosahedral rotational symmetries. By the left (or right) action of H on the respective fundamental domain, it is possible to (almost completely) cover SO(3); cf. (16).

Fundamental domain F_H\SO(3) with H as the group of (from left to right) tetrahedral, octahedral, and icosahedral symmetries, constructed as Voronoi cells, viewed in exponential coordinates.

The coset spaces resulting from quotienting the rotation group by discrete subgroups have been studied in the pure mathematics literature under the names “spherical space forms” [66] and Poincaré homology 3-spheres [67]. The geometric and topological properties of these manifolds are related to how the opposing faces of our tiles can be glued together.

If H is the group of rotational symmetry operations of the icosahedron, then |H| = 60 and F_H\SO(3) can be viewed as a dodecahedral cell centered at the origin of the Lie algebra 𝔰𝔬(3) (see Figure 1, right). If we choose the second subgroup K to be a conjugated group of the tetrahedral, octahedral, or icosahedral symmetries (i.e., K≔gPg⁻¹, where P is the group of the rotational symmetries of the respective Platonic solid and g ∈ SO(3) is chosen such that H∩K = {𝕀}), then the Voronoi cell F_H\SO(3)/K takes a shape as exemplarily shown in Figure 2. On the other hand, if we choose K≔H, then F_H\SO(3)/H cannot be constructed as a Voronoi cell, but it can be chosen as an irregular tetrahedron (the ruby region in Figure 3), yielding a subdivision of the dodecahedral cell F_H\SO(3) by conjugation of the tetrahedron with the elements in H. Note that such conjugation has the effect of rotation in 𝔰𝔬(3) since log^∨(gRg⁻¹) = g log^∨R. Similarly, if K < H, then a |K|-fold division of F_H\SO(3) can be computed to represent F_H\SO(3)/K, and F_H\SO(3) can be reconstructed from these pieces, again by conjugation.

Center Voronoi cell F_H\SO(3) in coset space (emerald region) with H as the group of icosahedral symmetries, and center Voronoi cell F_H\SO(3)/K in double-coset space (ruby region) with H as the group of icosahedral symmetries for all cases and K as a conjugated group of (from left to right) tetrahedral, octahedral, and icosahedral symmetries, respectively.

Dodecahedral cell F_H\SO(3) (emerald region) and tetrahedral wedge F_H\SO(3)/H (ruby region), with H as the group of icosahedral symmetry. The dodecahedral cell can be decomposed into 60 identical tetrahedral wedges, with five packed on each pentagonal face.

When choosing H to be icosahedral and K = H or K = gHg⁻¹, this means that we can divide SO(3) into 3600 pieces of equal size. The 3600 respective barycentric or Voronoi centers of these cells can be taken as a discretization of SO(3), and any of these centers can be written in a unique way as R_ij = h_ik_j where (h_i, k_j) ∈ H × K with i, j ∈ {1,2,…, 60}. This means that any one of the 3600 points R_ij corresponds to a two-letter word (h_i, k_j).

A natural question to ask is then, for R ∈ SO(3), how do we find the closest word (h_i, k_j) to approximate it? This decoding or “signals to symbols” problem is addressed in Section 7.1.

4. Rigid-Body Motions as a Group Used for Framing Robots

Given any rigid object or multi-rigid-body actor (such as a human, robot, self-driving car, or smart house), the behavior and use of this item involve the time evolution of its conformation and its overall position and orientation, or “pose”. A common descriptive framework for both the internal (conformational) degrees of freedom and their relative pose (configuration) is to attach reference frames on each rigid component.

Let g = (R, t) denote a rigid-body motion relative to a reference frame fixed in space, where R ∈ SO(3) is a rotation matrix and t ∈ ℝ³ is a translation vector. The set of all such motions forms a six-dimensional Lie group, the special Euclidean group SE(3). This is a group because the composition operation

\begin{matrix} g_{1} g_{2} = (R_{1}, t_{1}) (R_{2}, t_{2}) ≔ (R_{1} R_{2}, R_{1} t_{2} + t_{1}) \end{matrix}

(30)

satisfies the properties of closure and associativity, the identity exists and is simply e = (𝕀, 0) (with zero translation), and the inverse of g is g⁻¹ = (R^T, −R^Tt). Note that SE(3) is not commutative (Abelian). The group operation is the same as multiplying homogeneous transformation matrices; i.e.,

\begin{matrix} H (g_{1} g_{2}) = H (g_{1}) H (g_{2}), w h e r e H (g) ≔ [\begin{matrix} \begin{matrix} R & t \\ 0^{T} & 1 \end{matrix} \end{matrix}] . \end{matrix}

(31)

In the case of planar motions, we deal with the special Euclidean group SE(2), where R ∈ SO(2) is parameterized by a single angle θ, and t ∈ ℝ² has components (x, y), totalling three degrees of freedom. SO(2) is the group of rotations in ℝ², defined analogously to (23). In general, SE(d) is an example of an outer (or external) semidirect product which combines the two groups (ℝ^d, +) and SO(d) into the new group

\begin{matrix} S E (d) ≔ S O (d) ⋉ R^{d} . \end{matrix}

(32)

The underlying set of this group is the Cartesian product SO(d) × ℝ^d, but the symbol ⋉ reflects the fact that the group operation is not simply (R₁, t₁)(R₂, t₂) = (R₁R₂, t₁ + t₂), which is also a group (called the direct product), but does not reflect the way that rigid-body motions work.

The next section reviews the handedness-preserving (Sohncke) crystallographic space groups, which are discrete subgroups of SE(3) and which form an important component of the motion alphabets developed in this paper.

5. Crystallographic Groups

A crystallographic group is a discrete (and also cocompact) subgroup of the Euclidean group E(d)≔O(d)⋉ℝ^d, where O(d) is the orthogonal group consisting of all orthogonal real-valued d × d matrices (defined as in (23) for d = 3, but also allowing det R = −1 there). In addition to rotations, the group O(d) also contains reflections and rotoreflections (improper rotations). If d = 3, a crystallographic group is commonly called a space group, for d = 2, it is referred to as a wallpaper group. The literature on mathematical crystallography is immense and spans many decades. Some notable introductions include [68–71]. The relationship between torsion-free crystallographic groups (i.e., Bieberbach groups; see below) and flat manifolds has been studied extensively [72–76].

Elements of a crystallographic group Γ can be expressed as pairs

\begin{matrix} γ = (R_{γ}, t_{γ} + v (R_{γ})), \end{matrix}

(33)

where R_γ ∈ ℙ (a discrete point group, i.e., a subgroup of O(d)), t_γ ∈ 𝕃 (a lattice in ℝ^d), and v : ℙ → ℝ^d. In particular, v is the translational part of a glide-reflection or screw-displacement lattice motion. In general v will satisfy the cocycle identities

\begin{matrix} v (I) = 0, \\ v (R_{γ_{1}} R_{γ_{2}}) = (R_{γ_{1}} v (R_{γ_{2}}) + v (R_{γ_{1}})) m o d T, \end{matrix}

(34)

where T≔{𝕀}⋉𝕃 is the subgroup of pure (or primitive) translations in Γ, which is always normal (T◁Γ). The “mod T” removes components in the sum that are in T, in analogy with, for example, (1 + 5)mod 4 = 2 in modulo-4 arithmetic.

If an element γ ∈ Γ\{e} is of finite order (i.e., if there exists an n ∈ ℕ such that γⁿ = e), it is called a torsion element. The group Γ is called torsion-free (or a Bieberbach group) if it is free of torsion elements. This is equivalent to the property that no element γ ∈ Γ other than the identity e has a fixed point (i.e., a point p ∈ ℝ^d with γ p = p). If v ≡ 0 in (33), then Γ can be written as the semidirect product Γ = ℙ⋉𝕃 and the group is called symmorphic. Of the 230 possible types of space groups, 73 can be decomposed in this way. These are the symmorphic space groups. Bieberbach groups are not symmorphic.

In the theory of crystallographic groups, a well-known isomorphism is due to the above-mentioned fact that the translation subgroup T of Γ is normal:

\begin{matrix} \frac{Γ}{T} ≅ P . \end{matrix}

(35)

Moreover, it can be shown that ℙ is not only discrete, but must be finite. In the case d = 3 the point group ℙ will belong to one of 32 discrete crystallographic point groups that constitute the so-called crystal classes.

In the context of space groups, we can distinguish between Bieberbach (i.e., torsion-free) groups as one extreme, and groups which contain only the identity and torsion elements (rotations, reflections, and improper rotations) as the other extreme, with all other space groups lying somewhat “in between.” For a symmorphic space group Γ, a Bieberbach subgroup Γ_B < Γ with minimum index in Γ is the subgroup T of primitive translations; for many nonsymmorphic space groups, on the other hand, there is a Bieberbach subgroup Γ_B with index [Γ : Γ_B]<[Γ : T] allowing for a decomposition of Γ as a group product [77, p. 719]

\begin{matrix} Γ = Γ_{B} S ≔ \{γ_{B} s : γ_{B} \in Γ_{B}, s \in S\}, \end{matrix}

(36)

where S < Γ is a proper subgroup of ℙ⋉{0} < E(3), and thus Γ_B∩S = {e}. If Γ_B◁Γ, then (36) is an inner (or internal) semidirect product (Γ = Γ_B⋊S). In this case, the decomposition γ = γ_Bs is unique for each γ ∈ Γ. Another useful property is that each of the 65 Sohncke space groups (i.e., handedness-preserving space groups Γ < SE(3)) can be written as a product

\begin{matrix} Γ = Γ_{B} Γ_{S} ≔ \{γ_{B} γ_{S} : γ_{B} \in Γ_{B}, γ_{S} \in Γ_{S}\}, \end{matrix}

(37)

where Γ_B and Γ_S are, respectively, Bieberbach and symmorphic subgroups (see [77, Thm. 3]). If Γ_B∩Γ_S = {e} and Γ_B or Γ_S is normal, then (37) is an inner semidirect product, and for every γ ∈ Γ there exist unique γ_B ∈ Γ_B and γ_S ∈ Γ_S such that γ = γ_Bγ_S. If both Γ_B and Γ_S are normal, then Γ is decomposed into the inner direct product Γ = Γ_B × Γ_S.

6. A Motion Alphabet Based on a Fine Double-Coset Decomposition

The essence of language is communicating information about the continuous world using a finite number of symbols, characters, or patterns. In the same spirit, in classical AI, a fundamental goal is to convert perceptual information from the continuous world to strings of symbols drawn from a finite alphabet, so that the machinery of finite-state automata reasoning can be applied. Every action of a robot in the physical world can be described as a sequence of paths in SE(3). Such paths are continuous mathematical trajectories which can be viewed as “signals”.

A fundamental problem in AI and sensory perception is the so-called “signals to symbols” problem [78–80], in which observations in the continuous world are converted to coarsified representations (i.e., symbols), on which AI systems can execute logical reasoning algorithms. A natural way to discretize the space of paths is to first discretize time, thereby reducing the infinite dimensions of the path space to a finite number of dimensions, and then to replace each sampled pose on the path with a rounded-off version in a discrete set (the alphabet of maneuvers). Because SE(3) does not have dense subgroups, a clever discretization needs to be constructed.

In the three-dimensional case, one of the finest space groups is Γ = P432 which has a total of 24 rotational elements corresponding to the rotational symmetry operations of a cube (cf. [69, p. 634f]). Therefore, if we want to quantize rigid-body motions with a resolution that is reasonable for real-world tasks, something finer than rounding off to the nearest element of Γ is necessary.

Building on ideas introduced in the context of protein-packing models in X-ray crystallography [55, 65, 77, 81, 82], we can augment Γ by an auxiliary discrete rotation group. For example, let Δ denote the group of rotational symmetries of the icosahedron (as a discrete subgroup of SE(3)). Δ has 60 elements, and there is no lattice of discrete translations that corresponds to it. In particular, Γ, Δ < SE(3) and Γ∩Δ = {e}, the identity element. This means that the double-coset space Γ\SE(3)/Δ is a compact Riemannian manifold. It is possible to define a compact fundamental domain F_Γ\SE(3)/Δ ⊂ SE(3) using (20). Restricting the quotient map from SE(3) to Γ\SE(3)/Δ to the fundamental domain F_Γ\SE(3)/Δ then gives a bijective (i.e., one-to-one) mapping from F_Γ\SE(3)/Δ to Γ\SE(3)/Δ. Moreover, the action of Γ (from the left) and Δ (from the right) gives a way to tile SE(3) with disjoint shifts of F_Γ\SE(3)/Δ, because (cf. (18))

\begin{matrix} S E (3) = ⋃_{γ \in Γ} ⋃_{δ \in Δ} γ F_{Γ \ S E (3) / Δ} δ . \end{matrix}

(38)

Intensive study of the fundamental domains F_Γ\SE(3) and F_Γ\SE(3)/Δ has been conducted (ibid.).

When the fundamental domain F_Γ\SE(3)/Δ is constructed using (20), it has the identity element e at its center, and so the tiling in (38) has the effect of sampling each center point by moving from e to γδ where γ ∈ Γ and δ ∈ Δ. In other words, the product Γ × Δ < SE(3)² can be used as a quantized version of SE(3). The number of rotational elements will be 24 × 60 = 1440 which is sufficiently fine to capture the essence of any frame along a trajectory during a robot task. The alphabet defined by Γ × Δ is infinite, but by limiting the extent of translations to be contained in a bounded region, it becomes finite. This means that continuous trajectories can be translated into a finite string of alphabet characters (Figure 4). This opens up the possibility of mapping these quantized trajectories into words expressed in a natural language.

Discretizing a continuous motion trajectory g at times τ₁,…, τ₇ using the alphabet Γ × Δ (conceptual plot). After discretization the continuous motion can be expressed as the sentence (γ₂, δ₃), (γ₃, δ₃), (γ₁, δ₄), (γ₄, δ₄), (γ₄, δ₄), (γ₄, δ₁), (γ₄, δ₁).

An important advantage of the quantization scheme (38) is that the shifted fundamental domains γF_Γ\SE(3)/Δδ will all have the same volume; i.e.,

\begin{matrix} μ (γ F_{Γ \ S E (3) / Δ} δ) = μ (F_{Γ \ S E (3) / Δ}) for  all (γ, δ) \in Γ \times Δ, \end{matrix}

(39)

where μ is the (left- and right-invariant) Haar measure on the (unimodular) Lie group SE(3). This means that the centers γδ of these shifted fundamental domains used for quantization at the same time also allow for a very uniform sampling of the group SE(3).

7. More Alphabets and Coarse-to-Fine Decoding Algorithms

As explained above, after wisely designing a pair of discrete subgroups Γ, Δ < SE(3) with Γ∩Δ = {e}, we can construct a fundamental domain as in (20) and decompose SE(3) as in (38). There are many other ways to choose F_Γ\SE(3) and F_Γ\SE(3)/Δ, as explained in [65]. A particularly simple choice is the Cartesian product

\begin{matrix} F_{Γ \ S E (3) / Δ} ≔ F_{P \ S O (3) / Δ} \times F_{L \ R^{3}}, \end{matrix}

(40)

where ℙ≅Γ/T is the point group of Γ and 𝕃 the lattice of primitive translations. Alternatively, given decomposition (36), another natural choice is

\begin{matrix} F_{Γ \ S E (3) / Δ} ≔ F_{S \ S O (3) / Δ} \times F_{Γ_{B} \ R^{3}} . \end{matrix}

(41)

Here S\SO(3)/Δ and Γ_B\ℝ³ are not (double-)coset spaces—because S≮SO(3) and Γ_B≮ℝ³—but are orbit spaces consisting, respectively, of orbits SRΔ≔{sRδ : s ∈ S, δ ∈ Δ} and Γ_Bx≔{γ_Bx : γ_B ∈ Γ_B} (R ∈ SO(3), x ∈ ℝ³). The fundamental domains F_S\SO(3)/Δ and F_{Γ_B\ℝ³} above can be constructed by choosing exactly one point per orbit, which is consistent with the definitions in Section 3. Different fundamental domains F_Γ\SE(3)/Δ such as those above can be used to express different quantizations via (38).

7.1. The Purely Rotational Case

The choice for F_Γ\SE(3)/Δ in (40) allows us to bootstrap off of the fundamental domains for double-coset spaces for SO(3) discussed earlier. In fact, we can go even further and describe an SE(3) motion trajectory g(τ) = (R(τ), t(τ)) in SO(3) × ℝ³ (as a direct product rather than a semidirect product). This is not merely to make things easier—viewing pose change trajectories in this way has some advantages, as described in [40], where the direct product SO(3) × ℝ³ is called the pose change group and is denoted as PCG(3). Therefore, below we describe in some detail how the “signals to symbols” problem can be solved efficiently in the purely rotational case.

Since (40) is a set rather than a group, we can view it as a subset of either SE(3) or PCG(3). Either way, the general decoding problem reduces to this: Given H, K < SO(3), and R ∈ SO(3), how can we efficiently find the unique pair (h_i, k_j) ∈ H × K such that

\begin{matrix} R = h_{i} Q k_{j} \end{matrix}

(42)

with Q ∈ F_H\SO(3)/K? In particular, if the Voronoi choice is made for F_H\SO(3)/K, solving (42) allows for simply rounding off R to h_ik_j, as indicated in Section 3.2.

The question then becomes how to do this. With the crystallographic constraint, in SE(3) it is possible to define H such that |H| = 24 (octahedral symmetry) and |K| = 60 (icosahedral symmetry), leading to 24 × 60 = 1440 combinations. In PCG(3), on the other hand, subgroups need not be restricted to the crystallographic constraint, and we can have more rotational elements (see below). The question is, is there a better way to test for h_i and k_j than two nested for loops over i and j resulting in a large number of evaluations to find where ρ(R, h_ik_j) is minimized, which is equivalent to solving (42) when the Voronoi choice is made for F_H\SO(3)/K?

The answer is positive, and we shall now present a technique to achieve this. Consider the double-coset space H\SO(3)/K, where H is the group of rotational symmetries of the icosahedron and K = gHg^T is a conjugated group, with g being chosen so that H∩K = {𝕀}. It is thus |H\SO(3)/K| = |H| × |K| = 60² = 3600. We can construct a fundamental domain F_H\SO(3) for the coset space H\SO(3) as a dodecahedral Voronoi cell (cf. (28) and Figure 1, right). Due to the Voronoi property, we can find the shifted tile h_iF_H\SO(3) containing the rotation R of interest by computing the distance ρ(R, h_i) of R to the 60 tile centers h_i ∈ H. We then know that h_i^TR lies in the identity-centered tile F_H\SO(3). We can construct the fundamental domain F_H\SO(3)/K for the double-coset space as a Voronoi cell, too (cf. (29) and Figure 2, right). Since the shifts h_iF_H\SO(3)/Kk_j of this identity-centered fundamental domain will cover SO(3), they will also cover F_H\SO(3). However, the number of required shifts will be much smaller than 3600. Indeed, we found that approximately 180 shifted fundamental domains h_iF_H\SO(3)/Kk_j are sufficient when the conjugation element g is empirically chosen, for example, to minimize the extent

\begin{matrix} \underset{R \in F_{H \ S O (3) / K}}{s u p} ρ (R, I) \end{matrix}

(43)

of F_H\SO(3)/K. We chose g in this particular way so as to minimize the round-off error in the quantization scheme. We can now quickly find the shifted fundamental domain h_i′F_H\SO(3)/Kk_j containing the above h_i^TR by exploiting the Voronoi property of these shifted domains. We then have that R ∈ (h_ih_i′)F_H\SO(3)/Kk_j, and so the decomposition (42) is found. Instead of the 3600 distances on SO(3), in the above technique, we have to compute only about 60 + 180 = 240 distances, resulting in a potential speedup of approximately 3600/240 = 15. We further exploited the fact that ρ(R₁, R₂) = arccos((1/2)[tr(R₁^TR₂) − 1]), the trace wherein can be computed efficiently as

\begin{matrix} t r (R_{1}^{T} R_{2}) = \sum_{i = 1}^{3} \sum_{j = 1}^{3} {[R_{1}]}_{i j} {[R_{2}]}_{i j}, \end{matrix}

(44)

as well as the fact that arccos x < arccos y if and only if x > y.

An even more efficient decoding algorithm can be obtained when considering the double-coset space H\SO(3)/H, where H is the group of rotational icosahedral symmetry. Here again we can use the dodecahedral Voronoi fundamental domain F_H\SO(3) for the coset space H\SO(3) and find the shifted tile h_iF_H\SO(3) containing the rotation R easily as described above. As a fundamental domain F_H\SO(3)/H for the double-coset space, we can choose the tetrahedral wedge shown in Figure 3. We can find the conjugated wedge h_jF_H\SO(3)/Hh_j^T containing the “pulled-back” rotation h_i^TR by using one of the standard query methods. We then know that R ∈ (h_ih_j)F_H\SO(3)/Hh_j^T, and so (42) is solved.

We note that as is the case in Section 6 (cf. (39)), the shifts of the fundamental domains F_H\SO(3)/K and F_H\SO(3)/H above all have the same volume, which is an important advantage of the double-coset approach presented in this paper.

7.2. Planar-Motion Alphabets Based on Wallpaper Groups

As mentioned in Section 5, crystallographic groups are called wallpaper groups in the two-dimensional setting. Figure 5 shows fundamental domains F_{p_i\SE(2)} constructed using (19) for instances of the well-known wallpaper groups (cf. [69, Chap. 6]) p₁, p₂, p₄, p₃, and p₆ (see also [55]), all of which are symmorphic. Here SE(2) is identified with ℝ² × (−π, π) ⊂ ℝ³, with the x and y axes representing translations in x and y direction and the z axis representing the rotation angle θ. These fundamental domains are generated using the Euclidean metric ‖·−·‖₂ on ℝ³, adapted so as to take into account the 2π-periodicity in the rotation angle θ. It is important to note that this metric is left- but not right-invariant (there is no bi-invariant metric on SE(2)). Therefore, the fundamental domains shown in Figure 5 are actually Voronoi rather than Voronoi-like cells (as is the case for SO(3), cf. (28)).

Center Voronoi cell F_{p_i\SE(2)} for certain instances of the wallpaper groups (left to right, top to bottom) p₁, p₂, p₄, p₃, and p₆.

The group p₁ consists solely of translations, constituting a parallelogrammatic lattice in the translational x-y plane. This results in a box with (irregular) hexagonal shape in the x-y plane and height 2π as the fundamental domain F_p₁\SE(2). In addition to the translations in p₁, the wallpaper group p₂ also contains a rotation of order two (i.e., with angle π). Therefore, the fundamental domain F_p₂\SE(2) also has a hexagonal shape in the translational plane, but the height is only π (from −π/2 to π/2) instead of 2π. The group p₄ is a group with rotations of order four (i.e., with angles π/2, π, and 3π/2), as well as translations in a square lattice. Thus the fundamental domain F_p₄\SE(2) has the shape of a square in the translational plane, with its height being π/2 (from −π/4 to π/4). The groups p₃ and p₆ both have a hexagonal translation lattice. In addition to these translations, p₃ contains rotations of order three (rotation angles 2π/3 and 4π/3), while p₆ contains rotations of order six (rotation angles π/3, 2π/3, π, 4π/3, and 5π/3). Both fundamental domains F_p₃\SE(2) and F_p₆\SE(2) have a regular hexagonal shape in the translational plane, with a height of 2π/3 (from −π/3 to π/3) and π/3 (from −π/6 to π/6), respectively.

As an example of a planar-motion alphabet similar to the one described in Section 6, let us consider the double-coset space Γ\SE(2)/Δ with Γ≔p₄ and Δ≔C_2n−1⋉{0} < SE(2), where

\begin{matrix} C_{2 n - 1} ≔ \{[\begin{matrix} c o s θ_{j} & - s i n θ_{j} \\ s i n θ_{j} & c o s θ_{j} \end{matrix}] : θ_{j} = \frac{2 π j}{2 n - 1}, j = 0, \dots, 2 (n - 1)\} < S O (2) \end{matrix}

(45)

is the group of rotations of order 2n − 1 (n ∈ ℕ). It is Γ∩Δ = {e} and so we can construct a fundamental domain F_Γ\SE(2)/Δ as a Voronoi-like cell using (20). This is shown in Figure 6 for n = 3. In fact, because the left-invariant metric used here is also invariant under purely rotational actions from the right, the fundamental domain F_Γ\SE(2)/Δ is a classical Voronoi cell here, too (again as in the case of SO(3), cf. (29)). Analogously as in Section 6, we can use the alphabet Γ × Δ < SE(2)² for an equivolumetric quantization of SE(2), which at the same time allows for a very uniform sampling of the group. Because the scaling of the translational lattice in the wallpaper groups is arbitrary, we can make the alphabet Γ × Δ arbitrarily fine by reducing the translational scaling in p₄ and increasing the parameter n above.

Center Voronoi cell F_Γ\SE(2) in single-coset space (emerald region) based on an instance of the wallpaper group Γ = p₄, and center Voronoi cell F_Γ\SE(2)/Δ in double-coset space (ruby region) with Δ = C₅⋉{0} as the group of rotations of order five.

To illustrate the use of the planar-motion alphabets constructed above, let us consider the SE(2) trajectory g(τ)≔(R(τ), t(τ)), τ ∈ [0,2π), where R(τ) is a rotation by an angle of τ and

\begin{matrix} t (τ) ≔ {[4 c o s τ, 6 (\frac{τ}{2 π}) - 3]}^{T} . \end{matrix}

(46)

We may discretize g at the five equidistant time points π(1 + 2k/5), k = −2,…, 2. As a motion alphabet for SE(2), let us use Γ × Δ = p₄ × C₅. We can denote the elements of C₅ by δ_j with the index j as in (45). Let us denote the elements of p₄ as γ_lmn, where m ∈ ℤ and n ∈ ℤ denote the translation in x and y direction, respectively, while l ∈ {0,1, 2,3} indicates a rotation by an angle of lπ/2. The continuous motion trajectory g can now be expressed as the sentence (γ_2,3,−2, δ₃), (γ_2,−1,−1, δ₄), (γ_2,−4,0, δ₀), (γ_2,−1,1, δ₁), (γ_2,3,2, δ₂).

To close this section, we shall discuss how such decoding problem can be solved efficiently. We can use the same coarse-to-fine search scheme that we used in the purely rotational case in the previous section: In a first step, for a given element g ∈ SE(2), we find γ ∈ p₄ such that g ∈ γF_Γ\SE(2). This step is particularly easy in the case of p₄ when compared with, e.g., the group p₁ (with anisotropic translational lattice). In fact, as implied by Figure 7, in the case of p₄ the above step can be realized by appropriately rounding off (in the decimal sense) the translational components of g, as well as the rotation angle. In the case of p₁, on the other hand, we would have to compute the distances of g to the Voronoi centers. In a second step, we search for the shifted fundamental domain γ′F_Γ\SE(2)/Δδ containing the pulled-back element γ⁻¹g by a purely rotational search. The decomposition of g then reads (γγ′)Qδ with Q = (γγ′)⁻¹gδ⁻¹ ∈ F_Γ\SE(2)/Δ. Of course it is also possible to first treat the translational part of gand then, say, the rotational part. With appropriate modifications, the above coarse-to-fine decoding scheme can also be used with the fine alphabet for SE(3) developed in Section 6.

Tessellation of SE(2) illustrated in ℝ² × (−π, π), based on the fundamental domain (left) F_p₁\SE(2) and (right) F_p₄\SE(2) (conceptual plot).

8. Comparisons and Applications

To clearly demonstrate the advantageous potential of our proposed discretization and decoding algorithms, we provide comparisons of performance with existing methods and introduce a hybrid sampling method to take advantage of the speed and low dispersion properties.

8.1. The Accuracy and Speed of Rounding Off Motions

8.1.1. Uniformity of Sampling on SO(3)

Discretization on SO(3) is an important application of this work, which gives equivolumetric decomposition of the group in the sense that any rotation is located inside an identical Voronoi cell. The uniformity is essential to evaluate how good a sampling method is, and can be measured by dispersion, discrepancy, consistency, etc., as introduced in Section 2.2. For the computations of dispersion and consistency in this work, ρ(R, R_s)≔‖log^∨(R^TR_S)‖₂ is the distance metric defined in Section 3.2.

We compare the dispersion (3), consistency (4), and uniformity (7) of our method in sampling from SO(3) with uniform Euler angles and uniform sampling using Hopf fibration [53]. For the dispersion comparison, we randomly generate 10000 rotations and compute the distance with the nearest sample. The maximum value of these 10000 resulting distances approximates the dispersion. For the consistency, on the other hand, for each sample, we compute the distance to its nearest sample and take the standard deviation. And for the uniformity measure, we compute the number of nearest neighbors of each sample.

For our method, we decompose SO(3) using the double-coset space H\SO(3)/K, where H is the group of rotational symmetries of the icosahedron and K = gHg⁻¹, where g is chosen such that

\begin{matrix} {l o g}^{\lor} g ≔ {[0.4359, - 0.07692, - 0.1282]}^{T} . \end{matrix}

(47)

We found that the 181 shifted fundamental domains h_iF_H\SO(3)/Kk_j with center h_ik_j closest to the identity are sufficient to cover F_H\SO(3). And for the Euler angle, we choose ZYZ parameterization and uniformly generate α and γ within their range [−π, π] and β such that cos β ∈ [−1,1].

In terms of the dispersion, our method is higher than the other two methods: the dispersion of ours is 0.4291, Hopf fibration method is 0.2690, and Euler angle method is 0.3401. However, for the consistency, ours can achieve 0 deviation, while Hopf is 0.0264 and Euler angle is 0.0865. This consistency result is a significant advantage of our method in the sense that the sampling grid always has equi-distance edges. Also since the number of nearest neighbors with minimum distance for our method is always 2, the uniformity is 2. This also outperforms the other two methods, where Hopf fibration has a uniformity of 0.25, and Euler angle is only 0.0303. The results show that our method can be used in uniformly sampling rotations from SO(3).

8.1.2. Computational Time of SO(3) Nearest Neighbor Search

Another key factor for performance evaluation is the running time when searching for the nearest sample for a random rotation. A common and efficient way is using the Euler angle parameterization. Suppose we are given a set of sample points constructed using Euler angles (either with or without cos⁻¹ sampling, or Lattman's diagonalization of the metric tensor, as discussed in the Introduction). Then given an arbitrary rotation, R, one can compute its Euler angles (α_R, β_R, γ_R) and attempt to round off to the nearest Euler angles in the sample set as ([α_R], [β_R], [γ_R]). This is simple and fast to compute, but because Euler angles are not an equi-metric spacing, the resulting rounded rotation matrix [R] = R₃([α_R])R₁([β_R])R₃([γ_R]) may not be the closest of the sampled rotations to R. In contrast, our method is both fast and accurate in the sense of metric round-off.

To illustrate this, we perform a comparison here at an Intel Core i7-4790 CPU at 3.60 GHz × 8 and with Matlab R2018b. For our method, we use the double-coset space H\SO(3)/H, where we have a faster version of the decoding algorithm. Comparisons are performed with the Euler angle searching method (described above). 1000 random rotations are generated and localized to the nearest sample from the sampling list. The accuracy of computing the nearest neighbor is evaluated using the brute-force nearest neighbor search (i.e., minimization of the distance ρ(R, h_ih_j) with respect to both i and j).

The result shows that the proposed decoding algorithm yields an average runtime for each search at around 55.9μs in our method, while the Euler angle method runs 53.6μs per rotation. Both methods are at the same level of efficiency, but ours outperforms in terms of round-off accuracy. Figure 8 shows the minimum distance between the queried rotations (50 of all the testing rotations are shown for a clearer plot) to the set of samples. Ours can always find the true nearest neighbor, while Euler angle sometimes returns the sample which is not the nearest to the queried rotation.

Comparisons of the minimum distance between the queried rotations to the set of samples. The true values are computed using the brute-force nearest neighbor search, which is shown in blue curve. The Euler angle search (left figure) sometimes returns higher values of distance, but ours (right figure) can always give the correct answer.

8.2. Combining the Benefits of Speed and Good Dispersion Properties

Sampling methods such as those based on the Hopf fibration were designed for good performance in terms of minimal dispersion and consequently outperform both Euler angles and our sampling scheme in terms of minimizing dispersion. Ours was designed for rapid query. However, if one wants both, it is possible to simply combine them in a natural way as follows:

(1) Partition any given set of sample rotations with desirable properties (such as dispersion and discrepancy) by determining in which shard each sample point belongs.

(2) Given an arbitrary rotation, determine to which double-coset fundamental domain it belongs.

(3) Compute the distance between the arbitrary given rotation and all sample points in the same shard, and those in the nearest surrounding shards.

In this scheme the number of sample points could be in the same order or even much higher than the number of shards.

We perform a numerical experiment to verify this proposed hybrid searching method using the H\SO(3)/H decomposition. As a preprocessing step, we first compute the nearest neighbors for each of the 60 elements in H, and we find that each rotation is surrounded by 12 neighbors. This step is to construct a connectivity map for the single-coset space or, in other words, for each icosahedron cell. Then we uniformly sample 10000 rotations using Hopf fibration, which is known to have low dispersion and discrepancy. For each sample, we decompose the rotation group via our proposed fast decoding algorithm and precompute a list of index pair, which locates the cell and shard of those samples. Afterwards, we randomly generate 1000 rotations and the goal is to find their closest sampling rotations. For each of the random rotations, we decompose it, locate the cell (determined by the first index), and then calculate the minimum distance with all the samples located in the same cell and the 12 neighboring cells. By using this hybrid method, the running time is around 5 times faster than the brute-force searching method, and the resulting minimum distance is verified to be 100% correct.

Another experiment has also been performed using the H\SO(3)/K decomposition. The difference with the previous test is the preprocess, where here we precompute the nearest neighbors for the 3600 elements, i.e., h_ik_j ∈ H × K where i, j = 1,2,…, 60. This is equivalent to finding the neighbors of each shard in the double-coset space. The same sampling and testing sets are input into the hybrid algorithm and we achieve a speedup of around 20 times than the brute-force method. Also, the resulting minimum distance is verified as 100% accurate.

9. Conclusion

Robot tasks involve continuous motions in space. The quantization of these motions by introducing a class of motion alphabets has been established in this paper. With such an alphabet, continuous motion trajectories can be captured with finite words/sentences. It was demonstrated in some examples how the possibility of constructing fundamental domains for coset and double-coset spaces as Voronoi or Voronoi-like cells can be used to solve this decoding or “signals to symbols” problem efficiently via a coarse-to-fine search scheme. The performance, such as uniformity, of the proposed group discretization method and the fast decoding algorithms are compared with other existing methods. The alphabets developed here will be used in the future to facilitate the connection between advances in artificial intelligence (such as the use of artificial neural networks) and physical robots acting in the world.

Acknowledgments

This work was performed under Office of Naval Research Award N00014-17-1-2142 and National Science Foundation grant CCF-1640970. The authors gratefully acknowledge the supporting agencies.

Disclosure

The findings and opinions expressed here are only those of the authors, and not of the funding agencies.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this article.

References

1.Studdert-Kennedy M. How did language go discrete. Evolutionary Prerequisites for Language. 2005 [Google Scholar]
2.Sgorbissa A. Integrated robot planning, path following, and obstacle avoidance in two and three dimensions: Wheeled robots, underwater vehicles, and multicopters. International Journal of Robotics Research. 2019;38(7):853–876. doi: 10.1177/0278364919846910. [DOI] [Google Scholar]
3.Shiller Z., Gwo Y.-R. Dynamic motion planning of autonomous vehicles. IEEE Transactions on Robotics and Automation. 1991;7(2):241–249. doi: 10.1109/70.75906. [DOI] [Google Scholar]
4.Dolgov D., Thrun S., Montemerlo M., Diebel J. Path planning for autonomous vehicles in unknown semi-structured environments. International Journal of Robotics Research. 2010;29(5):485–501. doi: 10.1177/0278364909359210. [DOI] [Google Scholar]
5.Gandia A., Parascho S., Rust R., Casas G., Gramazio F., Kohler M. Robotic Fabrication in Architecture, Art and Design. Springer; 2018. Towards automatic path planning for robotically assembled spatial structures; pp. 59–73. [Google Scholar]
6.Dogar M., Spielberg A., Baker S., Rus D. Multi-robot grasp planning for sequential assembly operations. Autonomous Robots. 2019;43(3):649–664. doi: 10.1007/s10514-018-9748-z. [DOI] [Google Scholar]
7.Capolupo F., Labourdette P. Receding-horizon trajectory planning algorithm for passively safe on-orbit inspection missions. Journal of Guidance, Control, and Dynamics. 2019;42(5):1–10. doi: 10.2514/1.G003736. [DOI] [Google Scholar]
8.Amato N. M., Song G. Using motion planning to study protein folding pathways. Journal of Computational Biology. 2002;9(2):149–168. doi: 10.1089/10665270252935395. [DOI] [PubMed] [Google Scholar]
9.Jean-Claude L. Robot Motion Planning. Vol. 124. Springer Science & Business Media; 2012. [Google Scholar]
10.Han-Pang H., Shu-Yun C. Dynamic visibility graph for path planning. Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No. 04CH37566); 2004; Sendai, Japan. IEEE; pp. 2813–2818. [DOI] [Google Scholar]
11.Bhattacharya P., Gavrilova M. L., Marina L. Voronoi diagram in optimal path planning. Proceedings of the 4th International Symposium on Voronoi Diagrams in Science and Engineering (ISVD '07); July 2007; IEEE; pp. 38–47. [DOI] [Google Scholar]
12.Lingelbach F. Path planning using probabilistic cell decomposition. Proceedings of the IEEE International Conference on Robotics and Automation, 2004 (ICRA’04); 2004; IEEE; pp. 467–472. [Google Scholar]
13.Kavraki L. E., Jean-Claude L. Probabilistic roadmaps for robot path planning. 1998.
14.Yilmaz A., Shah M. Actions sketch: a novel action representation. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '05); June 2005; pp. 984–989. [DOI] [Google Scholar]
15.Yilmaz A., Shah M. A differential geometric approach to representing the human actions. Computer Vision and Image Understanding. 2008;109(3):335–351. doi: 10.1016/j.cviu.2007.09.006. [DOI] [Google Scholar]
16.Turaga P., Chellappa R., Subrahmanian V. S., Udrea O. Machine recognition of human activities: a survey. IEEE Transactions on Circuits and Systems for Video Technology. 2008;18(11):1473–1488. doi: 10.1109/TCSVT.2008.2005594. [DOI] [Google Scholar]
17.Liu J., Kuipers B., Savarese S. Recognizing human actions by attributes. Proceedings of the Conference on Computer Vision and Pattern Recognition; 2011; pp. 3337–3344. [Google Scholar]
18.Vemulapalli R., Arrate F., Chellappa R. Human action recognition by representing 3D skeletons as points in a lie group. Proceedings of the 27th IEEE Conference on Computer Vision and Pattern Recognition (CVPR '14); June 2014; Columbus, Ohio, USA. pp. 588–595. [DOI] [Google Scholar]
19.Hausman K., Niekum S., Osentoski S., Sukhatme G. S. Active articulation model estimation through interactive perception. Proceedings of the 2015 IEEE International Conference on Robotics and Automation, ICRA 2015; May 2015; USA. pp. 3305–3312. [Google Scholar]
20.Koller D., Friedman N. Probabilistic Graphical Models: Principles and Techniques. Cambridge, MA, USA: MIT Press; 2009. [Google Scholar]
21.Leyton M. A Generative Theory of Shape. Berlin, Heidelberg, Germany: Springer-Verlag; 2003. [Google Scholar]
22.Byravan A., Fox D. SE3-nets: learning rigid body motion using deep neural networks. Proceedings of the 2017 IEEE International Conference on Robotics and Automation, ICRA 2017; June 2017; Singapore. pp. 173–180. [Google Scholar]
23.Soatto S. Actionable information in vision. Proceedings of the 12th International Conference on Computer Vision, ICCV 2009; October 2009; Japan. pp. 2138–2145. [Google Scholar]
24.Li Y., Fermuller C., Aloimonos Y., Ji H. Learning shift-invariant sparse representation of actions. Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2010; June 2010; USA. pp. 2630–2637. [Google Scholar]
25.Zhang L., Zhang B. Quotient Space Based Problem Solving: A Theoretical Foundation of Granular Computing. Burlington, MA, USA: Morgan Kaufmann Publishers Inc; 2014. [Google Scholar]
26.Hartmanis J. Algebraic Structure Theory of Sequential Machines. Englewood Cliffs, NJ, USA: Prentice-Hall; 1966. [Google Scholar]
27.Ginzburg A. Algebraic Theory of Automata. New York, NY, USA: Academic Press; 1968. [Google Scholar]
28.Arbib M. A. Theories of Abstract Automata. Englewood Cliffs, NJ, USA: Prentice-Hall; 1969. [Google Scholar]
29.Minsky M., Papert S. Perceptrons: An Introduction to Computational Geometry. Cambridge, MA, USA: The MIT Press; 1987. [Google Scholar]
30.Liu Y., Collins R. T. Computational model for repeated pattern perception using Frieze and wallpaper groups. Proceedings of the CVPR '2000: IEEE Conference on Computer Vision and Pattern Recognition; June 2000; pp. 537–544. [Google Scholar]
31.Henderson T. C., Fan X., Devnani S., Kumar S., Cohen E., Grant E. UUCS-09-005. University of Utah; 2009. Symmetry as an organizational principle in cognitive sensor networks. [Google Scholar]
32.Kiddon C., Domingos P. Symmetry-based semantic parsing. Proceedings of the 2014 Workshop on Learning Semantics; 2015. [Google Scholar]
33.Baker H. F. Alternants and continuous groups. Proceedings of the London Mathematical Society. 1905;2(1):24–47. doi: 10.1112/plms/s2-3.1.24. [DOI] [Google Scholar]
34.Suprunenko D. A. Matrix Groups. Vol. 45. American Mathematical Society; 1976. [Google Scholar]
35.Birkhoff G., Lane S. M. A Survey of Modern Algebra. AK Peters/CRC Press; 1998. [Google Scholar]
36.Selig J. M. Computational Noncommutative Algebra and Applications. Springer; 2004. Lie groups and Lie algebras in robotics; pp. 101–125. [Google Scholar]
37.Murray R. M., Sastry S. S., Li Z. A Mathematical Introduction to Robotic Manipulation. 1st. Boca Raton, FL, USA: CRC Press; 1994. [Google Scholar]
38.Ma Q., Goh Z., Ruan S., Chirikjian G. S. Probabilistic approaches to the AXB= YCZ calibration problem in multi-robot systems. Autonomous Robots. 2018;42(7):1497–1520. doi: 10.1007/s10514-018-9744-3. [DOI] [Google Scholar]
39.Müller A. Screw and Lie group theory in multibody dynamics: recursive algorithms and equations of motion of tree-topology systems. Multibody System Dynamics. 2018;42(2):219–248. doi: 10.1007/s11044-017-9583-6. [DOI] [Google Scholar]
40.Chirikjian G. S., Mahony R., Ruan S., Trumpf J. Pose changes from a different point of view. Journal of Mechanisms and Robotics. 2018;10(2)021008 [Google Scholar]
41.LeCun Y., Bottou L., Bengio Y., Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE. 1998;86(11):2278–2323. doi: 10.1109/5.726791. [DOI] [Google Scholar]
42.Hinton G. E., Osindero S., Teh Y.-W. A fast learning algorithm for deep belief nets. Neural Computation. 2006;18(7):1527–1554. doi: 10.1162/neco.2006.18.7.1527. [DOI] [PubMed] [Google Scholar]
43.Bengio Y., Lamblin P., Popovici D., Larochelle H. Greedy layer-wise training of deep networks. Proceedings of the 20th Annual Conference on Neural Information Processing Systems (NIPS '06); December 2006; Cambridge, Mass, USA. pp. 153–160. [Google Scholar]
44.Bengio Y. Learning deep architectures for AI. Foundations and Trends in Machine Learning. 2009;2(1):1–127. doi: 10.1561/2200000006. [DOI] [Google Scholar]
45.Tompson J., Jain A., LeCun Y., Bregler C. Joint training of a convolutional network and a graphical model for human pose estimation. Proceedings of the 28th Annual Conference on Neural Information Processing Systems 2014, NIPS 2014; December 2014; Canada. pp. 1799–1807. [Google Scholar]
46.Vejdemo-Johansson M., Pokorny F. T., Skraba P., Kragic D. Cohomological learning of periodic motion. Applicable Algebra in Engineering, Communication and Computing. 2015;26(1-2):5–26. doi: 10.1007/s00200-015-0251-x. [DOI] [Google Scholar]
47.Cohen T. S., Welling M. Group equivariant convolutional networks. Proceedings of the 33rd International Conference on Machine Learning; 2016; pp. 2990–2999. [Google Scholar]
48.Dieleman S., De Fauw J., Kavukcuoglu K. Exploiting cyclic symmetry in convolutional neural networks. Proceedings of the 33rd International Conference on Machine Learning; 2016; pp. 1889–1898. [Google Scholar]
49.Kuffner J. J. Effective sampling and distance metrics for 3D rigid body path planning. Proceedings of the International Conference on Robotics and Automation (ICRA); 2004; pp. 3993–3998. [DOI] [Google Scholar]
50.James A. Graphics Gems III (IBM Version) Elsevier; 1992. Fast random rotation matrices; pp. 117–120. [Google Scholar]
51.Yershova A., LaValle S. M. Deterministic sampling methods for spheres and SO(3). Proceedings of the IEEE International Conference on Robotics and Automation, 2004 (ICRA’04); May 2004; USA. IEEE; pp. 3974–3980. [Google Scholar]
52.Mitchell J. C. Sampling rotation groups by successive orthogonal images. SIAM Journal on Scientific Computing. 2008;30(1):525–547. doi: 10.1137/030601879. [DOI] [Google Scholar]
53.Yershova A., Jain S., Lavalle S. M., Mitchell J. C. Generating uniform incremental grids on SO3 using the hopf fibration. International Journal of Robotics Research. 2010;29(7):801–812. doi: 10.1177/0278364909352700. [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Yan Y., Chirikjian G. S. Almost-uniform sampling of rotations for conformational searches in robotics and structural biology. Proceedings of the 2012 IEEE International Conference on Robotics and Automation, ICRA 2012; May 2012; USA. pp. 4254–4259. [Google Scholar]
55.Yan Y., Chirikjian G. S. Voronoi cells in lie groups and coset decompositions: implications for optimization, integration. Proceedings of the 52nd Conference on Decision and Control; 2013; pp. 1137–1143. [Google Scholar]
56.Blaser R., Fryzlewicz P. Random rotation ensembles. The Journal of Machine Learning Research. 2016;17(1):126–151. [Google Scholar]
57.Shoemake K. Graphics Gems III (IBM Version) Elsevier; 1992. Uniform random rotations; pp. 124–132. [Google Scholar]
58.Lattman E. E. Optimal sampling of the rotation function. Acta Crystallographica Section B: Structural Crystallography and Crystal Chemistry. 1972;28(4):1065–1068. doi: 10.1107/S0567740872003723. [DOI] [Google Scholar]
59.Evans D. J. On the representatation of orientation space. Molecular Physics. 1977;34(2):317–325. [Google Scholar]
60.Morawiec A., Field D. P. Rodrigues parameterization for orientation and misorientation distributions. Philosophical Magazine A. 1996;73(4):1113–1130. [Google Scholar]
61.Chazelle B. The Discrepancy Method: Randomness and Complexity. Cambridge University Press; 2001. [Google Scholar]
62.Matousek J. Geometric Discrepancy: An Illustrated Guide. Vol. 18. Springer Science & Business Media; 2009. [Google Scholar]
63.Chandrajit B., Abhishek B., Chattopadhyay E., Zuckerman D. On low discrepancy samplings in product spaces of motion groups. 2014, https://arxiv.org/abs/1411.7753.
64.Chirikjian G. S., Kyatkin A. B. Harmonic Analysis for Engineers and Applied Scientists. 2nd. New York, NY, USA: Dover Publications; 2016. [Google Scholar]
65.Chirikjian G. S., Sajjadi S., Shiffman B., Zucker S. M. Mathematical aspects of molecular replacement: IV. Measure-theoretic properties of motion spaces. Acta Crystallographica. 2017;73(5):387–402. doi: 10.1107/s2053273317007227. [DOI] [PubMed] [Google Scholar]
66.Postnikov M. M. Three-dimensional spherical forms. Akademiya Nauk SSSR. Trudy Matematicheskogo Instituta Imeni V. A. Steklova. 1991;196:114–146. [Google Scholar]
67.Kirby R. C., Scharlemann M. G. Geometric Topology. Elsevier; 1979. Eight faces of the Poincaré homology 3-sphere; pp. 113–146. [Google Scholar]
68.Janssen T. Crystallographic Groups. New Yory, NY, USA: Elsevier; 1973. [Google Scholar]
69.Hahn T., editor. International Tables for Crystallography, Volume A: Space Group Symmetry. Chester, UK: International Union for Crystallography (IUCr); 2002. [Google Scholar]
70.Aroyo M. I., Perez-Mato J. M., Capillas C., et al. Bilbao crystallographic server: I. databases and crystallographic computing programs. Zeitschrift für Kristallographie. 2006;221(1):15–27. [Google Scholar]
71.Wondratschek H., Müller U., editors. International Tables for Crystallography, Volume A1: Symmetry Relations Between Space Groups. Chester, UK: International Union for Crystallography (IUCr); 2008. [Google Scholar]
72.Hantzsche W., Wendt H. Dreidimensionale euklidische raumformen. Mathematische Annalen. 1935;110(1):593–611. doi: 10.1007/BF01448045. [DOI] [Google Scholar]
73.Charlap L. S. Bieberbach Groups and Flat Manifolds. New York, NY, USA: Springer-Verlag; 1986. [DOI] [Google Scholar]
74.Montesinos J. M. Classical Tessellations and Three-Manifolds. Berlin, Heidelberg, Germany: Springer-Verlag; 1987. [Google Scholar]
75.Wolf J. A. Spaces of Constant Curvature. New York, NY, USA: AMS Chelsea Publishing; 2010. [Google Scholar]
76.Szczepanski A. Geometry of Crystallographic Groups. Singapore: World Scientific; 2012. [Google Scholar]
77.Chirikjian G. S., Ratnayake K., Sajjadi S. Decomposition of Sohncke space groups into products of Bieberbach and symmorphic parts. Zeitschrift fur Kristallographie. 2015;230(12):719–741. [Google Scholar]
78.Jackendoff R. S. Semantics and Cognition. Cambridge, MA, USA: MIT Press; 1985. [Google Scholar]
79.Winograd T., Flores F. Understanding Computers and Cognition: A New Foundation for Design. Norwood, NJ, USA: Ablex Publishing Corporation; 1986. [Google Scholar]
80.Russell S., Norvig P. Artificial Intelligence: A Modern Approach. 3rd. Upper Saddle River, NJ, USA: Pearson Education Inc; 2009. [Google Scholar]
81.Chirikjian G. S. Mathematical aspects of molecular replacement. I. algebraic properties of motion spaces. Acta Crystallographica Section A Foundations of Crystallography. 2011;67(5):435–446. doi: 10.1107/S0108767311021003. [DOI] [PMC free article] [PubMed] [Google Scholar]
82.Chirikjian G. S. Kinematics meets crystallography: the concept of a motion space. Journal of Computing and Information Science in Engineering. 2015;15(1)011012 [Google Scholar]

[B66] 1.Studdert-Kennedy M. How did language go discrete. Evolutionary Prerequisites for Language. 2005 [Google Scholar]

[B62] 2.Sgorbissa A. Integrated robot planning, path following, and obstacle avoidance in two and three dimensions: Wheeled robots, underwater vehicles, and multicopters. International Journal of Robotics Research. 2019;38(7):853–876. doi: 10.1177/0278364919846910. [DOI] [Google Scholar]

[B63] 3.Shiller Z., Gwo Y.-R. Dynamic motion planning of autonomous vehicles. IEEE Transactions on Robotics and Automation. 1991;7(2):241–249. doi: 10.1109/70.75906. [DOI] [Google Scholar]

[B25] 4.Dolgov D., Thrun S., Montemerlo M., Diebel J. Path planning for autonomous vehicles in unknown semi-structured environments. International Journal of Robotics Research. 2010;29(5):485–501. doi: 10.1177/0278364909359210. [DOI] [Google Scholar]

[B27] 5.Gandia A., Parascho S., Rust R., Casas G., Gramazio F., Kohler M. Robotic Fabrication in Architecture, Art and Design. Springer; 2018. Towards automatic path planning for robotically assembled spatial structures; pp. 59–73. [Google Scholar]

[B24] 6.Dogar M., Spielberg A., Baker S., Rus D. Multi-robot grasp planning for sequential assembly operations. Autonomous Robots. 2019;43(3):649–664. doi: 10.1007/s10514-018-9748-z. [DOI] [Google Scholar]

[B13] 7.Capolupo F., Labourdette P. Receding-horizon trajectory planning algorithm for passively safe on-orbit inspection missions. Journal of Guidance, Control, and Dynamics. 2019;42(5):1–10. doi: 10.2514/1.G003736. [DOI] [Google Scholar]

[B1] 8.Amato N. M., Song G. Using motion planning to study protein folding pathways. Journal of Computational Biology. 2002;9(2):149–168. doi: 10.1089/10665270252935395. [DOI] [PubMed] [Google Scholar]

[B43] 9.Jean-Claude L. Robot Motion Planning. Vol. 124. Springer Science & Business Media; 2012. [Google Scholar]

[B35] 10.Han-Pang H., Shu-Yun C. Dynamic visibility graph for path planning. Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No. 04CH37566); 2004; Sendai, Japan. IEEE; pp. 2813–2818. [DOI] [Google Scholar]

[B9] 11.Bhattacharya P., Gavrilova M. L., Marina L. Voronoi diagram in optimal path planning. Proceedings of the 4th International Symposium on Voronoi Diagrams in Science and Engineering (ISVD '07); July 2007; IEEE; pp. 38–47. [DOI] [Google Scholar]

[B48] 12.Lingelbach F. Path planning using probabilistic cell decomposition. Proceedings of the IEEE International Conference on Robotics and Automation, 2004 (ICRA’04); 2004; IEEE; pp. 467–472. [Google Scholar]

[B38] 13.Kavraki L. E., Jean-Claude L. Probabilistic roadmaps for robot path planning. 1998.

[B80] 14.Yilmaz A., Shah M. Actions sketch: a novel action representation. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR '05); June 2005; pp. 984–989. [DOI] [Google Scholar]

[B81] 15.Yilmaz A., Shah M. A differential geometric approach to representing the human actions. Computer Vision and Image Understanding. 2008;109(3):335–351. doi: 10.1016/j.cviu.2007.09.006. [DOI] [Google Scholar]

[B70] 16.Turaga P., Chellappa R., Subrahmanian V. S., Udrea O. Machine recognition of human activities: a survey. IEEE Transactions on Circuits and Systems for Video Technology. 2008;18(11):1473–1488. doi: 10.1109/TCSVT.2008.2005594. [DOI] [Google Scholar]

[B49] 17.Liu J., Kuipers B., Savarese S. Recognizing human actions by attributes. Proceedings of the Conference on Computer Vision and Pattern Recognition; 2011; pp. 3337–3344. [Google Scholar]

[B72] 18.Vemulapalli R., Arrate F., Chellappa R. Human action recognition by representing 3D skeletons as points in a lie group. Proceedings of the 27th IEEE Conference on Computer Vision and Pattern Recognition (CVPR '14); June 2014; Columbus, Ohio, USA. pp. 588–595. [DOI] [Google Scholar]

[B32] 19.Hausman K., Niekum S., Osentoski S., Sukhatme G. S. Active articulation model estimation through interactive perception. Proceedings of the 2015 IEEE International Conference on Robotics and Automation, ICRA 2015; May 2015; USA. pp. 3305–3312. [Google Scholar]

[B41] 20.Koller D., Friedman N. Probabilistic Graphical Models: Principles and Techniques. Cambridge, MA, USA: MIT Press; 2009. [Google Scholar]

[B46] 21.Leyton M. A Generative Theory of Shape. Berlin, Heidelberg, Germany: Springer-Verlag; 2003. [Google Scholar]

[B12] 22.Byravan A., Fox D. SE3-nets: learning rigid body motion using deep neural networks. Proceedings of the 2017 IEEE International Conference on Robotics and Automation, ICRA 2017; June 2017; Singapore. pp. 173–180. [Google Scholar]

[B65] 23.Soatto S. Actionable information in vision. Proceedings of the 12th International Conference on Computer Vision, ICCV 2009; October 2009; Japan. pp. 2138–2145. [Google Scholar]

[B47] 24.Li Y., Fermuller C., Aloimonos Y., Ji H. Learning shift-invariant sparse representation of actions. Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2010; June 2010; USA. pp. 2630–2637. [Google Scholar]

[B82] 25.Zhang L., Zhang B. Quotient Space Based Problem Solving: A Theoretical Foundation of Granular Computing. Burlington, MA, USA: Morgan Kaufmann Publishers Inc; 2014. [Google Scholar]

[B31] 26.Hartmanis J. Algebraic Structure Theory of Sequential Machines. Englewood Cliffs, NJ, USA: Prentice-Hall; 1966. [Google Scholar]

[B28] 27.Ginzburg A. Algebraic Theory of Automata. New York, NY, USA: Academic Press; 1968. [Google Scholar]

[B2] 28.Arbib M. A. Theories of Abstract Automata. Englewood Cliffs, NJ, USA: Prentice-Hall; 1969. [Google Scholar]

[B53] 29.Minsky M., Papert S. Perceptrons: An Introduction to Computational Geometry. Cambridge, MA, USA: The MIT Press; 1987. [Google Scholar]

[B50] 30.Liu Y., Collins R. T. Computational model for repeated pattern perception using Frieze and wallpaper groups. Proceedings of the CVPR '2000: IEEE Conference on Computer Vision and Pattern Recognition; June 2000; pp. 537–544. [Google Scholar]

[B33] 31.Henderson T. C., Fan X., Devnani S., Kumar S., Cohen E., Grant E. UUCS-09-005. University of Utah; 2009. Symmetry as an organizational principle in cognitive sensor networks. [Google Scholar]

[B39] 32.Kiddon C., Domingos P. Symmetry-based semantic parsing. Proceedings of the 2014 Workshop on Learning Semantics; 2015. [Google Scholar]

[B6] 33.Baker H. F. Alternants and continuous groups. Proceedings of the London Mathematical Society. 1905;2(1):24–47. doi: 10.1112/plms/s2-3.1.24. [DOI] [Google Scholar]

[B67] 34.Suprunenko D. A. Matrix Groups. Vol. 45. American Mathematical Society; 1976. [Google Scholar]

[B10] 35.Birkhoff G., Lane S. M. A Survey of Modern Algebra. AK Peters/CRC Press; 1998. [Google Scholar]

[B61] 36.Selig J. M. Computational Noncommutative Algebra and Applications. Springer; 2004. Lie groups and Lie algebras in robotics; pp. 101–125. [Google Scholar]

[B58] 37.Murray R. M., Sastry S. S., Li Z. A Mathematical Introduction to Robotic Manipulation. 1st. Boca Raton, FL, USA: CRC Press; 1994. [Google Scholar]

[B51] 38.Ma Q., Goh Z., Ruan S., Chirikjian G. S. Probabilistic approaches to the AXB= YCZ calibration problem in multi-robot systems. Autonomous Robots. 2018;42(7):1497–1520. doi: 10.1007/s10514-018-9744-3. [DOI] [Google Scholar]

[B57] 39.Müller A. Screw and Lie group theory in multibody dynamics: recursive algorithms and equations of motion of tree-topology systems. Multibody System Dynamics. 2018;42(2):219–248. doi: 10.1007/s11044-017-9583-6. [DOI] [Google Scholar]

[B19] 40.Chirikjian G. S., Mahony R., Ruan S., Trumpf J. Pose changes from a different point of view. Journal of Mechanisms and Robotics. 2018;10(2)021008 [Google Scholar]

[B45] 41.LeCun Y., Bottou L., Bengio Y., Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE. 1998;86(11):2278–2323. doi: 10.1109/5.726791. [DOI] [Google Scholar]

[B34] 42.Hinton G. E., Osindero S., Teh Y.-W. A fast learning algorithm for deep belief nets. Neural Computation. 2006;18(7):1527–1554. doi: 10.1162/neco.2006.18.7.1527. [DOI] [PubMed] [Google Scholar]

[B8] 43.Bengio Y., Lamblin P., Popovici D., Larochelle H. Greedy layer-wise training of deep networks. Proceedings of the 20th Annual Conference on Neural Information Processing Systems (NIPS '06); December 2006; Cambridge, Mass, USA. pp. 153–160. [Google Scholar]

[B7] 44.Bengio Y. Learning deep architectures for AI. Foundations and Trends in Machine Learning. 2009;2(1):1–127. doi: 10.1561/2200000006. [DOI] [Google Scholar]

[B69] 45.Tompson J., Jain A., LeCun Y., Bregler C. Joint training of a convolutional network and a graphical model for human pose estimation. Proceedings of the 28th Annual Conference on Neural Information Processing Systems 2014, NIPS 2014; December 2014; Canada. pp. 1799–1807. [Google Scholar]

[B71] 46.Vejdemo-Johansson M., Pokorny F. T., Skraba P., Kragic D. Cohomological learning of periodic motion. Applicable Algebra in Engineering, Communication and Computing. 2015;26(1-2):5–26. doi: 10.1007/s00200-015-0251-x. [DOI] [Google Scholar]

[B22] 47.Cohen T. S., Welling M. Group equivariant convolutional networks. Proceedings of the 33rd International Conference on Machine Learning; 2016; pp. 2990–2999. [Google Scholar]

[B23] 48.Dieleman S., De Fauw J., Kavukcuoglu K. Exploiting cyclic symmetry in convolutional neural networks. Proceedings of the 33rd International Conference on Machine Learning; 2016; pp. 1889–1898. [Google Scholar]

[B42] 49.Kuffner J. J. Effective sampling and distance metrics for 3D rigid body path planning. Proceedings of the International Conference on Robotics and Automation (ICRA); 2004; pp. 3993–3998. [DOI] [Google Scholar]

[B4] 50.James A. Graphics Gems III (IBM Version) Elsevier; 1992. Fast random rotation matrices; pp. 117–120. [Google Scholar]

[B79] 51.Yershova A., LaValle S. M. Deterministic sampling methods for spheres and SO(3). Proceedings of the IEEE International Conference on Robotics and Automation, 2004 (ICRA’04); May 2004; USA. IEEE; pp. 3974–3980. [Google Scholar]

[B54] 52.Mitchell J. C. Sampling rotation groups by successive orthogonal images. SIAM Journal on Scientific Computing. 2008;30(1):525–547. doi: 10.1137/030601879. [DOI] [Google Scholar]

[B78] 53.Yershova A., Jain S., Lavalle S. M., Mitchell J. C. Generating uniform incremental grids on SO3 using the hopf fibration. International Journal of Robotics Research. 2010;29(7):801–812. doi: 10.1177/0278364909352700. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B76] 54.Yan Y., Chirikjian G. S. Almost-uniform sampling of rotations for conformational searches in robotics and structural biology. Proceedings of the 2012 IEEE International Conference on Robotics and Automation, ICRA 2012; May 2012; USA. pp. 4254–4259. [Google Scholar]

[B77] 55.Yan Y., Chirikjian G. S. Voronoi cells in lie groups and coset decompositions: implications for optimization, integration. Proceedings of the 52nd Conference on Decision and Control; 2013; pp. 1137–1143. [Google Scholar]

[B11] 56.Blaser R., Fryzlewicz P. Random rotation ensembles. The Journal of Machine Learning Research. 2016;17(1):126–151. [Google Scholar]

[B64] 57.Shoemake K. Graphics Gems III (IBM Version) Elsevier; 1992. Uniform random rotations; pp. 124–132. [Google Scholar]

[B44] 58.Lattman E. E. Optimal sampling of the rotation function. Acta Crystallographica Section B: Structural Crystallography and Crystal Chemistry. 1972;28(4):1065–1068. doi: 10.1107/S0567740872003723. [DOI] [Google Scholar]

[B26] 59.Evans D. J. On the representatation of orientation space. Molecular Physics. 1977;34(2):317–325. [Google Scholar]

[B56] 60.Morawiec A., Field D. P. Rodrigues parameterization for orientation and misorientation distributions. Philosophical Magazine A. 1996;73(4):1113–1130. [Google Scholar]

[B15] 61.Chazelle B. The Discrepancy Method: Randomness and Complexity. Cambridge University Press; 2001. [Google Scholar]

[B52] 62.Matousek J. Geometric Discrepancy: An Illustrated Guide. Vol. 18. Springer Science & Business Media; 2009. [Google Scholar]

[B5] 63.Chandrajit B., Abhishek B., Chattopadhyay E., Zuckerman D. On low discrepancy samplings in product spaces of motion groups. 2014, https://arxiv.org/abs/1411.7753.

[B18] 64.Chirikjian G. S., Kyatkin A. B. Harmonic Analysis for Engineers and Applied Scientists. 2nd. New York, NY, USA: Dover Publications; 2016. [Google Scholar]

[B21] 65.Chirikjian G. S., Sajjadi S., Shiffman B., Zucker S. M. Mathematical aspects of molecular replacement: IV. Measure-theoretic properties of motion spaces. Acta Crystallographica. 2017;73(5):387–402. doi: 10.1107/s2053273317007227. [DOI] [PubMed] [Google Scholar]

[B59] 66.Postnikov M. M. Three-dimensional spherical forms. Akademiya Nauk SSSR. Trudy Matematicheskogo Instituta Imeni V. A. Steklova. 1991;196:114–146. [Google Scholar]

[B40] 67.Kirby R. C., Scharlemann M. G. Geometric Topology. Elsevier; 1979. Eight faces of the Poincaré homology 3-sphere; pp. 113–146. [Google Scholar]

[B37] 68.Janssen T. Crystallographic Groups. New Yory, NY, USA: Elsevier; 1973. [Google Scholar]

[B29] 69.Hahn T., editor. International Tables for Crystallography, Volume A: Space Group Symmetry. Chester, UK: International Union for Crystallography (IUCr); 2002. [Google Scholar]

[B3] 70.Aroyo M. I., Perez-Mato J. M., Capillas C., et al. Bilbao crystallographic server: I. databases and crystallographic computing programs. Zeitschrift für Kristallographie. 2006;221(1):15–27. [Google Scholar]

[B75] 71.Wondratschek H., Müller U., editors. International Tables for Crystallography, Volume A1: Symmetry Relations Between Space Groups. Chester, UK: International Union for Crystallography (IUCr); 2008. [Google Scholar]

[B30] 72.Hantzsche W., Wendt H. Dreidimensionale euklidische raumformen. Mathematische Annalen. 1935;110(1):593–611. doi: 10.1007/BF01448045. [DOI] [Google Scholar]

[B14] 73.Charlap L. S. Bieberbach Groups and Flat Manifolds. New York, NY, USA: Springer-Verlag; 1986. [DOI] [Google Scholar]

[B55] 74.Montesinos J. M. Classical Tessellations and Three-Manifolds. Berlin, Heidelberg, Germany: Springer-Verlag; 1987. [Google Scholar]

[B74] 75.Wolf J. A. Spaces of Constant Curvature. New York, NY, USA: AMS Chelsea Publishing; 2010. [Google Scholar]

[B68] 76.Szczepanski A. Geometry of Crystallographic Groups. Singapore: World Scientific; 2012. [Google Scholar]

[B20] 77.Chirikjian G. S., Ratnayake K., Sajjadi S. Decomposition of Sohncke space groups into products of Bieberbach and symmorphic parts. Zeitschrift fur Kristallographie. 2015;230(12):719–741. [Google Scholar]

[B36] 78.Jackendoff R. S. Semantics and Cognition. Cambridge, MA, USA: MIT Press; 1985. [Google Scholar]

[B73] 79.Winograd T., Flores F. Understanding Computers and Cognition: A New Foundation for Design. Norwood, NJ, USA: Ablex Publishing Corporation; 1986. [Google Scholar]

[B60] 80.Russell S., Norvig P. Artificial Intelligence: A Modern Approach. 3rd. Upper Saddle River, NJ, USA: Pearson Education Inc; 2009. [Google Scholar]

[B16] 81.Chirikjian G. S. Mathematical aspects of molecular replacement. I. algebraic properties of motion spaces. Acta Crystallographica Section A Foundations of Crystallography. 2011;67(5):435–446. doi: 10.1107/S0108767311021003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B17] 82.Chirikjian G. S. Kinematics meets crystallography: the concept of a motion space. Journal of Computing and Information Science in Engineering. 2015;15(1)011012 [Google Scholar]

PERMALINK

Quantizing Euclidean Motions via Double-Coset Decomposition

Christian Wülker

Sipu Ruan

Gregory S Chirikjian

Abstract

1. Introduction

2. Literature Review

2.1. Lie Groups in Machine Learning and AI

2.2. Measurements of Sample Quality: Discrepancy, Dispersion, Consistency, and Uniformity

3. Some Relevant Aspects of Group Theory

3.1. Definitions and Properties

3.2. A Concrete Example: The Rotation Group

Figure 1.

Figure 2.

Figure 3.

4. Rigid-Body Motions as a Group Used for Framing Robots

5. Crystallographic Groups

6. A Motion Alphabet Based on a Fine Double-Coset Decomposition

Figure 4.

7. More Alphabets and Coarse-to-Fine Decoding Algorithms

7.1. The Purely Rotational Case

7.2. Planar-Motion Alphabets Based on Wallpaper Groups

Figure 5.

Figure 6.

Figure 7.

8. Comparisons and Applications

8.1. The Accuracy and Speed of Rounding Off Motions

8.1.1. Uniformity of Sampling on SO(3)

8.1.2. Computational Time of SO(3) Nearest Neighbor Search

Figure 8.

8.2. Combining the Benefits of Speed and Good Dispersion Properties

9. Conclusion

Acknowledgments

Disclosure

Conflicts of Interest

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases