General method to derive the relationship between two sets of Zernike coefficients corresponding to different aperture sizes

Huazhong Shu; Limin Luo; Guo-Niu Han; Jean-Louis Coatrieux

doi:10.1364/josaa.23.001960

. Author manuscript; available in PMC: 2007 Sep 21.

Published in final edited form as: J Opt Soc Am A Opt Image Sci Vis. 2006 Aug;23(8):1960–1966. doi: 10.1364/josaa.23.001960

General method to derive the relationship between two sets of Zernike coefficients corresponding to different aperture sizes

Huazhong Shu ^1,^2,^*, Limin Luo ^1,², Guo-Niu Han ³, Jean-Louis Coatrieux ^2,⁴

PMCID: PMC1961626 PMID: 16835654

Abstract

Zernike polynomials have been widely used to describe the aberrations in wave-front sensing of the eye. The Zernike coefficients are often computed under different aperture sizes. For the sake of comparison, the same aperture diameter is required. Since no standard aperture size is available for reporting the results, it is important to develop a technique for converting the Zernike coefficients obtained from one aperture size to another size. In this paper, by investigating the properties of Zernike polynomials, we propose a general method for establishing the relationship between two sets of Zernike coefficients computed with different aperture sizes.

1. Introduction

In the past decades, interest in wave-front sensing of the human eye has increased rapidly in the field of ophthalmic optics. Several techniques have been developed for measuring the aberrations of the eye.¹^,² In general, these techniques typically represent the aberrations as a wave-front error map at the corneal or pupil plane. Zernike polynomials, due to their properties such as orthogonality and rotational invariance, have been extensively used for fitting corneal surfaces.³^–⁶ Moreover, the lower terms of the Zernike polynomial expansion can be related to known types of aberrations such as defocus, astigmatism, coma, and spherical aberration.⁷ When the Zernike coefficients are computed, an aperture radius describing the circular area in which the Zernike polynomials are defined must be specified. Such a specification is usually affected by the measurement conditions and by variation in natural aperture size across the human population. Since the Zernike coefficients are often obtained under different aperture sizes, the values of the expansion coefficients can not be directly compared. Unfortunately, this type of comparison is exactly what needs to be done in repeatability and epidemiological studies. To solve this problem, a technique for converting a set of Zernike coefficients from one aperture size to another is required.

Recently, Schwiegerling⁸ proposed a method to derive the relationship between the sets of Zernike coefficients for two different aperture sizes, but he did not provide a full demonstration for his results. Campbell⁹ developed an algorithm based on matrix representation to find a new set of Zernike coefficients from an original set when the aperture size is changed. The advantage of Campbell’s method is its easy implementation. In this paper, by investigating the properties of Zernike polynomials, we present a general method for establishing the relationship between two sets of Zernike coefficients computed with different aperture sizes. An explicit and rigorous demonstration of the method is given in detail. It is shown that the results derived from the proposed method are much more simple than those obtained by Schwiegerling, and moreover, our method can be easily implemented.

2. Background

Zernike polynomials have been successfully used in many scientific research fields such as image analysis,¹⁰ pattern recognition,¹¹ astronomical telescope.¹² Some efficient algorithms for fast computation of Zernike moments defined by Eq. (7) below have also been reported.¹³^–¹⁵ Recently, Zernike polynomials have been applied to describe the aberrations in the human eye.¹ There are several different representations of Zernike polynomials in the literature. We adopt standard OSA notation. The Zernike polynomial of order n with index m describing the azimuthal frequency of the azimuthal component is defined as

Z_{n}^{m} (ρ, θ) = {\begin{matrix} N_{n}^{m} R_{n}^{m} (ρ) cos (m θ) for m \geq 0 \\ - N_{n}^{m} R_{n}^{m} (ρ) sin (m θ) for m < 0 \end{matrix}, ∣ m ∣ \leq n, n - ∣ m ∣ even

(1)

where the radial polynomial $R_{n}^{m} (ρ)$ is given by

R_{n}^{m} (ρ) = Σ_{s = 0}^{(n - ∣ m ∣) / 2} \frac{{(- 1)}^{s} (n - s)!}{s! [(n + ∣ m ∣) / 2 - s]! [(n - ∣ m ∣) / 2 - s]!} ρ^{n - 2 s}

(2)

and $N_{n}^{m}$ is the normalization factor given by

N_{n}^{m} = \sqrt{\frac{2 (n + 1)}{1 + δ_{m, 0}}}

(3)

Here δ_m_{, 0} is the Kronecker symbol.

Eqs. (2) and (3) show that both the radial polynomial $R_{n}^{m} (ρ)$ and the normalization factor $N_{n}^{m}$ are symmetric about m, i.e., $R_{n}^{m} (ρ) = R_{n}^{- m} (ρ), N_{n}^{m} = N_{n}^{- m}$ , for m ≥ 0. Thus, for the study of these polynomials, we can only consider the case where m ≥ 0. Let n = m + 2k with k ≥ 0, Eq. (2) can be rewritten as

\begin{array}{l} R_{m + 2 k}^{m} (ρ) & = Σ_{s = 0}^{k} \frac{{(- 1)}^{s} (m + 2 k - s)!}{s! (k - s)! (m + k - s)!} ρ^{m + 2 k - 2 s} \\ = Σ_{s = k}^{0} \frac{{(- 1)}^{k - s} (m + k + s)!}{s! (k - s)! (m + s)!} ρ^{m + 2 s} (making the change of variable s^{'} = k - s) \\ = Σ_{s = 0}^{k} c_{k, s}^{m} ρ^{m + 2 s} \end{array}

(4)

where

c_{k, s}^{m} = {(- 1)}^{k - s} \frac{(m + k + s)!}{s! (k - s)! (m + s)!}

(5)

Since the Zernike polynomials are orthogonal over the unit circle, the polar coordinates (r, θ) must be scaled to the normalized polar coordinates (ρ, θ) by setting ρ = r/r_max, where r_max denotes the maximum radial extent of the wave-front error surface. The wave-front error, W(r, θ) can thus be represented by a finite set of the Zernike polynomials as

W (r, θ) = Σ_{n = 0}^{N} \underset{m}{Σ} a_{n, m} Z_{n}^{m} (r / r_{max}, θ)

(6)

where N denotes the maximum order used in the representation, and a_n_, _m are the Zernike coefficients given by

a_{n, m} = \int_{0}^{r_{max}} \int_{0}^{2 π} Z_{n}^{m} (r / r_{max}, θ) W (r, θ) rdrd θ

(7)

The above equation shows clearly that the coefficients a_n_, _m depend on the choice of r_max. This dependence makes it difficult to compare two wave-front error measures obtained under different aperture sizes. To surmount this difficulty, it is necessary to develop a method that is capable to compute the Zernike coefficients for a given aperture size r₂ based on the expansion coefficients for a different aperture size r₁. Without loss of generality, we assume that r₁ takes value 1, and the problem can be formulated as follows.

Assume that the wave-front error can be expressed as

W (r, θ) = Σ_{n = 0}^{N} \underset{m}{Σ} a_{n, m} Z_{n}^{m} (r, θ)

(8)

where the coefficients a_n_, _m are known. The same wave-front error must be represented as

W (r, θ) = Σ_{n = 0}^{N} \underset{m}{Σ} b_{n, m} Z_{n}^{m} (λ r, θ)

(9)

where λ is a parameter taking positive value. We need to find the coefficient conversion relationships between two sets of coefficients {b_n_, _m}and {a_n_, _m}.

3. Methods and Results

In this section, we propose a general method that allows a new set of Zernike coefficients {b_n_, _m} corresponding to an arbitrary aperture size to be found from an original set of coefficients {a_n_, _m}. As indicated by Schwiegerling,⁸ the new coefficients b_n_, _m depend only on the coefficients a_n_, _m that have the same azimuthal frequency m. Thus, we consider a subset of terms in Eq. (8) all of which have the same azimuthal frequency m

W_{m} (r, θ) = {\begin{cases} (Σ_{k = 0}^{K} a_{m + 2 k, m} N_{m + 2 k}^{m} R_{m + 2 k}^{m} (r)) cos (m θ), for m \geq 0 \\ - (Σ_{k = 0}^{K} a_{- m + 2 k, m} N_{- m + 2 k}^{- m} R_{- m + 2 k}^{- m} (r)) sin (m θ), for m < 0 \end{cases}

(10)

where K is given by

K = {\begin{cases} (N - ∣ m ∣) / 2, if N and m have the same parity \\ (N - 1 - ∣ m ∣) / 2, otherwise \end{cases}

(11)

Similarly, the subset of terms in Eq. (9) with the same azimuthal frequency m can be expressed as

W_{m} (r, θ) = {\begin{cases} (Σ_{k = 0}^{K} b_{m + 2 k, m} N_{m + 2 k}^{m} R_{m + 2 k}^{m} (λ r)) cos (m θ), for m \geq 0 \\ - (Σ_{k = 0}^{K} b_{- m + 2 k, m} N_{- m + 2 k}^{- m} R_{- m + 2 k}^{- m} (λ r)) sin (m θ), for m < 0 \end{cases}

(12)

By equating (10) and (12), the sine and cosine dependence immediately cancels, and this leads to the following relation

Σ_{k = 0}^{K} b_{m + 2 k, m} N_{m + 2 k}^{m} R_{m + 2 k}^{m} (λ r) = Σ_{k = 0}^{K} a_{m + 2 k, m} N_{m + 2 k}^{m} R_{m + 2 k}^{m} (r)

(13)

Note that we have taken into account only the case of m ≥ 0; the case where m < 0 can be treated in a similar manner. Let

{\bar{R}}_{m + 2 k}^{m} (r) = N_{m + 2 k}^{m} R_{m + 2 k}^{m} (r)

(14)

Eq. (13) can be rewritten as

Σ_{k = 0}^{K} b_{m + 2 k, m} {\bar{R}}_{m + 2 k}^{m} (λ r) = Σ_{k = 0}^{K} a_{m + 2 k, m} {\bar{R}}_{m + 2 k}^{m} (r)

(15)

In order to solve Eq. (15), we will use the following basic results.

Lemma 1

Let a function f(r) be expressed as

f (r) = Σ_{n = 0}^{K} a_{n} P_{n} (r) = Σ_{n = 0}^{K} b_{n} P_{n} (λ r)

(16)

where P_n(r) is a polynomial of order n given by

P_{n} (r) = Σ_{k = 0}^{n} c_{n, k} r^{k}, with c_{n, n} \neq 0,

(17)

then we have

b_{i} = \frac{1}{λ^{i}} [a_{i} + Σ_{n = i + 1}^{K} (Σ_{k = i}^{n} \frac{c_{n, k} d_{k, i}}{λ^{k - i}}) a_{n}], i = 0, 1, 2, \dots, K

(18)

from which C_K = (c_n, _k), with 0 ≤ K ≤ n ≤ K, is a (K + 1) × (K + 1) lower triangular matrix, and D_K = (d_n, _k) is the inverse matrix of C_K.

The proof of Lemma 1 is deferred to Appendix A.

We are interested in a special case of Lemma 1 for which each polynomial order n can be expressed as n = m + qk where m and q are given positive integers, k = 0, 1, …, K. The corresponding result is described in the following corollary.

Corollary

Given the positive integer numbers m, q, and K. let $P_{n}^{m} (r)$ be a set of polynomials defined as

P_{n}^{m} (r) = P_{m + q k}^{m} (r) = Σ_{s = 0}^{k} c_{k, s}^{m} r^{m + q s}, k = 0, 1, 2, \dots, K

(19)

Let f(r) be a function that can be represented as

f (r) = Σ_{k = 0}^{K} a_{m + q k, m} P_{m + q k}^{m} (r) = Σ_{k = 0}^{K} b_{m + q k, m} P_{m + q k}^{m} (λ r)

(20)

Then we have

b_{m + q k} = \frac{1}{λ^{m + q k}} [a_{m + q k} + Σ_{i = k + 1}^{K} (Σ_{j = k}^{i} \frac{c_{i, j}^{m} d_{j, k}^{m}}{λ^{(j - k) q}}) a_{m + q i}], k = 0, 1, 2, \dots, K

(21)

from which $D_{K}^{m} = (d_{i, j}^{m})$ is the inverse matrix of $C_{K}^{m} = (c_{i, j}^{m})$ , both matrices are (K + 1) × (K + 1) lower triangle matrix.

Both Lemma 1 and Corollary are valid for any type of polynomials. In order to apply them, an essential step consists of finding the inverse matrix D_K or $D_{K}^{m}$ when the original matrix C_K or $C_{K}^{m}$ is known. For the purpose of the paper, we are particularly interested in the use of Zernike polynomials. For the radial polynomials $R_{m + 2 k}^{m} (r)$ defined by Eq. (4), we have the following proposition.

Proposition 1

For the lower triangular matrix $C_{K}^{m}$ whose elements $c_{k, s}^{m}$ are defined by Eq. (5), the elements of the inverse matrix $D_{K}^{m}$ are given as follows

d_{k, s}^{m} = \frac{(m + 2 s + 1) k! (m + k)!}{(k - s)! (m + k + s + 1)!}

(22)

The proof of Proposition 1 is deferred to Appendix A.

For the normalized radial polynomials ${\bar{R}}_{m + 2 k}^{m} (r)$ defined by Eq. (14), it can be rewritten as

{\bar{R}}_{m + 2 k}^{m} (r) = N_{m + 2 k}^{m} R_{m + 2 k}^{m} (r) = \sqrt{\frac{2 (m + 2 k + 1)}{1 + δ_{m, 0}}} R_{m + 2 k}^{m} (r) = Σ_{s = 0}^{k} {\bar{c}}_{k, s}^{m} r^{m + 2 s}

(23)

where

{\bar{c}}_{k, s}^{m} = \sqrt{\frac{2 (m + 2 k + 1)}{1 + δ_{m, 0}}} c_{k, s}^{m} = {(- 1)}^{k - s} \sqrt{\frac{2 (m + 2 k + 1)}{1 + δ_{m, 0}}} \frac{(m + k + s)!}{s! (k - s)! (m + s)!}

(24)

Since the normalization factor $N_{m + 2 k}^{m}$ depends only on m and k, by using the Proposition 1, we can easily derive the following result without proof.

Proposition 2

For the lower triangular matrix ${\bar{C}}_{K}^{m}$ whose elements ${\bar{c}}_{k, s}^{m}$ are defined by Eq. (24), the elements of the inverse matrix ${\bar{D}}_{K}^{m}$ are given as follows

{\bar{d}}_{k, s}^{m} = \sqrt{\frac{1 + δ_{m, 0}}{2 (m + 2 s + 1)}} d_{k, s}^{m} = \sqrt{\frac{(1 + δ_{m, 0}) (m + 2 s + 1)}{2}} \frac{k! (m + k)!}{(k - s)! (m + k + s + 1)!}

(25)

We are now ready to establish the relationship between the two set of Zernike coefficients {b_m_, _m, b _m_+2, _m, b _m_+4, _m, … b_m₊₂_K_, _m} and {a_m_, _m, a _m_+2, _m, a _m_+4, _m, … a_m₊₂_K_, _m} appeared in Eq. (13). Applying Corollary to the normalized radial polynomials ${\bar{R}}_{m + 2 k}^{m} (r)$ with q = 2 and using Eqs. (24) and (25), we have

Theorem 1

For given integers m and K, and real positive number λ, let {b_m_, _m, b _m_+2, _m, b _m_+4, _m, … b_m₊₂_K_, _m} and {a_m_, _m, a_m_+2, _m, a _m_+4, _m, … a_m₊₂_K_, _m} be two sets of Zernike coefficients corresponding to the aperture sizes 1 and λ, respectively, we have

\begin{array}{l} b_{m + 2 k, m} & = \frac{1}{λ^{m + 2 k}} [a_{m + 2 k, m} + Σ_{i = k + 1}^{K} (Σ_{j = k}^{i} \frac{{\bar{c}}_{i, j}^{m} {\bar{d}}_{j, k}^{m}}{λ^{2 (j - k)}}) a_{m + 2 i, m}] \\ = \frac{1}{λ^{m + 2 k}} [a_{m + 2 k, m} + Σ_{i = k + 1}^{K} C (m, k, i) a_{m + 2 i, m}] \end{array}, k = 0, 1, \dots, K,

(26)

where

\begin{array}{r} C (m, k, i) = \sqrt{(m + 2 i + 1) (m + 2 k + 1)} Σ_{j = k}^{i} \frac{{(- 1)}^{i - j}}{λ^{2 (j - k)}} \frac{(m + i + j)!}{(i - j)! (j - k)! (m + j + k + 1)!} \\ for i = k + 1, k + 2, \dots, K, \end{array}

(27)

The relationship established in Theorem 1 is explicit, and the coefficient b_m₊₂_K_, _m depends only on the set of coefficients {a_m₊₂_k_, _m, a _m₊₂₍_k _{+ 1),} _m, … a_m₊₂_K_, _m} thus, it is more simple than that given by Schwiegerling.⁸ Note also that even though the above results were demonstrated for the case m ≥ 0, they remain valid for m < 0 due to the symmetry property of the radial polynomials $R_{n}^{m} (r)$ about m.

Table 1 shows the conversion relationship between the coefficients b_n_, _m and a_n_, _m for Zernike polynomial expansions up to 45 terms (up to order 8). The results are the same as those given by Schwiegerling⁸ except for b_1, _m.

Table 1.

Coefficient conversion relationships for Zernike polynomial expansions up to order 8

n	m	New expansion coefficients b_n_,_m
0	0	$\begin{array}{l} b_{0, m} = a_{0, m} - \sqrt{3} (1 - \frac{1}{λ^{2}}) a_{2, m} + \sqrt{5} (1 - \frac{3}{λ^{2}} + \frac{2}{λ^{4}}) a_{4, m} \\ - \sqrt{7} (1 - \frac{6}{λ^{2}} + \frac{10}{λ^{4}} - \frac{5}{λ^{6}}) a_{6, m} + 3 (1 - \frac{10}{λ^{2}} + \frac{30}{λ^{4}} - \frac{35}{λ^{6}} + \frac{14}{λ^{8}}) a_{8, m} \end{array}$
1	−1, 1	$b_{1, m} = \frac{1}{λ} [a_{1, m} - 2 \sqrt{2} (1 - \frac{1}{λ^{2}}) a_{3, m} + \sqrt{3} (3 - \frac{8}{λ^{2}} + \frac{5}{λ^{4}}) a_{5, m} - 4 (2 - \frac{10}{λ^{2}} + \frac{15}{λ^{4}} - \frac{7}{λ^{6}}) a_{7, m}]$
2	−2, 0, 2	$b_{2, m} = \frac{1}{λ^{2}} [a_{2, m} - \sqrt{15} (1 - \frac{1}{λ^{2}}) a_{4, m} + \sqrt{21} (2 - \frac{5}{λ^{2}} + \frac{3}{λ^{4}}) a_{6, m} - \sqrt{3} (10 - \frac{45}{λ^{2}} + \frac{63}{λ^{4}} - \frac{28}{λ^{6}}) a_{8, m}]$
3	−3, −1, 1, 3	$b_{3, m} = \frac{1}{λ^{3}} [a_{3, m} - 2 \sqrt{6} (1 - \frac{1}{λ^{2}}) a_{5, m} + 2 \sqrt{2} (5 - \frac{12}{λ^{2}} + \frac{7}{λ^{4}}) a_{7, m}]$
4	−4 −2, 0, 2, 4	$b_{4, m} = \frac{1}{λ^{4}} [a_{4, m} - \sqrt{35} (1 - \frac{1}{λ^{2}}) a_{6, m} + 3 \sqrt{5} (3 - \frac{7}{λ^{2}} + \frac{4}{λ^{4}}) a_{8, m}]$
5	−5, −3, −1, 1, 3, 5	$b_{5, m} = \frac{1}{λ^{5}} [a_{5, m} - 4 \sqrt{3} (1 - \frac{1}{λ^{2}}) a_{7, m}]$
6	−6, −4 −2, 0, 2, 4, 6	$b_{6, m} = \frac{1}{λ^{6}} [a_{6, m} - 3 \sqrt{7} (1 - \frac{1}{λ^{2}}) a_{8, m}]$
7	−7, −5, −3, −1, 1, 3, 5, 7	$b_{7, m} = \frac{1}{λ^{7}} a_{7, m}$
8	−8 −6, −4, −2, 0, 2, 4, 6, 8	$b_{8, m} = \frac{1}{λ^{8}} a_{8, m}$

Open in a new tab

As correctly indicated by Schwiegerling,⁸ an interesting feature can be observed from Table 1: For a given radial polynomial order n, the conversion from the original to new coefficients have the same form regardless of the azimuthal frequency m. This can be demonstrated as follows.

Theorem 2

Let C(m, k, i) defined by Eq. (27) be the coefficient of a_m₊₂_i_, _m in the expansion of b_m₊₂_k_, _m given by Eq. (26), and C(m+2l, k−l, i−l) be the coefficient of a_m₊₂_i_, _m₊₂_l in the expansion of b_m₊₂_k_, _m₊₂_l where l is an integer number less than or equal to k, then we have

C (m, k, i) = C (m + 2 l, k - l, i - l)

(28)

Proof

From Eq. (27), we have

\begin{array}{l} C (m + 2 l, k - l, i - l) \\ = \sqrt{(m + 2 i + 1) (m + 2 k + 1)} Σ_{j = k - l}^{i - l} \frac{{(- 1)}^{i - l - j}}{λ^{2 (j - k + l)}} \frac{(m + l + i + j)!}{(i - l - j)! (j - k + l)! (m + l + j + k + 1)!} \\ = \sqrt{(m + 2 i + 1) (m + 2 k + 1)} Σ_{j = k}^{i} \frac{{(- 1)}^{i - j}}{λ^{2 (j - k)}} \frac{(m + i + j)!}{(i - j)! (j - k)! (m + j + k + 1)!} \end{array}

(29)

Comparing Eqs. (27) and (29), we obtain the result of theorem.

Another interesting feature was also observed which is summarized in the following theorem.

Theorem 3

For a fixed value of N, let N = m + 2K = m′ + 2K′, from Theorem 1, we have

\begin{array}{l} b_{m + 2 (K - l), m} = \frac{1}{λ^{N - 2 l}} [a_{m + 2 (K - l), m} + Σ_{i = K - l + 1}^{K} C (m, K - l, i) a_{m + 2 i, m}] \\ = \frac{1}{λ^{N - 2 l}} [a_{m + 2 (K - l), m} + Σ_{i = 0}^{l - 1} C (m, K - l, i + K - l + 1) a_{N + 2 i - 2 l + 2, m}] \end{array}, l = 0, 1, \dots, K

(30)

and

\begin{array}{l} b_{m^{'} + 2 (K^{'} - l), m} = \frac{1}{λ^{N - 2 l}} [a_{m^{'} + 2 (K^{'} - l), m^{'}} + Σ_{i = K^{'} - l + 1}^{K^{'}} C (m^{'}, K^{'} - l, i) a_{m^{'} + 2 i, m^{'}}] \\ = \frac{1}{λ^{N - 2 l}} [a_{m^{'} + 2 (K^{'} - l), m^{'}} + Σ_{i = 0}^{l - 1} C (m^{'}, K^{'} - l, i + K^{'} - l + 1) a_{N + 2 i - 2 l + 2, m^{'}}] \end{array}, l = 0, 1, \dots, K^{'}

(31)

then

C (m, K - l, i + K - l + 1) = C (m^{'}, K^{'} - l, i + K^{'} - l + 1)

(32)

for i = 0,1,…, l−1, l = 0, l,…, min(K, K′)

Proof

From Eq. (27), we have

\begin{array}{l} C (m, K - l, i + K - l + 1) \\ = \sqrt{(m + 2 i + 2 K - 2 l + 3) (m + 2 K - 2 l + 1)} \\ \times Σ_{j = K - l}^{i + K - l + 1} \frac{{(- 1)}^{i + K + 1 - l - j}}{λ^{2 (j - K + l)}} \frac{(m + i + K - l + j + 1)!}{(i + K - l + 1 - j)! (j - K + l)! (m + j + K - l + 1)!} \\ = \sqrt{(N + 2 i - 2 l + 3) (N - 2 l + 1)} Σ_{j = 0}^{i + 1} \frac{{(- 1)}^{i + 1 - j}}{λ^{2 j}} \frac{(N + i - 2 l + j + 1)!}{j! (i + 1 - j)! (N + j - l + 1)!} \end{array}

(33)

Similarly,

\begin{array}{l} C (m^{'}, K^{'} - l, i + K^{'} - l + 1) \\ = \sqrt{(m^{'} + 2 i + 2 K^{'} - 2 l + 3) (m + 2 K^{'} - 2 l + 1)} \\ \times Σ_{j = K^{'} - l}^{i + K^{'} - l + 1} \frac{{(- 1)}^{i + K^{'} + 1 - l - j}}{λ^{2 (j - K^{'} + l)}} \frac{(m^{'} + i + K^{'} - l + j + 1)!}{(i + K^{'} - l + 1 - j)! (j - K^{'} + l)! (m^{'} + j + K^{'} - l + 1)!} \\ = \sqrt{(N + 2 i - 2 l + 3) (N - 2 l + 1)} Σ_{j = 0}^{i + 1} \frac{{(- 1)}^{i + 1 - j}}{λ^{2 j}} \frac{(N + i - 2 l + j + 1)!}{j! (i + 1 - j)! (N + j - l + 1)!} \end{array}

(34)

Comparison of Eqs. (33) and (34) shows that Eq. (32) is valid.

Table 2 shows the case of N = m + 2K = 7 for different values of m and K.

Table 2.

Coefficient conversion relationships for different values of m and K where N = m + 2K = 7

m	K	New expansion coefficients b_n_,_m
5	1	$\begin{array}{l} b_{7, 5} = \frac{1}{λ^{7}} a_{7, 5} \\ b_{5, 5} = \frac{1}{λ^{5}} [a_{5, 5} - 4 \sqrt{3} (1 - \frac{1}{λ^{2}}) a_{7, 5}] \end{array}$
3	2	$\begin{array}{l} b_{7, 3} = \frac{1}{λ^{7}} a_{7, 3} \\ b_{5, 3} = \frac{1}{λ^{5}} [a_{5, 3} - 4 \sqrt{3} (1 - \frac{1}{λ^{2}}) a_{7, 3}] \\ b_{3, 3} = \frac{1}{λ^{3}} [a_{3, 3} - 2 \sqrt{6} (1 - \frac{1}{λ^{2}}) a_{5, 3} + 2 \sqrt{2} (5 - \frac{12}{λ^{2}} + \frac{7}{λ^{4}}) a_{7, 3}] \end{array}$
1	3	$\begin{array}{l} b_{7, 1} = \frac{1}{λ^{7}} a_{7, 1} \\ b_{5, 1} = \frac{1}{λ^{5}} [a_{5, 1} - 4 \sqrt{3} (1 - \frac{1}{λ^{2}}) a_{7, 1}] \\ b_{3, 1} = \frac{1}{λ^{3}} [a_{3, 1} - 2 \sqrt{6} (1 - \frac{1}{λ^{2}}) a_{5, 1} + 2 \sqrt{2} (5 - \frac{12}{λ^{2}} + \frac{7}{λ^{4}}) a_{7, 1}] \\ b_{1, 1} = \frac{1}{λ} [a_{1, 1} - 2 \sqrt{2} (1 - \frac{1}{λ^{2}}) a_{3, 1} + \sqrt{3} (3 - \frac{8}{λ^{2}} + \frac{5}{λ^{4}}) a_{5, 1} - 4 (2 - \frac{10}{λ^{2}} + \frac{15}{λ^{4}} - \frac{7}{λ^{6}}) a_{7, 1}] \end{array}$

Open in a new tab

4. Conclusion

We have developed a method that is suitable to determine a new set of Zernike coefficients from an original set when the aperture size is changed. An explicit and rigorous demonstration of the proposed approach was given, and some useful features have been observed and proved. The new algorithm allows a fair comparison of aberrations, described in terms of Zernike expansion coefficients that were computed with different aperture sizes. The proposed method is simple, and can be easily implemented.

Note that the formulae derived in this paper are mathematically correct for all values of λ = r₁/r₂ where r₁ and r₂ represent the original and new aperture sizes. But for application purpose, it is still recommended to make r₂ less than r₁. In the case where r₂ is greater than r₁, the wave-front error data must be extrapolated outside the region of the original fit. It is worth mentioning that such a process could produce erroneous results since the Zernike polynomials are no longer orthogonal in this region and they have high-frequency variations in the peripheries.⁸

Acknowledgments

This research is supported by the National Natural Science Foundation of China under grant No. 60272045 and Program for New Century Excellent Talents in University under grant No. NCET-04-0477. We would like to thank the anonymous referees for their helpful comments and suggestions.

Appendix A

Proof of Lemma 1

Eq. (16) can be expressed in matrix form as

f (r) = (a_{0}, a_{1}, a_{2}, \dots, a_{K}) (\begin{matrix} P_{0} (r) \\ P_{1} (r) \\ P_{2} (r) \\ ⋮ \\ P_{K} (r) \end{matrix}) = (b_{0}, b_{1}, b_{2}, \dots, b_{K}) (\begin{matrix} P_{0} (λ r) \\ P_{1} (λ r) \\ P_{2} (λ r) \\ ⋮ \\ P_{K} (λ r) \end{matrix})

(A1)

Using Eq. (17), we have

(\begin{matrix} P_{0} (r) \\ P_{1} (r) \\ P_{2} (r) \\ ⋮ \\ P_{K} (r) \end{matrix}) = C_{K} (\begin{array}{l} 1 \\ r \\ r^{2} \\ ⋮ \\ r^{K} \end{array})

(A2)

and

(\begin{matrix} P_{0} (λ r) \\ P_{1} (λ r) \\ P_{2} (λ r) \\ ⋮ \\ P_{K} (λ r) \end{matrix}) = C_{K} (\begin{array}{l} 1 \\ λ r \\ λ^{2} r^{2} \\ ⋮ \\ λ^{K} r^{K} \end{array}) = C_{K} diag (1, λ, λ^{2}, \dots, λ^{K}) (\begin{array}{l} 1 \\ r \\ r^{2} \\ ⋮ \\ r^{K} \end{array})

(A3)

Substitution of Eqs. (A2) and (A3) into (A1) yields

(a_{0}, a_{1}, a_{2}, \dots, a_{K}) C_{K} (\begin{array}{l} 1 \\ r \\ r^{2} \\ ⋮ \\ r^{K} \end{array}) = (b_{0}, b_{1}, b_{2}, \dots, b_{K}) C_{K} diag (1, λ, λ^{2}, \dots, λ^{K}) (\begin{array}{l} 1 \\ r \\ r^{2} \\ ⋮ \\ r^{K} \end{array})

(A4)

thus

\begin{array}{l} (b_{0}, b_{1}, b_{2}, \dots, b_{K}) & = (a_{0}, a_{1}, a_{2}, \dots, a_{K}) C_{K} {(diag (1, λ, λ^{2}, \dots, λ^{K}))}^{- 1} C_{K}^{- 1} \\ = (a_{0}, a_{1}, a_{2}, \dots, a_{K}) C_{K} diag (1, λ^{- 1}, λ^{- 2}, \dots, λ^{- K}) D_{K} \end{array}

(A5)

Eq. (18) can be easily obtained by expanding Eq. (A5).

Proof of Proposition 1

To prove the proposition, we need to demonstrate the following relation

Σ_{s = l}^{k} c_{k, s}^{m} d_{s, l}^{m} = δ_{k, l}, 0 \leq l \leq k \leq K

(A6)

For k = l, by using Eqs. (5) and (22), we have

c_{k, k}^{m} d_{k, k}^{m} = \frac{(m + 2 k)!}{k! (m + k)!} \times \frac{(m + 2 k + 1) k! (m + k)!}{(m + 2 k + 1)!} = 1

(A7)

For l < k, we have

\begin{array}{l} Σ_{s = l}^{k} c_{k, s}^{m} d_{s, l}^{m} & = Σ_{s = l}^{k} \frac{{(- 1)}^{k - s} (m + 2 l + 1) (m + k + s)!}{(s - l)! (k - s)! (m + s + l + 1)!} \\ = {(- 1)}^{k} (m + 2 l + 1) Σ_{s = l}^{k} F (m, k, l, s) \end{array}

(A8)

where

F (m, k, l, s) = \frac{{(- 1)}^{s} (m + k + s)!}{(s - l)! (k - s)! (m + s + l + 1)!}

(A9)

Let

G (m, k, l, s) = \frac{{(- 1)}^{s + 1} (m + k + s)!}{(s - l)! (k + 1 - s)! (m + l + s)!} \frac{(k + 1 - s) (s - l)}{(k - l) (m + k + l + 1)}

(A10)

it can be easily verified that

F (m, k, l, s) = G (m, k, l, s + 1) - G (m, k, l, s)

(A11)

thus

Σ_{s = l}^{k} F (m, k, l, s) = Σ_{s = l}^{k} [G (m, k, l, s + 1) - G (m, k, l, s)] = G (m, k, l, k + 1) - G (m, k, l, l) = 0

(A12)

We deduce from Eq. (A8) that

Σ_{s = l}^{k} c_{k, s}^{m} d_{s, l}^{m} = 0 for l < k .

(A13)

The proof is now complete.

Note that the proof of Proposition 1 was inspired by a technique proposed by Zeilberger.¹⁶

References

1.Liang J, Grimm W, Goelz S, Bille JF. Objective measurement of the wave aberrations of the human eye using a Hartmann-Shack wave-front sensor. J Opt Soc Am A. 1994;11:1949–1957. doi: 10.1364/josaa.11.001949. [DOI] [PubMed] [Google Scholar]
2.He JC, Marcos S, Webb RH, Burns SA. Measurement of the wave-front aberrations of the eye by a fast psychophysical procedure. J Opt Soc Am A. 1998;15:2449–2456. doi: 10.1364/josaa.15.002449. [DOI] [PubMed] [Google Scholar]
3.Carroll JP. A method to describe corneal topography. Optom Vis Sci. 1994;71:259–264. doi: 10.1097/00006324-199404000-00006. [DOI] [PubMed] [Google Scholar]
4.Schwiegerling J, Greivenkamp JE, Miller JK. Representation of videokeratoscopic height data with Zernike polynomials. J Opt Soc Am A. 1995;12:2105–2113. doi: 10.1364/josaa.12.002105. [DOI] [PubMed] [Google Scholar]
5.Iskander DR, Collins MJ, Davis B. Optimal modeling of corneal surfaces with Zernike polynomials. IEEE Trans Biomed Eng. 2001;48:85–97. doi: 10.1109/10.900255. [DOI] [PubMed] [Google Scholar]
6.Sicam VA, Coppens J, van den Berg TP, van der Heijde RL. Corneal surface reconstruction algorithm that uses Zernike polynomial representation. J Opt Soc Am A. 2004;21:1300–1306. doi: 10.1364/josaa.21.001300. [DOI] [PubMed] [Google Scholar]
7.Iskander DR, Morelande MR, Collins MJ, Davis B. Modeling of corneal surfaces with radial polynomials. IEEE Trans Biomed Eng. 2002;49:320–328. doi: 10.1109/10.991159. [DOI] [PubMed] [Google Scholar]
8.Schwiegerling J. Scaling Zernike expansion coefficients to different pupil sizes. J Opt Soc Am A. 2002;19:1937–1945. doi: 10.1364/josaa.19.001937. [DOI] [PubMed] [Google Scholar]
9.Campbell CE. Matrix method to find a new set of Zernike coefficients from an original set when the aperture radius is changed. J Opt Soc Am A. 2003;20:209–217. doi: 10.1364/josaa.20.000209. [DOI] [PubMed] [Google Scholar]
10.Teague MR. Image analysis via the general theory of moments. J Opt Soc Am. 1980;70:920–930. [Google Scholar]
11.Bailey RR, Srinath M. Orthogonal moment features for use with parametric and non-parametric classifiers. IEEE Trans Pattern Anal Mach Intell. 1996;18:389–400. [Google Scholar]
12.Wang JY, Silva DE. Wave-front interpretation with Zernike polynomials. Appl Opt. 1980;19:1510–1518. doi: 10.1364/AO.19.001510. [DOI] [PubMed] [Google Scholar]
13.Belkasim SO, Ahmadi M, Shridhar M. Efficient algorithm for fast computation of Zernike moments. J Franklin Inst Eng Appl Math. 1996;333:577–581. [Google Scholar]
14.Gu J, Shu HZ, Toumoulin C, Luo LM. A novel algorithm for fast computation of Zernike moments. Pattern Recognit. 2002;35:2905–2911. [Google Scholar]
15.Chong CW, Raveendran P, Mukundan R. A comparative analysis of algorithms for fast computation of Zernike moments. Pattern Recognit. 2003;36:731–742. [Google Scholar]
16.M. Petkovsek, H. S. Wilf, and D. Zeilberger, A = B (AK Peters, Ltd., 1996). The book is available on line at the University of Pennsylvania.

[R1] 1.Liang J, Grimm W, Goelz S, Bille JF. Objective measurement of the wave aberrations of the human eye using a Hartmann-Shack wave-front sensor. J Opt Soc Am A. 1994;11:1949–1957. doi: 10.1364/josaa.11.001949. [DOI] [PubMed] [Google Scholar]

[R2] 2.He JC, Marcos S, Webb RH, Burns SA. Measurement of the wave-front aberrations of the eye by a fast psychophysical procedure. J Opt Soc Am A. 1998;15:2449–2456. doi: 10.1364/josaa.15.002449. [DOI] [PubMed] [Google Scholar]

[R3] 3.Carroll JP. A method to describe corneal topography. Optom Vis Sci. 1994;71:259–264. doi: 10.1097/00006324-199404000-00006. [DOI] [PubMed] [Google Scholar]

[R4] 4.Schwiegerling J, Greivenkamp JE, Miller JK. Representation of videokeratoscopic height data with Zernike polynomials. J Opt Soc Am A. 1995;12:2105–2113. doi: 10.1364/josaa.12.002105. [DOI] [PubMed] [Google Scholar]

[R5] 5.Iskander DR, Collins MJ, Davis B. Optimal modeling of corneal surfaces with Zernike polynomials. IEEE Trans Biomed Eng. 2001;48:85–97. doi: 10.1109/10.900255. [DOI] [PubMed] [Google Scholar]

[R6] 6.Sicam VA, Coppens J, van den Berg TP, van der Heijde RL. Corneal surface reconstruction algorithm that uses Zernike polynomial representation. J Opt Soc Am A. 2004;21:1300–1306. doi: 10.1364/josaa.21.001300. [DOI] [PubMed] [Google Scholar]

[R7] 7.Iskander DR, Morelande MR, Collins MJ, Davis B. Modeling of corneal surfaces with radial polynomials. IEEE Trans Biomed Eng. 2002;49:320–328. doi: 10.1109/10.991159. [DOI] [PubMed] [Google Scholar]

[R8] 8.Schwiegerling J. Scaling Zernike expansion coefficients to different pupil sizes. J Opt Soc Am A. 2002;19:1937–1945. doi: 10.1364/josaa.19.001937. [DOI] [PubMed] [Google Scholar]

[R9] 9.Campbell CE. Matrix method to find a new set of Zernike coefficients from an original set when the aperture radius is changed. J Opt Soc Am A. 2003;20:209–217. doi: 10.1364/josaa.20.000209. [DOI] [PubMed] [Google Scholar]

[R10] 10.Teague MR. Image analysis via the general theory of moments. J Opt Soc Am. 1980;70:920–930. [Google Scholar]

[R11] 11.Bailey RR, Srinath M. Orthogonal moment features for use with parametric and non-parametric classifiers. IEEE Trans Pattern Anal Mach Intell. 1996;18:389–400. [Google Scholar]

[R12] 12.Wang JY, Silva DE. Wave-front interpretation with Zernike polynomials. Appl Opt. 1980;19:1510–1518. doi: 10.1364/AO.19.001510. [DOI] [PubMed] [Google Scholar]

[R13] 13.Belkasim SO, Ahmadi M, Shridhar M. Efficient algorithm for fast computation of Zernike moments. J Franklin Inst Eng Appl Math. 1996;333:577–581. [Google Scholar]

[R14] 14.Gu J, Shu HZ, Toumoulin C, Luo LM. A novel algorithm for fast computation of Zernike moments. Pattern Recognit. 2002;35:2905–2911. [Google Scholar]

[R15] 15.Chong CW, Raveendran P, Mukundan R. A comparative analysis of algorithms for fast computation of Zernike moments. Pattern Recognit. 2003;36:731–742. [Google Scholar]

[R16] 16.M. Petkovsek, H. S. Wilf, and D. Zeilberger, A = B (AK Peters, Ltd., 1996). The book is available on line at the University of Pennsylvania.

PERMALINK

General method to derive the relationship between two sets of Zernike coefficients corresponding to different aperture sizes

Huazhong Shu

Limin Luo

Guo-Niu Han

Jean-Louis Coatrieux

Abstract

1. Introduction

2. Background

3. Methods and Results

Lemma 1

Corollary

Proposition 1

Proposition 2

Theorem 1

Table 1.

Theorem 2

Proof

Theorem 3

Proof

Table 2.

4. Conclusion

Acknowledgments

Appendix A

Proof of Lemma 1

Proof of Proposition 1

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

General method to derive the relationship between two sets of Zernike coefficients corresponding to different aperture sizes

Huazhong Shu

Limin Luo

Guo-Niu Han

Jean-Louis Coatrieux

Abstract

1. Introduction

2. Background

3. Methods and Results

Lemma 1

Corollary

Proposition 1

Proposition 2

Theorem 1

Table 1.

Theorem 2

Proof

Theorem 3

Proof

Table 2.

4. Conclusion

Acknowledgments

Appendix A

Proof of Lemma 1

Proof of Proposition 1

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases