Abstract
Zernike polynomials have been widely used to describe the aberrations in wave-front sensing of the eye. The Zernike coefficients are often computed under different aperture sizes. For the sake of comparison, the same aperture diameter is required. Since no standard aperture size is available for reporting the results, it is important to develop a technique for converting the Zernike coefficients obtained from one aperture size to another size. In this paper, by investigating the properties of Zernike polynomials, we propose a general method for establishing the relationship between two sets of Zernike coefficients computed with different aperture sizes.
1. Introduction
In the past decades, interest in wave-front sensing of the human eye has increased rapidly in the field of ophthalmic optics. Several techniques have been developed for measuring the aberrations of the eye.1, 2 In general, these techniques typically represent the aberrations as a wave-front error map at the corneal or pupil plane. Zernike polynomials, due to their properties such as orthogonality and rotational invariance, have been extensively used for fitting corneal surfaces.3–6 Moreover, the lower terms of the Zernike polynomial expansion can be related to known types of aberrations such as defocus, astigmatism, coma, and spherical aberration.7 When the Zernike coefficients are computed, an aperture radius describing the circular area in which the Zernike polynomials are defined must be specified. Such a specification is usually affected by the measurement conditions and by variation in natural aperture size across the human population. Since the Zernike coefficients are often obtained under different aperture sizes, the values of the expansion coefficients can not be directly compared. Unfortunately, this type of comparison is exactly what needs to be done in repeatability and epidemiological studies. To solve this problem, a technique for converting a set of Zernike coefficients from one aperture size to another is required.
Recently, Schwiegerling8 proposed a method to derive the relationship between the sets of Zernike coefficients for two different aperture sizes, but he did not provide a full demonstration for his results. Campbell9 developed an algorithm based on matrix representation to find a new set of Zernike coefficients from an original set when the aperture size is changed. The advantage of Campbell’s method is its easy implementation. In this paper, by investigating the properties of Zernike polynomials, we present a general method for establishing the relationship between two sets of Zernike coefficients computed with different aperture sizes. An explicit and rigorous demonstration of the method is given in detail. It is shown that the results derived from the proposed method are much more simple than those obtained by Schwiegerling, and moreover, our method can be easily implemented.
2. Background
Zernike polynomials have been successfully used in many scientific research fields such as image analysis,10 pattern recognition,11 astronomical telescope.12 Some efficient algorithms for fast computation of Zernike moments defined by Eq. (7) below have also been reported.13–15 Recently, Zernike polynomials have been applied to describe the aberrations in the human eye.1 There are several different representations of Zernike polynomials in the literature. We adopt standard OSA notation. The Zernike polynomial of order n with index m describing the azimuthal frequency of the azimuthal component is defined as
(1) |
where the radial polynomial is given by
(2) |
and is the normalization factor given by
(3) |
Here δm, 0 is the Kronecker symbol.
Eqs. (2) and (3) show that both the radial polynomial and the normalization factor are symmetric about m, i.e., , for m ≥ 0. Thus, for the study of these polynomials, we can only consider the case where m ≥ 0. Let n = m + 2k with k ≥ 0, Eq. (2) can be rewritten as
(4) |
where
(5) |
Since the Zernike polynomials are orthogonal over the unit circle, the polar coordinates (r, θ) must be scaled to the normalized polar coordinates (ρ, θ) by setting ρ = r/rmax, where rmax denotes the maximum radial extent of the wave-front error surface. The wave-front error, W(r, θ) can thus be represented by a finite set of the Zernike polynomials as
(6) |
where N denotes the maximum order used in the representation, and an, m are the Zernike coefficients given by
(7) |
The above equation shows clearly that the coefficients an, m depend on the choice of rmax. This dependence makes it difficult to compare two wave-front error measures obtained under different aperture sizes. To surmount this difficulty, it is necessary to develop a method that is capable to compute the Zernike coefficients for a given aperture size r2 based on the expansion coefficients for a different aperture size r1. Without loss of generality, we assume that r1 takes value 1, and the problem can be formulated as follows.
Assume that the wave-front error can be expressed as
(8) |
where the coefficients an, m are known. The same wave-front error must be represented as
(9) |
where λ is a parameter taking positive value. We need to find the coefficient conversion relationships between two sets of coefficients {bn, m}and {an, m}.
3. Methods and Results
In this section, we propose a general method that allows a new set of Zernike coefficients {bn, m} corresponding to an arbitrary aperture size to be found from an original set of coefficients {an, m}. As indicated by Schwiegerling,8 the new coefficients bn, m depend only on the coefficients an, m that have the same azimuthal frequency m. Thus, we consider a subset of terms in Eq. (8) all of which have the same azimuthal frequency m
(10) |
where K is given by
(11) |
Similarly, the subset of terms in Eq. (9) with the same azimuthal frequency m can be expressed as
(12) |
By equating (10) and (12), the sine and cosine dependence immediately cancels, and this leads to the following relation
(13) |
Note that we have taken into account only the case of m ≥ 0; the case where m < 0 can be treated in a similar manner. Let
(14) |
Eq. (13) can be rewritten as
(15) |
In order to solve Eq. (15), we will use the following basic results.
Lemma 1
Let a function f(r) be expressed as
(16) |
where Pn(r) is a polynomial of order n given by
(17) |
then we have
(18) |
from which CK = (cn, k), with 0 ≤ K ≤ n ≤ K, is a (K + 1) × (K + 1) lower triangular matrix, and DK = (dn, k) is the inverse matrix of CK.
The proof of Lemma 1 is deferred to Appendix A.
We are interested in a special case of Lemma 1 for which each polynomial order n can be expressed as n = m + qk where m and q are given positive integers, k = 0, 1, …, K. The corresponding result is described in the following corollary.
Corollary
Given the positive integer numbers m, q, and K. let be a set of polynomials defined as
(19) |
Let f(r) be a function that can be represented as
(20) |
Then we have
(21) |
from which is the inverse matrix of , both matrices are (K + 1) × (K + 1) lower triangle matrix.
Both Lemma 1 and Corollary are valid for any type of polynomials. In order to apply them, an essential step consists of finding the inverse matrix DK or when the original matrix CK or is known. For the purpose of the paper, we are particularly interested in the use of Zernike polynomials. For the radial polynomials defined by Eq. (4), we have the following proposition.
Proposition 1
For the lower triangular matrix whose elements are defined by Eq. (5), the elements of the inverse matrix are given as follows
(22) |
The proof of Proposition 1 is deferred to Appendix A.
For the normalized radial polynomials defined by Eq. (14), it can be rewritten as
(23) |
where
(24) |
Since the normalization factor depends only on m and k, by using the Proposition 1, we can easily derive the following result without proof.
Proposition 2
For the lower triangular matrix whose elements are defined by Eq. (24), the elements of the inverse matrix are given as follows
(25) |
We are now ready to establish the relationship between the two set of Zernike coefficients {bm, m, b m+2, m, b m+4, m, … bm+2K, m} and {am, m, a m+2, m, a m+4, m, … am+2K, m} appeared in Eq. (13). Applying Corollary to the normalized radial polynomials with q = 2 and using Eqs. (24) and (25), we have
Theorem 1
For given integers m and K, and real positive number λ, let {bm, m, b m+2, m, b m+4, m, … bm+2K, m} and {am, m, am+2, m, a m+4, m, … am+2K, m} be two sets of Zernike coefficients corresponding to the aperture sizes 1 and λ, respectively, we have
(26) |
where
(27) |
The relationship established in Theorem 1 is explicit, and the coefficient bm+2K, m depends only on the set of coefficients {am+2k, m, a m+2(k + 1), m, … am+2K, m} thus, it is more simple than that given by Schwiegerling.8 Note also that even though the above results were demonstrated for the case m ≥ 0, they remain valid for m < 0 due to the symmetry property of the radial polynomials about m.
Table 1 shows the conversion relationship between the coefficients bn, m and an, m for Zernike polynomial expansions up to 45 terms (up to order 8). The results are the same as those given by Schwiegerling8 except for b1, m.
Table 1.
n | m | New expansion coefficients bn,m |
---|---|---|
0 | 0 | |
1 | −1, 1 | |
2 | −2, 0, 2 | |
3 | −3, −1, 1, 3 | |
4 | −4 −2, 0, 2, 4 | |
5 | −5, −3, −1, 1, 3, 5 | |
6 | −6, −4 −2, 0, 2, 4, 6 | |
7 | −7, −5, −3, −1, 1, 3, 5, 7 | |
8 | −8 −6, −4, −2, 0, 2, 4, 6, 8 |
As correctly indicated by Schwiegerling,8 an interesting feature can be observed from Table 1: For a given radial polynomial order n, the conversion from the original to new coefficients have the same form regardless of the azimuthal frequency m. This can be demonstrated as follows.
Theorem 2
Let C(m, k, i) defined by Eq. (27) be the coefficient of am+2i, m in the expansion of bm+2k, m given by Eq. (26), and C(m+2l, k−l, i−l) be the coefficient of am+2i, m+2l in the expansion of bm+2k, m+2l where l is an integer number less than or equal to k, then we have
(28) |
Proof
From Eq. (27), we have
(29) |
Comparing Eqs. (27) and (29), we obtain the result of theorem.
Another interesting feature was also observed which is summarized in the following theorem.
Theorem 3
For a fixed value of N, let N = m + 2K = m′ + 2K′, from Theorem 1, we have
(30) |
and
(31) |
then
(32) |
for i = 0,1,…, l−1, l = 0, l,…, min(K, K′)
Proof
From Eq. (27), we have
(33) |
Similarly,
(34) |
Comparison of Eqs. (33) and (34) shows that Eq. (32) is valid.
Table 2 shows the case of N = m + 2K = 7 for different values of m and K.
Table 2.
m | K | New expansion coefficients bn,m |
---|---|---|
5 | 1 | |
3 | 2 | |
1 | 3 |
4. Conclusion
We have developed a method that is suitable to determine a new set of Zernike coefficients from an original set when the aperture size is changed. An explicit and rigorous demonstration of the proposed approach was given, and some useful features have been observed and proved. The new algorithm allows a fair comparison of aberrations, described in terms of Zernike expansion coefficients that were computed with different aperture sizes. The proposed method is simple, and can be easily implemented.
Note that the formulae derived in this paper are mathematically correct for all values of λ = r1/r2 where r1 and r2 represent the original and new aperture sizes. But for application purpose, it is still recommended to make r2 less than r1. In the case where r2 is greater than r1, the wave-front error data must be extrapolated outside the region of the original fit. It is worth mentioning that such a process could produce erroneous results since the Zernike polynomials are no longer orthogonal in this region and they have high-frequency variations in the peripheries.8
Acknowledgments
This research is supported by the National Natural Science Foundation of China under grant No. 60272045 and Program for New Century Excellent Talents in University under grant No. NCET-04-0477. We would like to thank the anonymous referees for their helpful comments and suggestions.
Appendix A
Proof of Lemma 1
Eq. (16) can be expressed in matrix form as
(A1) |
Using Eq. (17), we have
(A2) |
and
(A3) |
Substitution of Eqs. (A2) and (A3) into (A1) yields
(A4) |
thus
(A5) |
Proof of Proposition 1
To prove the proposition, we need to demonstrate the following relation
(A6) |
For k = l, by using Eqs. (5) and (22), we have
(A7) |
For l < k, we have
(A8) |
where
(A9) |
Let
(A10) |
it can be easily verified that
(A11) |
thus
(A12) |
We deduce from Eq. (A8) that
(A13) |
The proof is now complete.
Note that the proof of Proposition 1 was inspired by a technique proposed by Zeilberger.16
References
- 1.Liang J, Grimm W, Goelz S, Bille JF. Objective measurement of the wave aberrations of the human eye using a Hartmann-Shack wave-front sensor. J Opt Soc Am A. 1994;11:1949–1957. doi: 10.1364/josaa.11.001949. [DOI] [PubMed] [Google Scholar]
- 2.He JC, Marcos S, Webb RH, Burns SA. Measurement of the wave-front aberrations of the eye by a fast psychophysical procedure. J Opt Soc Am A. 1998;15:2449–2456. doi: 10.1364/josaa.15.002449. [DOI] [PubMed] [Google Scholar]
- 3.Carroll JP. A method to describe corneal topography. Optom Vis Sci. 1994;71:259–264. doi: 10.1097/00006324-199404000-00006. [DOI] [PubMed] [Google Scholar]
- 4.Schwiegerling J, Greivenkamp JE, Miller JK. Representation of videokeratoscopic height data with Zernike polynomials. J Opt Soc Am A. 1995;12:2105–2113. doi: 10.1364/josaa.12.002105. [DOI] [PubMed] [Google Scholar]
- 5.Iskander DR, Collins MJ, Davis B. Optimal modeling of corneal surfaces with Zernike polynomials. IEEE Trans Biomed Eng. 2001;48:85–97. doi: 10.1109/10.900255. [DOI] [PubMed] [Google Scholar]
- 6.Sicam VA, Coppens J, van den Berg TP, van der Heijde RL. Corneal surface reconstruction algorithm that uses Zernike polynomial representation. J Opt Soc Am A. 2004;21:1300–1306. doi: 10.1364/josaa.21.001300. [DOI] [PubMed] [Google Scholar]
- 7.Iskander DR, Morelande MR, Collins MJ, Davis B. Modeling of corneal surfaces with radial polynomials. IEEE Trans Biomed Eng. 2002;49:320–328. doi: 10.1109/10.991159. [DOI] [PubMed] [Google Scholar]
- 8.Schwiegerling J. Scaling Zernike expansion coefficients to different pupil sizes. J Opt Soc Am A. 2002;19:1937–1945. doi: 10.1364/josaa.19.001937. [DOI] [PubMed] [Google Scholar]
- 9.Campbell CE. Matrix method to find a new set of Zernike coefficients from an original set when the aperture radius is changed. J Opt Soc Am A. 2003;20:209–217. doi: 10.1364/josaa.20.000209. [DOI] [PubMed] [Google Scholar]
- 10.Teague MR. Image analysis via the general theory of moments. J Opt Soc Am. 1980;70:920–930. [Google Scholar]
- 11.Bailey RR, Srinath M. Orthogonal moment features for use with parametric and non-parametric classifiers. IEEE Trans Pattern Anal Mach Intell. 1996;18:389–400. [Google Scholar]
- 12.Wang JY, Silva DE. Wave-front interpretation with Zernike polynomials. Appl Opt. 1980;19:1510–1518. doi: 10.1364/AO.19.001510. [DOI] [PubMed] [Google Scholar]
- 13.Belkasim SO, Ahmadi M, Shridhar M. Efficient algorithm for fast computation of Zernike moments. J Franklin Inst Eng Appl Math. 1996;333:577–581. [Google Scholar]
- 14.Gu J, Shu HZ, Toumoulin C, Luo LM. A novel algorithm for fast computation of Zernike moments. Pattern Recognit. 2002;35:2905–2911. [Google Scholar]
- 15.Chong CW, Raveendran P, Mukundan R. A comparative analysis of algorithms for fast computation of Zernike moments. Pattern Recognit. 2003;36:731–742. [Google Scholar]
- 16.M. Petkovsek, H. S. Wilf, and D. Zeilberger, A = B (AK Peters, Ltd., 1996). The book is available on line at the University of Pennsylvania.