This last paper of this series provides a survey of the topics the moments have addressed in image processing at large. Our introductive papers dealt with the theoretical expressions of orthogonal and non-orthogonal basis functions [1], with important properties like geometric invariance, robustness to noise or blurring [2] and the computational accuracy and complexity [3]. Here, most of the steps going from image reconstruction, segmentation, motion tracking to pattern recognition are examined through moment-based techniques. Though they have not an equal importance in this wide spectrum of applications, they can offer sound solutions. Rather than entering into methodological details and examples, this paper is intended to identify the main contributions brought by moments.
Featuring Image analysis
The multiple applications of digital images lead to segment the field into many areas, each having their own journals and conferences. This diversity makes more difficult to have a full view on the on-going research and may hide the general interest of a given method. It must be first pointed out that the physical process underlying the image formation is of major importance for the all subsequent processing. If not properly dealt with, most of the information later extracted can be biased. The reconstruction problem, so important in medical imaging, can be dealt with purely mathematical views but the capability to model and control the acquisition device should open new pathways. Conversely, the search for generic algorithms, a major trend in image analysis, is attractive: the efficiency of such approach has been demonstrated through many examples. Assumptions are always required however and they must be adapted to the situations under study. Thus, a full apprehension of the field has to take into account the 4-tuple, “observations (or data), objects (properties), goals (analysis objectives), methods (algorithms)”.
Multidimensional observations are more and more available in medicine (volume (3-D) images, 2-D+t and 3-D+t image sequences, with scalar or vectorial descriptions). The spatial and temporal sampling is here critical according to the task to be carried out. The huge data sets they provide (hundreds of slices or 2-D projections) call for challenging time computations. The objects of interest are varying in shape among individuals and with disorders, may change and deform over time, have poor contrast. Moreover, they are not reducible to simple geometric patterns. These difficulties may be compensated by introducing some, but limited, interactive pointing or delineation. The ultimate goal (omitting the image formation) is to provide the user with relevant features for clinical decision and therapeutic guidance. It includes parameters for quantifying the lesion extent and discriminating them, locating the abnormal patterns in anatomical space, or characterizing the behavior in space and time. The methods that may bring answers to these issues are many. The classical paradigms like low vs high level, data-driven vs model-driven, salient feature and primitive grouping, edge and region cooperation, local vs global analysis, image understanding through semantic labeling and spatial relation modeling, etc. are still of value in most of the problems to solve. Non-stationary noise and artifacts, low contrast, object occlusions and deformations, variability in observation conditions, etc. lead to recurrent challenges in computer vision when robustness, accuracy, completeness must be achieved under real-time constraints.
The main applications
Reconstruction
This issue has received much attention in the medical imaging literature. A number of imaging modalities are concerned from X-ray computerized tomography and emission tomography up to acoustic and optical techniques. They all bring different insights in the human body either morphological or functional. Recent works deal with reconstruction from only limited range projections with the aim to estimate the projections at unknown views such as the moment-based approaches reported by Salzman [4] and Goncharev [5]. Milanfar et al. [6] presented a variational framework for the tomographic reconstruction of an image from the maximum likelihood estimates of its orthogonal moments. Basu and Bresler [7, 8] investigated the problem of recovering the view angles from the projection data by means of moment method. Rouze et al. [9] derived a relation between Zernike moments and the Radon transform, and then applied it to reconstruction. By establishing a relationship between the image geometric moments and projection moments, Wang and Sze [10] presented an approach to reconstruct the CT images from limited range projections. Shu et al. [11] extended Wang’s method using the orthogonal Legendre moments to improve the quality of the reconstructed image. The advantage of Shu’s method is that the orthogonal moments have simple inverse transform, thus, the image can be more easily obtained from the orthogonal moments. Moreover, the geometric moments, especially at high order, are sensitive to noise and digitization error. However, both methods were based on the use of continuous moments. When applied to 2-D digital images, the double integrals are usually approximated by discrete summations that lead to numerical errors in the computed moments. The discrete orthogonal moments, in particular Tchebichef moments, do not require any discrete approximation for numerical implementation and moreover the discrete orthogonal Tchebichef polynomials are the simplest among all the discrete orthogonal polynomials with a definition domain ideally suited for square images [12]. However, the traditional Radon transform is no longer applicable, and a discrete version of Radon transform is required. Beylkin [13] described a discrete Radon transform (DRT) to map a set of sampled image points onto a set of discrete projections. The algebraic mapping for this transformation can be computed exactly, since no interpolation of the data is required. Matus and Flusser [14] developed a group theoretic and Fourier based approach to the DRT. Svalbe [15] and Kingston [16] derived improved versions of DRT to handle both continuous projections and discrete projections in Fourier and Radon space. The advantage of the techniques reported in [15] and [16] is that they require a straightforward 1-D linear interpolation, a simple sorting of projection samples and no pre-processing of the projection data. The purely mathematical views provided by moments for the reconstruction problem have a drawback: the difficulties they have to deal with the physical characteristics inherent to any image capture.
Pattern Recognition
This topic represents certainly the major applications of moments and a survey of the all literature is out of reach in a short paper. The classical views regarding pattern recognition rely on the ability to extract the objects of interest, on the choice of their representation and the final decision step, e.g. allocation to classes. These classes may be formed beforehand through a learning phase. Many techniques are available to manage that either by contour-based or region-based features with local or global descriptions. They can operate in the image or in the transform domains. They include kernel methods (like Support Vector Machine), neural networks, rigid or elastic matching, string coding and matching, etc. They all require a distance measure to estimate the similarity between two shapes. The flexibility of moments in dealing with binary or grey level images, polygonal and region representations on one hand and, in the other hand, the invariance properties discussed in [2] make them particularly attractive for a number of applications. They offer compact and robust descriptions and bring statistical cues on pixel (or voxel) value distributions: in some cases (the low orders) they have physical interpretations.
Invariance properties are central for pattern recognition as discussed in [2] [17]. Many attempts have been reported using moments for aircraft recognition [18], identification of building and bridge [19], partially occluded object localization [20], flower shape analysis [21], image retrieval [22], character recognition [23–27]. Bailey and Srinath [28] investigated invariant character recognition using Legendre, Zernike and pseudo-Zernike moments with different classifiers. Khotanzad and Hong [29] has shown that a neural network classifier using Zernike moments has a strong class separability power. The evolution towards images with higher dimensions (3-D and n-D) has also motivated a lot of theoretical works [30–36].
Beyond shape features, texture patterns are also of interest in order to recognize objects. Less moment-based works have been devoted to texture analysis [37, 38]. The attempts made for tissue characterization did not however show a significant breakthrough: this can be due to the texture complexity, resulting from mixture of structural tissue properties, partial volume effects, noise and reconstruction artefacts.
Miscellaneous
We briefly review here some additional topics where the moments have been called for but with limited impact. As pointed out above, nearly every system is initialized by detection and segmentation tasks. Despite the recent achievements provided by deformable models, level set and graph-cut methods, these problems remain open in medical applications and require some user interaction. Edge detection is mainly carried out by derivative operators while integrative schemes may offer a sound alternative as shown in 2-D with geometric moments [39] or Zernike moments [40]. Geometric and Legendre moments have also been proposed for 3-D surface detection assuming a local planar patch [41] or a parabolic model [42]. Image registration or alignment is another active area. Most approaches rely on establishing the feature correspondence between a reference image and the target image, a task that can be dealt with moments [43–50]. Motion estimation is another major area in image analysis [51]. Optical flow methods based on differential-geometric descriptors and matching techniques (or correlation) have been widely explored. Joint motion-tracking approaches combining different types of information (boundary-region) and different sources (gradient, intensity, motion) in scenes with multiple objects should provide new perspectives. Moments have not received much attention in this area and the few attempts made were essentially focused on optical flow methods using Zernike moments [52, 53].
Conclusion
Moment-based methods are still marginal in image processing and it is our hope that this series will motivate new research on this topic and open new applications especially in medical imaging. These techniques cover, as we pointed out, a wide spectrum of applications and deserve to be confronted to other attractive and competitive methods. They should also be combined in order to take advantage of their basic features, in particular the invariance properties and the continuously improvements in computational complexity.
References
- 1.Shu HZ, Luo LM, Coatrieux JL. Moment-based approaches in image, Part 1: basic features. IEEE Eng Med Biol Mag. 2007;26(5):70–75. doi: 10.1109/emb.2007.906026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Shu HZ, Luo LM, Coatrieux JL. Moment-based approaches in image, Part 2: invariance. IEEE Eng Med Biol Mag. 2008;27(1):81–83. doi: 10.1109/MEMB.2007.911462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Shu HZ, Luo LM, Coatrieux JL. Moment-based approaches in image, Part 3: computational considerations. IEEE Eng Med Biol Mag. 2008;27(3):89–91. doi: 10.1109/MEMB.2008.918690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Salzman D. A method of general moments for orienting 2D projections of unknown 3D objects. Comput Vis Graph Image Process. 1990;50(2):129–156. [Google Scholar]
- 5.Goncharev AB. Methods of integral geometry and recovering a function with compact support from its projections in unknown directions. Acta Appl Math. 1988;11(3):213–222. [Google Scholar]
- 6.Milanfar PM, Karl WC, Willsky AS. A moment-based variational approach to tomographic reconstruction. IEEE Trans Image Process. 1996;5(3):459–470. doi: 10.1109/83.491319. [DOI] [PubMed] [Google Scholar]
- 7.Basu S, Bresler Y. Uniqueness of tomography with unknown view angles. IEEE Trans Image Process. 2000;9(6):1094–1106. doi: 10.1109/83.846251. [DOI] [PubMed] [Google Scholar]
- 8.Basu S, Bresler Y. Feasibility of tomography with unknown view angles. IEEE Trans Image Process. 2000;9(6):1107–1122. doi: 10.1109/83.846252. [DOI] [PubMed] [Google Scholar]
- 9.Rouze NC, Soon VC, Hutchins GD. On the connection between the Zernike moments and Radon transform of an image. Pattern Recognit Lett. 2006;27(6):636–642. [Google Scholar]
- 10.Wang TJ, Sze TW. The image moment method for the limited range CT image reconstruction and pattern recognition. Pattern Recognit. 2001;34(11):2145–2154. [Google Scholar]
- 11.Shu HZ, Zhou J, Han GN, Luo LM, Coatrieux JL. Image reconstruction from limited range projections using orthogonal moments. Pattern Recognit. 2007;40(2):670–680. [Google Scholar]
- 12.Dai XB, Shu HZ, Luo LM, Han GN, Coatrieux JL. Reconstruction of tomographic image from limited range projections using discrete Radon transform and Tchebichef moments. IEEE Trans Image Process [Google Scholar]
- 13.Beylkin G. Discrete Radon transform. IEEE Trans Acoustics Speech Signal Process. 1987;35(2):162–172. [Google Scholar]
- 14.Matus F, Flusser J. Image representation via a finite radon transform. IEEE Trans Pattern Anal Machine Intell. 1993;15(10):996–1106. [Google Scholar]
- 15.Svalbe I, van der Spek Dimitri. Reconstruction of tomographic images using analog projections and the digital radon transform. Linear Algebra and Its Applications. 2001;339(1):125–145. [Google Scholar]
- 16.Kingston A, Svalbe I. Mapping between digital and continuous projections via the discrete radon transform in Fourier space. DICTA’03; Sydney: Dec, 2003. pp. 1–10. [Google Scholar]
- 17.Hu M. Visual pattern recognition by moment invariant. IRE Trans Inform Theory. 1962;8(2):179–187. [Google Scholar]
- 18.Dudani SA, Breeding SK, McGhee R. Aircraft identification by moment invariants. IEEE Trans Comput. 1977;26(1):39–45. [Google Scholar]
- 19.Gilmore JF, Boyd WW. Building and bredge classification by invariant moments. Proc SPIE. 1981;292:256–263. [Google Scholar]
- 20.Sluzek A. Using moment invariants to recognize and locate partially occluded 2D objects. Pattern Recognit Lett. 1988;7(4):253–257. [Google Scholar]
- 21.Miao ZJ. Zernike moment-based image shape analysis and its application. Pattern Recognit Lett. 2000;21(2):169–177. [Google Scholar]
- 22.Kim WY, Kim YS. A region-based shape descriptor using Zernike moments. Signal Processing: Image Communication. 2000;16(1–2):95–102. [Google Scholar]
- 23.Fang M, Hausler G. Class of transforms invariant under shift, rotation, and scaling. Appl Opt. 1990;29(5):704–708. doi: 10.1364/AO.29.000704. [DOI] [PubMed] [Google Scholar]
- 24.Reiss TH. The revised fundamental theorem of moment invariants. IEEE Trans Pattern Anal Mach Intell. 1991;13(8):830–834. [Google Scholar]
- 25.Shen L, Sheng Y. Noncentral image moments for invariant pattern recognition. Opt Eng. 1995;34(11):3181–3186. [Google Scholar]
- 26.Gruhen M, Hsu KY. Moment-based image normalization with high noise tolerance. IEEE Trans Pattern Anal Machine Intell. 1997;19(2):136–139. [Google Scholar]
- 27.Kan C, Srinath MD. Invariant character recognition with Zernike and orthogonal Fourier-Mellin moments. Pattern Recognit. 2002;35(1):143–154. [Google Scholar]
- 28.Bailey RR, Srinath MD. Orthogonal moment features for use with parametric and non-parametric classifiers. IEEE Trans Pattern Anal Machine Intell. 1996;18(4):389–399. [Google Scholar]
- 29.Khotanzad A, Hong YH. Classification of invariant image representations using a neural network. IEEE Trans Acoust Speech Signal Process. 1990;38(6):1028–1938. [Google Scholar]
- 30.Sadjadi FA, Hall EL. Three-dimensional moment invariants. IEEE Trans Pattern Anal Machine Intell. 1980;2(2):127–136. doi: 10.1109/tpami.1980.4766990. [DOI] [PubMed] [Google Scholar]
- 31.Lo CH, Don HS. 3-D moment forms: their construction and application to object identification and positioning. IEEE Trans Pattern Anal Machine Intell. 1989;11(10):1053–1064. [Google Scholar]
- 32.Canterakis N. 3D Zernike moments and Zernike affine invariants for 3D image analysis and recognition. Proc of 11th Scandinavian Conference on Image Analysis. 1999:85–93. [Google Scholar]
- 33.Novotni M, Klein R. Shape retrieval using 3D Zernike descriptors. Comput Aided Design. 2004;36(11):1047–1062. [Google Scholar]
- 34.Mamistvalov AG. N-dimensional moment invariants and conceptual mathematical theory of recognition n-dimensional solids. IEEE Trans Pattern Anal Machine Intell. 1998;20(8):819–831. [Google Scholar]
- 35.Flusser J, Boldys J, Zivova B. Moment forms invariant to rotation and blur in arbitrary number of dimensions. IEEE Trans Pattern Anal Machine Intell. 2003;25(2):234–246. [Google Scholar]
- 36.Xu D, Li H. Geometric moment invariants. Pattern Recognit. 2008;41(1):240–249. [Google Scholar]
- 37.Tuceryan M. Moment based texture segmentation. Pattern Recognit Lett. 1994;15(7):659–667. [Google Scholar]
- 38.Wang J, Healey G. Using Zernike moments for the illumination and geometry invariant classification of multispectral texture. IEEE Trans Image Process. 1998;7(2):196–203. doi: 10.1109/83.660996. [DOI] [PubMed] [Google Scholar]
- 39.Luo LM, Xie XH, Bao XD. A modified moment-based edge operator for rectangular pixel image. IEEE Trans Circ Sys Video Tech. 1994;4(6):552–554. [Google Scholar]
- 40.Ghosal S, Mehrotra R. Orthogonal moment operators for subpixel edge detection. Pattern Recognit. 1993;26(2):295–306. [Google Scholar]
- 41.Luo LM, Hamitouche C, Dillenseger JL, Coatrieux JL. A moment-based three-dimensional edge operator. IEEE Trans Biomed Eng. 1993;40(7):693–703. doi: 10.1109/10.237699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Zhang H, Shu HZ, Luo LM, Dillenseger JL. A Legendre orthogonal moment based 3D edge operator. Science in China Ser G. 2005;48(1):1–13. [Google Scholar]
- 43.Wong RY, Hall EL. Scene matching with invariant moments. Comput Vis Graph Image Process. 1978;8(1):16–24. [Google Scholar]
- 44.Yang XH, Birkfellner W, Niederer P. A similarity based on Tchebichef moments for 2D/3D medical image registration. International Congress Series. 2004;1268:153–158. [Google Scholar]
- 45.Liu J, Zhang TX. Matching and normalization of affine deformed image from regular moments. Pattern Recognit Lett. 2004;25(14):1619–1631. [Google Scholar]
- 46.Heikkila J. Pattern matching with affine moment descriptors. Pattern Recognit. 2004;37(9):1825–1834. [Google Scholar]
- 47.Yang Z, Cohen FS. Cross-weighted moments and affine invariants for image registration and matching. IEEE Trans Pattern Anal Machine Intell. 1999;21(8):804–814. [Google Scholar]
- 48.Rothe I, Susse H, Voss K. The method of normalization to determine invariants. IEEE Trans Pattern Anal Machine Intell. 1996;18(4):366–376. [Google Scholar]
- 49.Dong L, Boyer AL. A portal image alignment and patient setup verification procedure using moments and correlation techniques. Phys Med Biol. 1996;41(4):697–723. doi: 10.1088/0031-9155/41/4/008. [DOI] [PubMed] [Google Scholar]
- 50.Shu HZ, Ge Y, Luo LM, Wang JW, Yu WX, Bao XD. An orthogonal moment-based method for automatic verification of radiation field shape. Phys Med Biol. 2000;45(10):2897–2911. doi: 10.1088/0031-9155/45/10/312. [DOI] [PubMed] [Google Scholar]
- 51.Coatrieux JL. Shape and function from motion in biomedical imaging: part 3. IEEE Eng Med Biol Mag. 2007;26(3):81–83. doi: 10.1109/MEMB.2007.364938. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Ghosal S, Mehrotra R. Robust optical flow estimation using semi-invariant local features. Pattern Recognit. 1997;30(2):229–237. [Google Scholar]
- 53.Gu J, Luo LM, Shu HZ. A local optimization algorithm for computing 2D optical flow field of heart echocardiograph image using Zernike moments. Acta Electronic Sinica. 2001;29(3):350–353. [Google Scholar]