Improved direction estimation for Di Zenzo's multichannel image gradient operator

doi:10.1016/j.patcog.2012.06.003

Pattern Recognition

Volume 45, Issue 12, December 2012, Pages 4300-4311

https://doi.org/10.1016/j.patcog.2012.06.003

Abstract

Gradient estimation is one of the most important tasks in image and video processing. For multichannel images, a classical and widely used gradient method is Di Zenzo's gradient operator, which is based on a measure of the squared local contrast variation of multichannel images. However, the indetermination of Di Zenzo's gradient direction has not yet been satisfactorily resolved, which has led to errors in most subsequent studies that use Di Zenzo's vector gradient. This paper resolves the problem completely and also analyzes the ranges of values that the gradient angle should take in various cases. As an application in color image processing, a color version of the Canny edge detector is implemented by introducing the new gradient estimator into the traditional grayscale Canny operator. The experimental results indicate that the improved Di Zenzo gradient operator is currently one of the best color gradient estimators and outperforms other state-of-the-art color image gradient methods. The improved multichannel gradient operator not only provides accurate gradient estimation but is also efficient and easy to implement.

Highlights

► Method of solving the indetermination of Di Zenzo's multichannel image gradient directions.
► Analysis of the gradient angle range in various cases.
► New color Canny edge operator.

Introduction

The gradient [1] is widely used in image processing and computer vision, in applications such as edge detection [2], [3], [4], [5], [6], image segmentation [7], [8], corner detection [9], image fusion [10], image recognition [11], face detection [12], and object tracking [8], [13]. For example, edge detection can be implemented by thresholding gradient magnitudes or by locating local maxima of the gradient magnitude, and object tracking and recognition can be performed by matching the gradient directions, together with the gradient magnitudes, of the pixels on the edges of the model object and of the candidate object.

The gradient associated with an image pixel is usually defined as a 2-D column vector whose angle gives the direction of the largest growth of the image function. For grayscale images, numerous gradient estimators have been developed. However, for multichannel (multidimensional) images, which are usually described as vector fields [14], this issue has not received enough attention. From a general point of view, multidimensional gradient estimators can be divided into three major categories. The first type is characterized by a single estimate of the orientation and strength of an edge at a point [15]. The first such method was proposed by Robinson [16], who computed 24 directional derivatives (8 neighbors per color channel) and chose the one with the largest magnitude as the color gradient. Later, Ruzon and Tomasi [15] used a color distribution to represent a neighborhood and implemented color edge, junction, and corner detection. Their method first divides the current processing window in half with a line segment, computes a color distribution for each half, and then calculates the distance between the two distributions. This process is repeated using line segments with different orientations, and the orientation with maximum strength is taken as the orientation of the edge. The maximum strength and the direction normal to the corresponding orientation are then regarded as the gradient magnitude and direction.

The second category of multichannel gradient methods is based on grayscale image gradient estimators. These operators calculate the gradient vectors of the individual channels and then combine them to produce the final gradient vectors. Depending on the combination mechanism, the resultant gradient can be the vector sum of the channel gradient vectors, the RMS (root mean square) or the maximum of the channel gradient magnitudes, or some other combination. However, as pointed out in [17], these component-wise methods are unsatisfactory in some cases because the image channels do not actually cooperate with one another.
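To make the combination rules above concrete, the following minimal Python sketch combines per-channel gradient vectors at a single pixel using the vector sum, the RMS of the channel magnitudes, and the maximum channel magnitude. The function name combine_channel_gradients and the sample values are illustrative assumptions and do not come from the paper. The sample pixel has two channels with opposite gradients, which is one situation in which a component-wise rule can behave poorly.

```python
import math

def combine_channel_gradients(grads, mode="rms"):
    """Combine per-channel gradients at one pixel into a single magnitude.

    grads is a list of (gx, gy) tuples, one per channel (hypothetical layout).
    """
    if mode == "sum":
        # Vector sum: add the channel gradient vectors, then take the length.
        gx = sum(g[0] for g in grads)
        gy = sum(g[1] for g in grads)
        return math.hypot(gx, gy)
    mags = [math.hypot(gx, gy) for gx, gy in grads]
    if mode == "rms":
        # Root mean square of the channel gradient magnitudes.
        return math.sqrt(sum(m * m for m in mags) / len(mags))
    if mode == "max":
        # Largest channel gradient magnitude.
        return max(mags)
    raise ValueError("unknown mode: " + mode)

# Illustrative pixel: channels 1 and 2 change in opposite directions.
sample_grads = [(1.0, 0.0), (-1.0, 0.0), (0.0, 0.0)]
sum_mag = combine_channel_gradients(sample_grads, "sum")
rms_mag = combine_channel_gradients(sample_grads, "rms")
max_mag = combine_channel_gradients(sample_grads, "max")
print(sum_mag, rms_mag, max_mag)
```

In this example the vector sum is zero even though two channels each contain a clear change, whereas the RMS and maximum rules still report a nonzero magnitude but carry no direction information.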

The third type of multidimensional gradient estimator is based on finding the maximum change of the image vectors. The simplest of these defines the resultant gradient as a vector whose magnitude is the maximum of the Euclidean distances between the central pixel vector and its eight neighboring pixel vectors, and whose direction is that of the maximum change [18]. Di Zenzo [17] proposed a classical and efficient multichannel gradient operator based on a measure of the squared local contrast variation of multichannel images. Scharcanski and Venetsanopoulos developed a gradient method for color edge detection based on local vector statistics [19], in which the differences between the average color vectors of the samples inside sub-windows in the horizontal and vertical directions are used to estimate the maximum variation of a color image at each pixel position.
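As an illustration of the simplest estimator in this category, the sketch below scans the eight neighbors of a pixel and returns the largest Euclidean distance to the central pixel vector together with the offset at which it occurs. The function name max_distance_gradient, the nested-list image layout, and the tie-breaking rule (first maximum found wins) are assumptions made for this example; the method in [18] may differ in such details.

```python
import math

def max_distance_gradient(img, r, c):
    """Return (magnitude, (drow, dcol)) of the largest change around img[r][c].

    img is a 2-D list of pixel vectors (tuples); (r, c) must be an interior pixel.
    """
    center = img[r][c]
    best_mag, best_offset = 0.0, (0, 0)
    for dr in (-1, 0, 1):
        for dc in (-1, 0, 1):
            if dr == 0 and dc == 0:
                continue  # skip the central pixel itself
            d = math.dist(center, img[r + dr][c + dc])
            if d > best_mag:
                best_mag, best_offset = d, (dr, dc)
    return best_mag, best_offset

# Illustrative 3x3 RGB patch: only the right-hand neighbor differs from the center.
black = (0.0, 0.0, 0.0)
patch = [[black, black, black],
         [black, black, (3.0, 4.0, 0.0)],
         [black, black, black]]
magnitude, offset = max_distance_gradient(patch, 1, 1)
print(magnitude, offset)
```

Here the magnitude is the distance between (0, 0, 0) and (3, 4, 0), and the offset points to the right-hand neighbor. Note that math.dist requires Python 3.8 or later.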

More recently, Nezhadarya and Ward proposed a new color image gradient operator [20]. This method first applies highpass and lowpass vector operators in an appropriate manner in both the horizontal and vertical directions, where the highpass operator serves as a vector difference estimator and the lowpass operator performs noise smoothing. An aggregation operator is then applied in each direction to find the corresponding partial derivative.

Among the multidimensional gradient estimators mentioned above, perhaps the most classical and widely used is Di Zenzo's multichannel gradient operator [17]. However, Di Zenzo did not solve the problem of the indetermination of the gradient direction. Although some researchers [21], [22] have studied Di Zenzo's vector gradient further, to date this problem has not been satisfactorily resolved, which has led to errors in most of the later studies (see Section 2) that reference Di Zenzo's vector gradient. This paper solves the problem completely and also analyzes the gradient angle ranges in various cases.

As an application in image processing, we apply the new multichannel gradient operator to color image edge detection, since the gradient is closely related to edge detection and image segmentation. In color images, edges can be defined as meaningful discontinuities of image functions in vector fields [23], [24]. Color edge detection techniques [23], [24], [25], [26], [27] can be roughly divided into two classes: monochromatic-based techniques, which first detect edges in the individual color channels separately and then combine the component results to form the color edges, and vector-valued techniques, which treat color pixels as color vectors in a vector space in order to detect abrupt changes. Vector approaches are generally preferred to component-wise techniques owing to the vector nature of color images and the strong spectral correlation that exists between color channels. Vector approaches mainly include the first- [17] and second-order [21], [22] derivative methods, which are based on color vector gradients, the directional-vector-based difference methods (also called directional operators) [19], the methods based on vector order statistics [28], [29], the difference vector operators [23], [25], and other methods such as morphological gradient approaches [30], vector entropy methods [31], [32], density estimation methods [33], [34], and methods based on physics models [35] and on principal axis analysis and moment preservation [36].

The remainder of this paper is organized as follows. In Section 2, Di Zenzo's vector gradient and the related work are reviewed, and the proposed mechanism for solving the ambiguity in Di Zenzo's gradient angle is described in detail. Section 3 gives an application for color edge detection by applying the new gradient operator to the traditional grayscale image Canny operator. Finally, conclusions are drawn in Section 4.

Section snippets

Di Zenzo's gradient operator and the proposed method

In this section, we first briefly introduce Di Zenzo's multidimensional gradient method and the related studies, and analyze the existing problem in Di Zenzo's gradient operator and its variations. We then give the solution to this problem, which is stated as a theorem. The proof of the theorem is presented in Appendix A.
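For readers unfamiliar with the operator, the sketch below follows the formulation of Di Zenzo's gradient as it is commonly presented in textbooks: the quantities gxx, gyy, and gxy are sums over channels of products of partial derivatives, and the squared contrast in direction theta is gxx cos^2(theta) + 2 gxy sin(theta) cos(theta) + gyy sin^2(theta). The single-argument arctangent form of the angle yields only an extremum, which may be the minimum, and this is one way the direction ambiguity shows up. The two-argument arctangent used in atan2_angle is a common way to select the maximizing angle; it is shown purely for illustration and is not claimed to be the theorem of this paper, whose statement is not reproduced in this excerpt. All function names and sample values are assumptions.

```python
import math

def structure_terms(grads):
    """Sum derivative products over channels; grads is a list of (gx, gy)."""
    gxx = sum(gx * gx for gx, gy in grads)
    gyy = sum(gy * gy for gx, gy in grads)
    gxy = sum(gx * gy for gx, gy in grads)
    return gxx, gyy, gxy

def contrast(theta, gxx, gyy, gxy):
    """Squared local contrast variation in direction theta."""
    c, s = math.cos(theta), math.sin(theta)
    return gxx * c * c + 2.0 * gxy * c * s + gyy * s * s

def naive_angle(gxx, gyy, gxy):
    """Single-argument arctangent form; undefined (division by zero) if gxx == gyy."""
    return 0.5 * math.atan(2.0 * gxy / (gxx - gyy))

def atan2_angle(gxx, gyy, gxy):
    """Two-argument form: maximizes the contrast whenever the direction is defined."""
    return 0.5 * math.atan2(2.0 * gxy, gxx - gyy)

# Illustrative pixel: all three channels change only in the vertical direction.
sample = [(0.0, 1.0), (0.0, 2.0), (0.0, 0.5)]
gxx, gyy, gxy = structure_terms(sample)
f_naive = contrast(naive_angle(gxx, gyy, gxy), gxx, gyy, gxy)
f_atan2 = contrast(atan2_angle(gxx, gyy, gxy), gxx, gyy, gxy)
print(f_naive, f_atan2)
```

For this pixel the single-argument form returns the horizontal direction, where the contrast is zero, while the two-argument form returns the vertical direction, where the contrast equals gyy. Even with the two-argument form the angle is defined only modulo pi, and it is undefined when gxx equals gyy and gxy is zero.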

Application in color edge detection

As mentioned in the Introduction, the gradient is closely related to edge detection and image segmentation. In this section, we therefore apply the proposed multichannel gradient operator to color image edge detection. For edge detection, perhaps the best way to evaluate the accuracy of image gradients is the Canny edge detector [2], because the Canny operator uses both gradient magnitudes and directions to implement non-maximal suppression, which is the key step and significantly affects the
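The role of the gradient direction in non-maximal suppression can be sketched as follows. This is a generic, simplified version of the step (four quantized directions, no interpolation) and not the implementation used in the paper; the function name non_max_suppression, the nested-list layout, and the angle convention (radians measured from the column axis, rows growing downward) are assumptions for this example.

```python
import math

def non_max_suppression(mag, ang):
    """Keep a pixel only if it is not smaller than both neighbors along its gradient.

    mag and ang are 2-D lists of equal size; border pixels are set to zero.
    """
    rows, cols = len(mag), len(mag[0])
    out = [[0.0] * cols for _ in range(rows)]
    # Neighbor offsets (drow, dcol) for directions 0, 45, 90, and 135 degrees.
    offsets = [(0, 1), (1, 1), (1, 0), (1, -1)]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            # The orientation is taken modulo pi and rounded to the nearest sector.
            sector = int(round((ang[r][c] % math.pi) / (math.pi / 4))) % 4
            dr, dc = offsets[sector]
            m = mag[r][c]
            if m >= mag[r + dr][c + dc] and m >= mag[r - dr][c - dc]:
                out[r][c] = m
    return out

# Illustrative vertical edge: the magnitude peaks in the middle column.
edge_mag = [[0.0, 1.0, 3.0, 1.0, 0.0] for _ in range(3)]
correct_ang = [[0.0] * 5 for _ in range(3)]        # gradient across the edge
wrong_ang = [[math.pi / 2] * 5 for _ in range(3)]  # gradient along the edge
thin = non_max_suppression(edge_mag, correct_ang)
thick = non_max_suppression(edge_mag, wrong_ang)
print(thin[1], thick[1])
```

With the correct direction only the peak column survives, whereas a direction that is off by 90 degrees leaves the weaker flanking pixels in place, which illustrates why an edge detector of this kind is sensitive to errors in the gradient angle.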

Conclusions

Estimating the gradients of multichannel images is an important issue in multichannel image processing. A widely used gradient method for multichannel images is Di Zenzo's vector gradient operator. However, the uncertainty of Di Zenzo's gradient direction has not been well resolved to date, which has led to errors in much of the published literature that references Di Zenzo's vector gradient. In this paper, by analyzing the squared contrast variation function, we obtain the solution

Acknowledgements

This study was supported by the National Natural Science Foundation of China under Grant No. 60972098. The authors would like to thank the reviewers for their valuable comments, which helped to improve the paper, Dr. Ehsan Nezhadarya for providing the program for the RCMG-Median–Mean gradient operator, and the authors of the Compass operator for making their code available on the web.

Lianghai Jin received the BS and MS degrees in computer science from Central South University (China) in 1988 and Beijing Jiaotong University (China) in 2002, and the PhD degree in pattern recognition and intelligent systems from Huazhong University of Science and Technology (China) in 2008, where he is now an associate professor with the School of Computer Science and Technology. From 1988 to 1999, he was with a railway institute in China as an engineer and senior engineer, respectively. His

References (42)

O.A. Zuniga et al.
Gradient threshold selection using the facet model
Pattern Recognition
(1988)
D. Sen et al.
Gradient histogram: thresholding in a region of interest for edge detection
Image and Vision Computing
(2010)
D. Xiao et al.
A region and gradient based active contour model and its application in boundary tracking on anal canal ultrasound images
Pattern Recognition
(2007)
X. Zhang et al.
Corner detection based on gradient correlation matrices of planar curves
Pattern Recognition
(2010)
M. Shi et al.
Handwritten numeral recognition using gradient and curvature of gray scale image
Pattern Recognition
(2002)
L.-L. Huang et al.
Gradient feature extraction for classification-based face detection
Pattern Recognition
(2003)
D.W. Paglieroni et al.
Resolution analysis for gradient direction matching of object model edges to overhead images
Computer Vision and Image Understanding
(2009)
S. Di Zenzo
A note on the gradient of a multi-image
Computer Vision, Graphics, and Image Processing
(1986)
A. Cumani
Edge detection in multispectral images
Graphical Models and Image Processing
(1991)
A. Shiozaki
Edge extraction using entropy operator
Computer Vision, Graphics, and Image Processing
(1986)
A. Ortiz et al.
Analysis of colour channel coupling from a physics-based viewpoint: application to colour edge detection
Pattern Recognition
(2010)
S.-C. Cheng et al.
Subpixel edge detection of color images by principal axis analysis and moment-preserving principle
Pattern Recognition
(2005)
K. Bowyer et al.
Edge detector evaluation using empirical ROC curves
Computer Vision and Image Understanding
(2001)
W.K. Pratt
Digital Image Processing
(2001)
J.F. Canny
A computational approach to edge detection
IEEE Transactions on Pattern Analysis and Machine Intelligence
(1986)
D. Marr et al.
Theory of edge detection
Proceedings of the Royal Society of London B: Biological Sciences
(1980)
R.M. Haralick
Digital step edges from zero crossing of second directional derivatives
IEEE Transactions on Pattern Analysis and Machine Intelligence
(1984)
P.R. Hill et al.
Image segmentation using a texture gradient based watershed transform
IEEE Transactions on Image Processing
(2003)
V.S. Petrovic et al.
Gradient-based multiresolution image fusion
IEEE Transactions on Image Processing
(2004)
R. Machuca et al.
Applications of vector fields to image processing
IEEE Transactions on Pattern Analysis and Machine Intelligence
(1983)
M.A. Ruzon et al.
Edge, junction, and corner detection using color distributions
IEEE Transactions on Pattern Analysis and Machine Intelligence
(2001)

Hong Liu received the BS and MS degrees in optoelectronic engineering and computer science from Huazhong University of Science and Technology (China) in 1984 and 1995, respectively, and the PhD degree in electronic engineering from Teesside University (United Kingdom) in 2000. She is currently an associate professor with the School of Computer Science and Technology, Huazhong University of Science and Technology. Her research interests include image processing and computer networking.

Xiangyang Xu received the BS, MS, and PhD degrees in computer science from Huazhong University of Science and Technology (China) in 1998, 1991, and 2010, respectively. He is currently an associate professor with the School of Computer Science and Technology, Huazhong University of Science and Technology. His research interests include image processing and analysis.

Enmin Song received the PhD degree in electronic engineering from Teesside University (United Kingdom) in 1999. He is currently a professor with the School of Computer Science and Technology, Huazhong University of Science and Technology, China. His research interests include image processing and algorithm analysis.
