Elsevier

Pattern Recognition

Volume 45, Issue 12, December 2012, Pages 4300-4311
Pattern Recognition

Improved direction estimation for Di Zenzo's multichannel image gradient operator

https://doi.org/10.1016/j.patcog.2012.06.003Get rights and content

Abstract

Gradient estimation is one of the most important tasks in image/video processing. For multichannel images, a classical and widely-used gradient method is Di Zenzo's gradient operator, which is based on the measure of squared local contrast variation of multichannel images. However, up to now, the indetermination of Di Zenzo's gradient direction has not been well solved, which results in errors occurring in most of the subsequent studies in which Di Zenzo's vector gradient is used. In this paper, this problem is solved thoroughly. Furthermore, the ranges of the values that the gradient angle should take in various cases are also analyzed. As an application in color image processing, a color version of Canny edge detector is implemented by introducing the new gradient estimator to the traditional grayscale image Canny operator. The experimental results indicate that the improved Di Zenzo's gradient operator is currently one of the best color gradient estimators and outperforms other state-of-the-art color image gradient methods. The improved multichannel gradient operator not only provides accurate gradient estimation but also is efficient and easy to implement.

Highlights

► Method of solving the indetermination of Di Zenzo's multichannel image gradient directions. ► Analysis of the gradient angle range in various cases. ► New color Canny edge operator.

Introduction

Gradient [1] has been widely used in image processing and computer vision in applications such as edge detection [2], [3], [4], [5], [6], image segmentation [7], [8], corner detection [9], image fusion [10], image recognition [11], face detection [12], and object tracking [8], [13]. For example, edge detection can be implemented by thresholding gradient magnitudes or by locating local maximum values of gradient magnitudes, and object tracking and recognition can be obtained by matching the gradient directions and at the same time using the gradient magnitudes of the pixels on the model object edges and candidate object edges.

The gradient associated with an image pixel is usually defined as a 2-D column vector, in which the vectorial angle denotes the direction of the largest growth of the image function. For grayscale images, numerous gradient estimators have been developed. However, for multichannel (multidimensional) images, which are usually described as vector fields [14], this issue has not received enough attention. From a general point of view, multidimensional gradient estimators can be divided into three major categories. The first type is characterized by a single estimate of the orientation and strength of an edge at a point [15]. The first such method is proposed by Robinson [16], who computed 24 directional derivatives (8 neighbors per color channel) and chose the one with the largest magnitude as the color gradient. Later, Ruzon and Tomasi [15] utilized a color distribution to represent a neighborhood and implemented color edge, junction, and corner detection. Their method first divides the current processing window in half with a line segment and computes a color distribution for each half, and then calculates the distance between the two distributions. This process is repeated using line segments with different orientations and the one with maximum strength is assumed as the orientation of the edge. Thus, the maximum strength and the direction normal to the corresponding orientation are regarded as the gradient magnitude and direction.

The second category of multichannel gradient methods is based on grayscale image gradient estimators. These operators calculate the gradient vectors for individual channels and then combine them to produce the final gradient vectors. According to different combination mechanisms, the resultant gradient can be the vector sum of the gradient vectors of individual channels, or the RMS (root mean square), or the maximum of the channel gradient magnitudes, or other mechanisms. However, these component-wise methods, as pointed out in [17], are unsatisfactory in some cases since in these methods the image channels do not actually cooperate with one another.

The third type of multidimensional gradient estimators is based on finding the maximum changes of image vectors. Among them, the simplest one is to define the resultant gradient as the vector in which the magnitude is the maximum of the Euclidean distances between the central pixel vector and its eight neighboring pixel vectors and the direction is estimated from the direction of the maximum change [18]. Di Zenzo [17] proposed a classical and efficient multichannel gradient operator, which is based on the measure of squared local contrast variation of multichannel images. Scharcanski and Venetsanopoulos developed a local vector statistics based gradient method for color edge detection [19], in which they used the differences between the average color vectors of the samples inside the sub-windows in horizontal and vertical directions to estimate the maximum variation of a color image in each pixel position.

More recently, Nezhadarya and Ward proposed a new color image gradient operator [20]. This method first applies highpass and lowpass vector operators in an appropriate manner in both horizontal and vertical directions, where the highpass and lowpass operators are respectively used as vector difference estimate and noise smoothing. Then, an aggregation operator is performed on each direction to find the corresponding partial derivative.

Among these multidimensional gradient estimators mentioned above, perhaps the most classical and widely-used one is Di Zenzo's multichannel gradient operator [17]. However, Di Zenzo did not solve the problem of indetermination of the gradient direction. Although some researchers [21], [22] have made further studies on Di Zenzo's vector gradient, to date this problem has not been well solved, which results in errors occurring in most of the later studies (see Section 2) in which Di Zenzo's vector gradient is referenced. This paper will solve this problem thoroughly and at the same time analyze the gradient angle ranges in various cases.

As an application in image processing, we apply the new multichannel gradient operator to color image edge detection, since gradient is closely related to edge detection and image segmentation. In color images, edges can be defined as meaningful discontinuities of image functions in vector fields [23], [24]. Color edge detection techniques [23], [24], [25], [26], [27] can be roughly divided into two classes: monochromatic-based techniques that first detect edges in individual color channels separately and then combine the component results to be the color edges, and vector-valued techniques that treat color pixels as color vectors in a vector space to detect the abrupt changes. Vector approaches are generally preferred to component-wise techniques owing to the vector nature of color images and the strong spectral correlation that exists between color channels. Vector approaches mainly include the first- [17] and second- [21], [22] order derivative methods which are based on color vector gradients, the directional vectors based difference methods (or called directional operators) [19], the methods based on vector order statistics [28], [29], the difference vector operators [23], [25], and other methods such as morphological gradient approaches [30], vector entropy methods [31], [32], density estimation methods [33], [34], and methods based on physics models [35] and principal axis analysis and moment-preserving [36].

The remainder of this paper is organized as follows. In Section 2, Di Zenzo's vector gradient and the related work are reviewed, and the proposed mechanism for solving the ambiguity in Di Zenzo's gradient angle is described in detail. Section 3 gives an application for color edge detection by applying the new gradient operator to the traditional grayscale image Canny operator. Finally, conclusions are drawn in Section 4.

Section snippets

Di Zenzo's gradient operator and the proposed method

In this section, we first briefly introduce Di Zenzo's multidimensional gradient method and the related studies, and analyze the existing problem in Di Zenzo's gradient operator and its variations. Then, we give the solution which is described by a theorem to the existing problem. The proof of the theorem is presented in Appendix A.

Application in color edge detection

As mentioned in the introductory section, gradient is closely related to edge detection and image segmentation. So, in this section, we apply the proposed multichannel gradient operator to color image edge detection. For edge detection, perhaps the best method to evaluate the accuracy of image gradients is Canny edge detector [2]. Because in Canny operator, both gradient magnitudes and directions are used to implement non-maximal suppression, which is the key step and significantly affects the

Conclusions

Estimating gradients of multichannel images is an important issue for multichannel image processing. A currently widely-used gradient method for multichannel images is Di Zenzo's vector gradient operator. However, the uncertainty of Di Zenzo's gradient direction has not well solved to date, which results in errors occurring in many published literature in which Di Zenzo's vector gradient is referenced. In this paper, by analyzing the squared contrast variation function, we obtain the solution

Acknowledgements

This study was supported by the National Natural Science Foundation of China under Grant No. 60972098. The authors would like to thank the reviewers for their valuable comments which help to improve the paper, Dr. Ehsan Nezhadarya for providing the program for RCMG-Median–Mean gradient operator, and the authors of Compass operator for putting the code on the webpage.

Lianghai Jin received the BS and MS degrees in computer science from Central South University (China) in 1988 and Beijing Jiaotong University (China) in 2002, and the PhD degree in pattern recognition and intelligent systems from Huazhong University of Science and Technology (China) in 2008, where he is now an associate professor with the School of Computer Science and Technology. From 1988 to 1999, he was with a railway institute in China as an engineer and senior engineer, respectively. His

References (42)

  • A. Ortiz et al.

    Analysis of colour channel coupling from a physics-based viewpoint: application to colour edge detection

    Pattern Recognition

    (2010)
  • S.-C. Cheng et al.

    Subpixel edge detection of color images by principal axis analysis and moment-preserving principle

    Pattern Recognition

    (2005)
  • K. Bowyer et al.

    Edge detector evaluation using empirical ROC curves

    Computer Vision and Image Understanding

    (2001)
  • W.K. Pratt

    Digital Image Processing

    (2001)
  • J.F. Canny

    A computational approach to edge detection, IEEE

    Transactions on Pattern Analysis and Machine Intelligence

    (1986)
  • D. Marr et al.

    Theory of Edge Detection, Proceedings of the Royal Society of London B

    Biological Sciences

    (1980)
  • R.M. Haralick

    Digital step edges from zero crossing of second directional derivatives

    IEEE Transactions on Pattern Analysis and Machine Intelligence

    (1984)
  • P.R. Hill et al.

    Image segmentation using a texture gradient based watershed transform

    IEEE Transactions on Image Processing

    (2003)
  • V.S. Petrovic et al.

    Gradient-based multiresolution image fusion

    IEEE Transactions on Image Processing

    (2004)
  • R. Machuca et al.

    Applications of vector fields to image processing, IEE

    Transactions on Pattern Analysis and Machine Intelligence

    (1983)
  • M.A. Ruzon et al.

    Edge, junction, and corner detection using color distributions

    IEEE Transactions on Pattern Analysis and Machine Intelligence

    (2001)
  • Cited by (28)

    • Multimodal medical image fusion based on joint bilateral filter and local gradient energy

      2021, Information Sciences
      Citation Excerpt :

      In the last few years, the structure tensor has emerged as a useful tool for local gradient feature analysis. It has been widely applied to many low-level image processing tasks [31–36], such as biomedical image analysis [31], image registration [32], hyper-spectral image denoising [33], image regularization [34], and multifocus image fusion [35]. In this section, a multimodal medical image fusion method is proposed and discussed it in detail.

    • Multi-focus image fusion based on nonsubsampled contourlet transform and residual removal

      2021, Signal Processing
      Citation Excerpt :

      For a multi-focus image, the LGS of a region can reflect its focusing degree to a certain extent, and the LGS of the focused region is usually more salient than those of the defocused region, and thus an LGS operator can be utilized to detect the focusing property. In recent years, as a useful tool for LGS feature analysis, the structure tensor has been successfully applied in fields such as edge detection [50], image reconstruction [51], image registration [52], image denoising [53], and image fusion [54]. The way MST based methods fuse multi-focus images is highly similar to that the human visual system processes visual information, and thus the fused results obtained by this type of methods usually yield promising visual performance, such as the cleanliness and integrity of the focus boundary (see Fig. 3 (b) and (e)).

    • Blind color-image deblurring based on color image gradients

      2019, Signal Processing
      Citation Excerpt :

      The method of selecting multi-image gradients was first proposed by Di Zenzo [29] and caused issues with gradient directions [30]. To fully detect color-image edges, our method to selects the color image gradients in the rgb normalized space instead of adopting the optimization method in [30]. The flowchart for calculating the color image gradients is shown in Fig. 1.

    • Video oriented filter for impulse noise reduction

      2018, Journal of Visual Communication and Image Representation
    View all citing articles on Scopus

    Lianghai Jin received the BS and MS degrees in computer science from Central South University (China) in 1988 and Beijing Jiaotong University (China) in 2002, and the PhD degree in pattern recognition and intelligent systems from Huazhong University of Science and Technology (China) in 2008, where he is now an associate professor with the School of Computer Science and Technology. From 1988 to 1999, he was with a railway institute in China as an engineer and senior engineer, respectively. His research interests include color and video image processing.

    Hong Liu received the BS and MS degrees in optoelectronic engineering and computer science from Huazhong University of Science and Technology (China) in 1984 and 1995, respectively, and the PhD degree in electronic engineering from Teesside University (United Kingdom) in 2000. She is currently an associate professor with the School of Computer Science and Technology, Huazhong University of Science and Technology. Her research interests include image processing and computer networking.

    Xiangyang Xu received the BS, MS, and PhD degrees in computer science from Huazhong University of Science and Technology (China) in 1998, 1991, and 2010, respectively. He is currently an associate professor with the School of Computer Science and Technology, Huazhong University of Science and Technology. His research interests include image processing and analysis.

    Enmin Song received the PhD degree in electronic engineering from Teesside University (United Kingdom) in 1999. He is currently a professor with the School of Computer Science and Technology, Huazhong University of Science and Technology, China. His research interests include image processing and algorithm analysis.

    View full text