A compact multi-pattern encoding descriptor for texture classification

https://doi.org/10.1016/j.dsp.2021.103081Get rights and content

Abstract

Binary pattern family is considered as a powerful tool for visual texture classification. Most popular methods improve the classification performance by multi-feature fusion. However, many sub-features are redundant and low-discriminative and the classification system has high computational complexity and unsatisfactory results. To handle above problems, this paper proposes a compact multi-pattern encoding descriptor for visual texture classification. First, we develop local extremum patterns and local center pattern to represent the neighborhood intensity changes. Then, we design a compact encoding scheme to encode local maximum, minimum and center patterns into a three-bit binary code, named MMC pattern. Finally, a compact multi-pattern encoding descriptor is proposed by combining the traditional local sign pattern and MMC pattern. Experimental results on five representative texture databases demonstrate that our method achieves the state-of-the-art texture classification performance.

Introduction

Texture classification, as one of the most important tasks in pattern recognition and image processing, has attracted much attention and enjoys a wide range of applications such as medical imaging [1], remote sensing [2], [3], signature verification [4], script identification [5], and face analysis [6], [7], [8]. A successful texture feature should have a good tradeoff between high computational efficiency and rich information representation. Although it has been extensively studied over last few decades, it is still a challenging and open problem to effectively define and represent texture.

In 2017, Liu et al. [9] proposed a taxonomy of binary family and analyzed the merits and demerits of different LBP variants. However, it still does not have a clear definition for texture, and the distinction among different categories is unsatisfactory. Antonio et al. [10] provided a crisp mathematical definition about texture and proposed the histograms of equivalent patterns (HEP) framework for texture analysis. In the HEP framework, the encoding schemes are defined as different “kernel functions” of pixel values. As one of the most basic “kernel functions”, the local binary pattern (LBP) [11] has drawn great attention due to its powerful discriminative capability, computational simplicity, and ease of implementation. However, the traditional LBP only encodes the local sign components of local differences but ignores other local structure information. The incomplete information representation significantly limits its application.

Recent studies suggest that combining complementary sub-features is an effective solution for providing comprehensive texture representation. For example, the authors of [12] proposed a completed modeling of local binary pattern, named CLBP, by combining local sign, magnitude, and center pixel. Due to its good complementary in local information representation, CLBP has rapidly derived many variants, such as CLBC [13] and multi-scale CLBP [14]. The authors of [15] combined the modified color histogram and diagonally symmetric co-occurrence texture pattern for texture and natural scene retrieval. Bnaerjee et al. [16] proposed a local neighborhood intensity pattern for content-based image retrieval. Besides, the authors of [17] found a single but optimal scale for binary pattern descriptors and proposed the scale-adaptive LBP (SALBP) for texture classification. Afterward, in [18], a locally directional and extremal pattern was developed to exploit the information of local texture intensities and directionality effectively. Srishti et al. [19] proposed a local directional peak valley binary pattern for texture retrieval. The authors of [20] introduced a concave and convex binary thresholding function and proposed the local concave and convex micro-structures pattern for texture classification. To achieve a good tradeoff between simplicity and performance, the authors of [21] introduced a cross-scale joint feature representation and proposed the cross-complementary LBP. In [22], four doublets around center pixel were considered to generate the attractive and repulsive center-symmetric LBP that represented the fine and coarse texture information effectively and achieved remarkable performance improvements. In addition, 2D-LBP [23] was proposed by Bin that extracted the spatial contextual information and achieved satisfactory classification results.

The deep learning method is much different from the traditional hand-crafted method in terms of the feature representation. Some researchers have paid much attention to deep texture features generated by convolution neural networks (CNN) and its variants [24], [25], [26], [27]. In particular, Nruyen et al. [28] proposed a handcrafted normalized convolution network (NmzNet) for texture classification. Zhang et al. [29] presented a deep texture encoding network (Deep-TEN) for material recognition. Ayan et al. [30] developed a deep hashing framework for texture retrieval. Despite the success, the deep features still have a series of original issues. On one hand, texture feature distributions require a spatially invariant representation rather than the concatenation of fully connected layer in CNNs [30]. On the other hand, deep models require large amount of annotated training data. However, some texture datasets cannot provide sufficient labeled data. It is worth noting that transfer learning has been used for improving the problems with limited data in texture analysis. Specifically, Asaad et al. [26] applied transfer learning to classify scaled texture patterns. Alessandra et al. [31] used transfer learning and fine-tuning of pretrained models for plankton classification, Song et al. [32] designed a locally transferred fisher vector method for texture classification, which transformed the CNN-based FV descriptors to obtain more discriminative texture representation. Nonetheless, inborn drawbacks, including high computational complex and difficult parameters tuning, make the deep features-based texture representation still controversial. The authors of [33] provided comparative evaluation of handcrafted features vs. CNN-based features. It is interesting to notice that the handcrafted features are still better than CNN-based features for regular texture easily modeled or under little intra-class variation. Therefore, the main research of this paper focuses on the traditional handcrafted binary family features.

Despite the success of LBP variants, there are still significant disadvantages: all these classification systems divide the local neighborhood into different sub-features to represent local texture from different aspects. But these sub-features are usually not completely rotation invariant. In addition, most of sub-features are low discriminative and have high dimensionality, which make the classification system more complex and redundant. Obviously, these LBP-variants improve the classification accuracy and robustness at the price of high computational complexity that, in turn, severely limits their application in many real-world tasks. Despite recent achievements of LBP-variants, there is still a large room for evolving an effective and efficient texture descriptor. In this paper, we present a compact multi-pattern encoding (CMPE) descriptor for visual texture classification, which divides local neighborhood into local sign, local extremums, and center pixel. The contributions of this paper are highlighted as follows:

1. We introduce local extremum patterns, including maximum and minimum patterns, to represent local neighborhood intensity changes. Compared with traditional magnitude components, they have lower dimensionality and keep complete rotation invariance.

2. We develop a compact encoding scheme to jointly encode local maximum, minimum and center patterns into a three-bit binary code, named MMC pattern. No matter how many neighborhood points are sampled, its feature dimensionality always stays the same.

3. We propose a compact multi-pattern encoding descriptor by combining the traditional local sign pattern and MMC pattern to provide a highly complete texture representation.

4. Extensive Experiments on three popular texture databases validate that the compact multi-pattern encoding descriptor achieves the state-of-the-art classification performance.

The rest of the paper is organized as follows. In section 2, we briefly review the traditional LBP. Section 3, we present a compact multi-pattern encoding descriptor. Section 4 discusses the experimental results. Section 5 is the discussion. Finally, we conclude the paper in Section 6.

Section snippets

Local binary pattern

The traditional local binary patten descriptor was introduced by [11]. As one of the most powerful local texture descriptors, it has attracted wide interest of the texture analysis community.

Formally, given a center pixel gc, its corresponding radius R and P neighborhood pixels gp, p=0,...,P1, then the rotation invariant uniform LBP (LBPR,Priu2) is encoded by comparing gc with gp, which can be described as follows:LBPR,Priu2(gc)={p=0P1s(gpgc)2p,ifU(LBPR,P(gc))2P+1,otherwise, where U is the

Local extremum patterns

Previous studies have reported that local intensity change plays a critical role in texture feature representation. For instance, CLBP introduced the magnitude component (CLBP_M) as a complementary information. However, CLBP_M is low-discriminative and high dimensional that makes the classification system redundant and inefficient.

In this section, we propose local extremum patterns, including the local maximum and minimum patterns, to achieve a succinct and efficient neighborhood intensity

Experiments

In this section, a series of experiments are carried out to investigate the classification performance of the CMPE descriptor. For fair comparisons, all texture descriptors use the nearest neighborhood classifier (NNC) for classification tasks. We use five representation texture databases (including UMD [36], Outex [37], UIUC [38], KTH-TIPS [39], and KTH-TIPS2b [39]) to measure the effectiveness of the CMPE descriptor. It is worth to note that we use the “rgb2gray” function in MATLAB for

Discussion

The experimental results on five texture datasets show following third observations:

First, the proposed CMPE descriptor obtains the completely rotation invariance by jointly the MMC pattern and the local sign pattern. Especially for MMC pattern, as the local extremum and center pixel stay the same regardless of texture rotation, it provides the striking performance for image rotation.

Second, the CMPE descriptor achieves a high tradeoff between low-dimensional representation and discrimination

Conclusion

To improve the classification performance and keep low feature dimension, this paper proposes a compact multi-pattern encoding descriptor for visual texture classification. First, the local extremum patterns are developed to describe neighborhood intensity changes. Then, we design a compact encoding scheme to encode local extremums and center patterns into a three-bit binary code, named MMC pattern. Finally, a compact multi-pattern encoding descriptor is presented by combining the traditional

CRediT authorship contribution statement

Xiaochun Xu: Methodology, Software, Validation, Writing – original draft. Yibing Li: Conceptualization, Supervision. Q.M. Jonathan Wu: Writing – review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

The paper is funded by the National Key Research and Development Program of China (Grant No. 2016YFF0102806), the National Natural Science Foundation of China (Grant No. 51809056), the Natural Science Foundation of Heilongjiang Province, China (Grant No. F2017004).

Xiaochun Xu received her BE degree in electronic information engineering and her MS degree in information and communication engineering from Harbin Engineering University, Harbin, China, in 2013 and 2016, respectively. She is currently pursuing her PhD at the College of Information and Communication Engineering, Harbin Engineering University. Her research interests include image processing and pattern recognition.

References (61)

  • A. Lumini et al.

    Deep learning and transfer learning features for plankton classification

    Ecol. Inform.

    (2019)
  • K. Wang et al.

    Local binary circumferential and radial derivative pattern for texture classification

    Pattern Recognit.

    (2017)
  • Z. Pan et al.

    Central pixel selection strategy based on local gray-value distribution by using gradient information to enhance LBP for texture classification

    Expert Syst. Appl.

    (2019)
  • X. Xu et al.

    A completed local shrinkage pattern for texture classification

    Appl. Soft Comput.

    (2020)
  • K. Song et al.

    Adjacent evaluation of local binary pattern for texture classification

    J. Vis. Commun. Image Represent.

    (2015)
  • Z. Zhang et al.

    Learning completed discriminative local features for texture classification

    Pattern Recognit.

    (2017)
  • Y. Zhao et al.

    Completed robust local binary pattern for texture classification

    Neurocomputing

    (2013)
  • Z. Pan et al.

    Feature based local binary pattern for rotation invariant texture classification

    Expert Syst. Appl.

    (2017)
  • Y. Zhao et al.

    Local quantization code histogram for texture classification

    Neurocomputing

    (2016)
  • T. Tuncer et al.

    A novel neural network based image descriptor for texture classification

    Physica A

    (2019)
  • A.K. Bhunia et al.

    Signature verification approach using fusion of hybrid texture features

    Neural Comput. Appl.

    (2019)
  • M. Verma et al.

    Script identification in natural scene images: a dataset and texture-feature based performance evaluation

  • X. Tan et al.

    Enhanced local texture feature sets for face recognition under difficult lighting conditions

    IEEE Trans. Image Process.

    (2010)
  • C. Ding et al.

    Multi-directional multi-level dual-cross patterns for robust face recognition

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2014)
  • S. Chakraborty et al.

    Local directional gradient pattern: a local descriptor for face recognition

    Multimed. Tools Appl.

    (2017)
  • A. Fernández et al.

    Texture description through histograms of equivalent patterns

    J. Math. Imaging Vis.

    (2013)
  • T. Ojala et al.

    Multiresolution gray-scale and rotation invariant texture classification with local binary patterns

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2002)
  • Z. Guo et al.

    A completed modeling of local binary pattern operator for texture classification

    IEEE Trans. Image Process.

    (2010)
  • Y. Zhao et al.

    Completed local binary count for rotation invariant texture classification

    IEEE Trans. Image Process.

    (2012)
  • C. Chen et al.

    Land-use scene classification using multi-scale completed local binary patterns

    Signal Image Video Process.

    (2016)
  • Cited by (13)

    View all citing articles on Scopus

    Xiaochun Xu received her BE degree in electronic information engineering and her MS degree in information and communication engineering from Harbin Engineering University, Harbin, China, in 2013 and 2016, respectively. She is currently pursuing her PhD at the College of Information and Communication Engineering, Harbin Engineering University. Her research interests include image processing and pattern recognition.

    Yibing Li received his BS, MS, and PhD degrees from Harbin Marine Engineering College, Harbin Engineering University in 1989, 1997, and 2003, respectively. He has been a teacher at Harbin Engineering University of China since 1989, and became as a professor in 2004. From 2007 to 2008, he stayed at the University of Hong Kong Electronic Engineering Lab as a visiting scholar. Now, he is a senior member of China Institute of Communications and a senior member of China Computer Federation.

    Q. M. Jonathan Wu received his Ph.D. in electrical engineering from the University of Wales, Swansea, U.K., in 1990. He was affiliated with the National Research Council of Canada for ten years beginning in 1995, where he became a Senior Research Officer and a Group Leader. He is currently a Professor with the Department of Electrical and Computer Engineering, University of Windsor, Windsor, ON, Canada. He has published more than 300 peer-reviewed papers in computer vision, image processing, intelligent systems, robotics, and integrated microsystems. His current research interests include 3-D computer vision, active video object tracking and extraction, interactive multimedia, sensor analysis and fusion, and visual sensor networks. Dr. Wu holds the Tier 1 Canada Research Chair in Automotive Sensors and Information Systems. He is an Associate Editor of the IEEE Transactions on Neural Networks and Learning Systems, the Cognitive Computation, and the International Journal of Robotics and Automation. He was an Associated Editor of the IEEE Transactions on System, Man and Cybernetics, Part A. He has served on technical program committees and international advisory committees for many prestigious conferences.

    View full text