Paper
20 March 2015 Trade-off between speed and performance for colorectal endoscopic NBI image classification
Shoji Sonoyama, Toru Tamaki, Tsubasa Hirakawa, Bisser Raytchev, Kazufumi Kaneda, Tetsushi Koide, Yoko Kominami, Shigeto Yoshida, Shinji Tanaka
Author Affiliations +
Abstract
This paper investigates a trade-off between computation time and recognition rate of local descriptor-based recognition for colorectal endoscopic NBI image classification. Recent recognition methods using descriptors have been successfully applied to medical image classification. The accuracy of these methods might depend on the quality of vector quantization (VQ) and encoding of descriptors, however an accurate quantization takes a long time. This paper reports how a simple sampling strategy affects performances with different encoding methods. First, we extract about 7.7 million local descriptors from training images of a dataset of 908 NBI endoscopic images. Second, we randomly choose a subset of between 7.7M and 19K descriptors for VQ. Third, we use three encoding methods (BoVW, VLAD, and Fisher vector) with different number of descriptors. Linear SVM is used for classification of a three-class problem. The computation time for VQ was drastically reduced by the factor of 100, while the peak performance was retained. The performance improved roughly 1% to 2% when more descriptors by over-sampling were used for encoding. Performances with descriptors extracted every pixel ("grid1") or every two pixels ("grid2") are similar, while the computation time is very different; grid2 is 5 to 30 times faster than grid1. The main finding of this work is twofold. First, recent encoding methods such as VLAD and Fisher vector are as insensitive to the quality of VQ as BoVW. Second, there is a trade-off between computation time and performance in encoding over-sampled descriptors with BoVW and Fisher vector, but not with VLAD.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Shoji Sonoyama, Toru Tamaki, Tsubasa Hirakawa, Bisser Raytchev, Kazufumi Kaneda, Tetsushi Koide, Yoko Kominami, Shigeto Yoshida, and Shinji Tanaka "Trade-off between speed and performance for colorectal endoscopic NBI image classification", Proc. SPIE 9413, Medical Imaging 2015: Image Processing, 94132D (20 March 2015); https://doi.org/10.1117/12.2081928
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Computer programming

Endoscopy

Quantization

Visualization

Medical imaging

Image quality

Image classification

Back to Top