Abstract
The basic idea behind the energy transfer features is that the appearance of objects can be described using a function of energy distribution in images. Inside the image, the energy sources are placed and the energy is transferred from the sources during a certain chosen time. The values of energy distribution function have to be reduced into a reasonable number of values. The process of reducing can be simply solved by sampling. The input image is divided into regular cells. The mean value is calculated inside each cell. The values of samples are then considered as a vector that is used as an input for the SVM classifier. We propose an improvement to this process. The discrete cosine transform coefficients are calculated inside the cells (instead of the mean values) to construct the feature vector for the face and pedestrian detectors. To reduce the number of coefficients, we use the patterns in which the coefficients are grouped into regions. In the face detector, the principal component analysis is also used to create the feature vector with a relatively small dimension. The results show that, using this approach, the objects can be efficiently encoded with a relatively short vector with the results that outperform the results of the state-of-the-art detectors.






Similar content being viewed by others
References
Ahonen, T., Hadid, A., Pietikainen, M.: Face description with local binary patterns: application to face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 28(12), 2037–2041 (2006)
Alahi, A., Ortiz, R., Vandergheynst, P.: FREAK: fast retina keypoint. In: IEEE Conference on Computer Vision and Pattern Recognition, IEEE Conference on Computer Vision and Pattern Recognition. Ieee, New York. CVPR 2012 Open Source Award Winner (2012)
Bai, Y., Guo, L., Jin, L., Huang, Q.: A novel feature extraction method using pyramid histogram of orientation gradients for smile recognition. In: Proceedings of the 16th IEEE International Conference on Image processing. ICIP’09, pp. 3269–3272. IEEE Press, Piscataway, NJ, USA (2009)
Bay, H., Tuytelaars, T., Gool, L.J.V.: Surf: speeded up robust features. ECCV 1, 404–417 (2006)
Berg, T.L., Berg, A.C., Edwards, J., Forsyth, D.: Who’s in the picture. Adv. Neural Inf. Process. Syst. 17, 137–144 (2005)
Bosch, A., Zisserman, A., Munoz, X.: Representing shape with a spatial pyramid kernel. In: Proceedings of the 6th ACM International Conference on Image and Video Retrieval. CIVR ’07, pp. 401–408. ACM, New York, NY, USA (2007)
Calonder, M., Lepetit, V., Strecha, C., Fua, P.: Brief: binary robust independent elementary features. In: Proceedings of the 11th European Conference on Computer Vision: Part IV. ECCV’10, pp. 778–792. Springer, Berlin, Heidelberg (2010)
Center for Biological and Computational Learning: MIT CBCL pedestrian database #1 (2013). http://cbcl.mit.edu/software-datasets/PedestrianData.html
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition CVPR 2005, vol. 1, pp. 886–893 (2005)
Dalal, N., Triggs, B., Schmid, C.: Human detection using oriented histograms of flow and appearance. In: Proceedings of the 9th European Conference on Computer Vision-Volume Part II. ECCV’06, pp. 428–441. Springer, Berlin, Heidelberg (2006)
Enzweiler, M., Gavrila, D.: Monocular pedestrian detection: survey and experiments. IEEE Trans. Pattern Anal. Mach. Intell. 31(12), 2179–2195 (2009)
Felzenszwalb, P.F., McAllester, D.A., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: CVPR (2008)
Fusek, R., Sojka, E., Mozdren, K., Surkala, M.: Energy-transfer features and their application in the task of face detection. In: IEEE 10th International Conference on Advanced Video and Signal Based Surveillance (AVSS) 2013, pp. 147–152 (2013)
Hadid, A., Pietikainen, M., Ahonen, T.: A discriminative feature space for detecting and recognizing faces. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition CVPR 2004, vol. 2, pp. II-797-II-804 (2004)
Huang, D., Shan, C., Ardabilian, M., Wang, Y., Chen, L.: Local binary patterns and its application to facial image analysis: a survey. IEEE Trans. Syst. Man Cybern. C App. Rev. 41(6), 765–781 (2011)
Lee, K., Ho, J., Kriegman, D.: Acquiring linear subspaces for face recognition under variable lighting. IEEE Trans. Pattern Anal. Mach. Intell. 27(5), 684–698 (2005)
Leutenegger, S., Chli, M., Siegwart, R.: Brisk: Binary robust invariant scalable keypoints. In: IEEE International Conference on Computer Vision (ICCV) 2011 , pp. 2548–2555 (2011)
Liao, S., Zhu, X., Lei, Z., Zhang, L., Li, S.Z.: Learning multi-scale block local binary patterns for face recognition. In: ICB, pp. 828–837 (2007)
Lienhart, R., Maydt, J.: An extended set of haar-like features for rapid object detection. In: Proceedings of the International Conference on Image Processing 2002, vol. 1, pp. I-900-I-903 (2002)
Lowe, D.: Object recognition from local scale-invariant features. In: IEEE International Conference on the Proceedings of the Seventh Computer Vision 1999, vol. 2, pp. 1150–1157 (1999)
Ojala, T., Pietikäinen, M., Harwood, D.: A comparative study of texture measures with classification based on featured distributions. Pattern Recogn. 29(1), 51–59 (1996)
Pang, Y., Yuan, Y., Li, X., Pan, J.: Efficient HOG human detection. Signal Process. 91(4), 773–781 (2011)
Park, K.Y., Hwang, S.Y.: An improved haar-like feature for efficient object detection. Pattern Recogn. Lett. 42, 148–153 (2014)
Perona, P., Malik, J.: Scale-space and edge detection using anisotropic diffusion. IEEE Trans. Pattern Anal. Mach. Intell. 12, 629–639 (1990)
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: Orb: An efficient alternative to sift or surf. In: IEEE International Conference on Computer Vision (ICCV) 2011, pp. 2564–2571 (2011)
Shan, C., Gong, S., McOwan, P.W.: Facial expression recognition based on local binary patterns: a comprehensive study. Image Vision Comput. 27(6), 803–816 (2009)
Tan, X., Triggs, B.: Enhanced local texture feature sets for face recognition under difficult lighting conditions. IEEE Trans. Image Process. 19(6), 1635–1650 (2010)
Tsai, T., Huang, Y.P., Chiang, T.W.: Image retrieval based on dominant texture features. In: IEEE International Symposium on Industrial Electronics 2006, vol. 1, pp. 441–446 (2006)
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: IEEE Computer Society Conference on Proceedings of the 2001 Computer Vision and Pattern Recognition CVPR 2001, vol. 1, pp. I-511 - I-518 (2001)
Viola, P., Jones, M., Snow, D.: Detecting pedestrians using patterns of motion and appearance. In: Proceedings Ninth IEEE International Conference on Computer Vision 2003, pp. 734–741 vol. 2 (2003)
Zhang, L., Chu, R., Xiang, S., Liao, S., Li, S.Z.: Face detection based on multi-block lbp representation. In: Proceedings of the 2007 International Conference on Advances in Biometrics. ICB’07, pp. 11–18. Springe, Berlin, Heidelberg (2007)
Zheng, W., Liang, L.: Fast car detection using image strip features. In: IEEE Conference on Computer Vision and Pattern Recognition 2009. CVPR 2009, pp. 2703–2710 (2009)
Acknowledgments
This work was supported by SGS in VSB Technical University of Ostrava, Czech Republic, under the grant No. SP2015/141.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Fusek, R., Sojka, E. Energy transfer features combined with DCT for object detection. SIViP 10, 479–486 (2016). https://doi.org/10.1007/s11760-015-0777-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-015-0777-1