Skip to main content

Advertisement

Log in

Energy transfer features combined with DCT for object detection

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

The basic idea behind the energy transfer features is that the appearance of objects can be described using a function of energy distribution in images. Inside the image, the energy sources are placed and the energy is transferred from the sources during a certain chosen time. The values of energy distribution function have to be reduced into a reasonable number of values. The process of reducing can be simply solved by sampling. The input image is divided into regular cells. The mean value is calculated inside each cell. The values of samples are then considered as a vector that is used as an input for the SVM classifier. We propose an improvement to this process. The discrete cosine transform coefficients are calculated inside the cells (instead of the mean values) to construct the feature vector for the face and pedestrian detectors. To reduce the number of coefficients, we use the patterns in which the coefficients are grouped into regions. In the face detector, the principal component analysis is also used to create the feature vector with a relatively small dimension. The results show that, using this approach, the objects can be efficiently encoded with a relatively short vector with the results that outperform the results of the state-of-the-art detectors.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Ahonen, T., Hadid, A., Pietikainen, M.: Face description with local binary patterns: application to face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 28(12), 2037–2041 (2006)

    Article  Google Scholar 

  2. Alahi, A., Ortiz, R., Vandergheynst, P.: FREAK: fast retina keypoint. In: IEEE Conference on Computer Vision and Pattern Recognition, IEEE Conference on Computer Vision and Pattern Recognition. Ieee, New York. CVPR 2012 Open Source Award Winner (2012)

  3. Bai, Y., Guo, L., Jin, L., Huang, Q.: A novel feature extraction method using pyramid histogram of orientation gradients for smile recognition. In: Proceedings of the 16th IEEE International Conference on Image processing. ICIP’09, pp. 3269–3272. IEEE Press, Piscataway, NJ, USA (2009)

  4. Bay, H., Tuytelaars, T., Gool, L.J.V.: Surf: speeded up robust features. ECCV 1, 404–417 (2006)

    Google Scholar 

  5. Berg, T.L., Berg, A.C., Edwards, J., Forsyth, D.: Who’s in the picture. Adv. Neural Inf. Process. Syst. 17, 137–144 (2005)

    Google Scholar 

  6. Bosch, A., Zisserman, A., Munoz, X.: Representing shape with a spatial pyramid kernel. In: Proceedings of the 6th ACM International Conference on Image and Video Retrieval. CIVR ’07, pp. 401–408. ACM, New York, NY, USA (2007)

  7. Calonder, M., Lepetit, V., Strecha, C., Fua, P.: Brief: binary robust independent elementary features. In: Proceedings of the 11th European Conference on Computer Vision: Part IV. ECCV’10, pp. 778–792. Springer, Berlin, Heidelberg (2010)

  8. Center for Biological and Computational Learning: MIT CBCL pedestrian database #1 (2013). http://cbcl.mit.edu/software-datasets/PedestrianData.html

  9. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition CVPR 2005, vol. 1, pp. 886–893 (2005)

  10. Dalal, N., Triggs, B., Schmid, C.: Human detection using oriented histograms of flow and appearance. In: Proceedings of the 9th European Conference on Computer Vision-Volume Part II. ECCV’06, pp. 428–441. Springer, Berlin, Heidelberg (2006)

  11. Enzweiler, M., Gavrila, D.: Monocular pedestrian detection: survey and experiments. IEEE Trans. Pattern Anal. Mach. Intell. 31(12), 2179–2195 (2009)

    Article  Google Scholar 

  12. Felzenszwalb, P.F., McAllester, D.A., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: CVPR (2008)

  13. Fusek, R., Sojka, E., Mozdren, K., Surkala, M.: Energy-transfer features and their application in the task of face detection. In: IEEE 10th International Conference on Advanced Video and Signal Based Surveillance (AVSS) 2013, pp. 147–152 (2013)

  14. Hadid, A., Pietikainen, M., Ahonen, T.: A discriminative feature space for detecting and recognizing faces. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition CVPR 2004, vol. 2, pp. II-797-II-804 (2004)

  15. Huang, D., Shan, C., Ardabilian, M., Wang, Y., Chen, L.: Local binary patterns and its application to facial image analysis: a survey. IEEE Trans. Syst. Man Cybern. C App. Rev. 41(6), 765–781 (2011)

    Article  Google Scholar 

  16. Lee, K., Ho, J., Kriegman, D.: Acquiring linear subspaces for face recognition under variable lighting. IEEE Trans. Pattern Anal. Mach. Intell. 27(5), 684–698 (2005)

    Article  Google Scholar 

  17. Leutenegger, S., Chli, M., Siegwart, R.: Brisk: Binary robust invariant scalable keypoints. In: IEEE International Conference on Computer Vision (ICCV) 2011 , pp. 2548–2555 (2011)

  18. Liao, S., Zhu, X., Lei, Z., Zhang, L., Li, S.Z.: Learning multi-scale block local binary patterns for face recognition. In: ICB, pp. 828–837 (2007)

  19. Lienhart, R., Maydt, J.: An extended set of haar-like features for rapid object detection. In: Proceedings of the International Conference on Image Processing 2002, vol. 1, pp. I-900-I-903 (2002)

  20. Lowe, D.: Object recognition from local scale-invariant features. In: IEEE International Conference on the Proceedings of the Seventh Computer Vision 1999, vol. 2, pp. 1150–1157 (1999)

  21. Ojala, T., Pietikäinen, M., Harwood, D.: A comparative study of texture measures with classification based on featured distributions. Pattern Recogn. 29(1), 51–59 (1996)

    Article  Google Scholar 

  22. Pang, Y., Yuan, Y., Li, X., Pan, J.: Efficient HOG human detection. Signal Process. 91(4), 773–781 (2011)

    Article  MATH  Google Scholar 

  23. Park, K.Y., Hwang, S.Y.: An improved haar-like feature for efficient object detection. Pattern Recogn. Lett. 42, 148–153 (2014)

    Article  Google Scholar 

  24. Perona, P., Malik, J.: Scale-space and edge detection using anisotropic diffusion. IEEE Trans. Pattern Anal. Mach. Intell. 12, 629–639 (1990)

  25. Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: Orb: An efficient alternative to sift or surf. In: IEEE International Conference on Computer Vision (ICCV) 2011, pp. 2564–2571 (2011)

  26. Shan, C., Gong, S., McOwan, P.W.: Facial expression recognition based on local binary patterns: a comprehensive study. Image Vision Comput. 27(6), 803–816 (2009)

    Article  Google Scholar 

  27. Tan, X., Triggs, B.: Enhanced local texture feature sets for face recognition under difficult lighting conditions. IEEE Trans. Image Process. 19(6), 1635–1650 (2010)

    Article  MathSciNet  Google Scholar 

  28. Tsai, T., Huang, Y.P., Chiang, T.W.: Image retrieval based on dominant texture features. In: IEEE International Symposium on Industrial Electronics 2006, vol. 1, pp. 441–446 (2006)

  29. Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: IEEE Computer Society Conference on Proceedings of the 2001 Computer Vision and Pattern Recognition CVPR 2001, vol. 1, pp. I-511 - I-518 (2001)

  30. Viola, P., Jones, M., Snow, D.: Detecting pedestrians using patterns of motion and appearance. In: Proceedings Ninth IEEE International Conference on Computer Vision 2003, pp. 734–741 vol. 2 (2003)

  31. Zhang, L., Chu, R., Xiang, S., Liao, S., Li, S.Z.: Face detection based on multi-block lbp representation. In: Proceedings of the 2007 International Conference on Advances in Biometrics. ICB’07, pp. 11–18. Springe, Berlin, Heidelberg (2007)

  32. Zheng, W., Liang, L.: Fast car detection using image strip features. In: IEEE Conference on Computer Vision and Pattern Recognition 2009. CVPR 2009, pp. 2703–2710 (2009)

Download references

Acknowledgments

This work was supported by SGS in VSB Technical University of Ostrava, Czech Republic, under the grant No. SP2015/141.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Radovan Fusek.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fusek, R., Sojka, E. Energy transfer features combined with DCT for object detection. SIViP 10, 479–486 (2016). https://doi.org/10.1007/s11760-015-0777-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-015-0777-1

Keywords

Navigation