short-paper

Learned vs. Hand-Crafted Features for Pedestrian Gender Recognition

Authors:

Grigory Antipov,

Sid-Ahmed Berrani,

Natacha Ruchaud,

Jean-Luc DugelayAuthors Info & Claims

MM '15: Proceedings of the 23rd ACM international conference on Multimedia

Pages 1263 - 1266

https://doi.org/10.1145/2733373.2806332

Published: 13 October 2015 Publication History

Abstract

This paper addresses the problem of image features selection for pedestrian gender recognition. Hand-crafted features (such as HOG) are compared with learned features which are obtained by training convolutional neural networks. The comparison is performed on the recently created collection of versatile pedestrian datasets which allows us to evaluate the impact of dataset properties on the performance of features. The study shows that hand-crafted and learned features perform equally well on small-sized homogeneous datasets. However, learned features significantly outperform hand-crafted ones in the case of heterogeneous and unfamiliar (unseen) datasets. Our best model which is based on learned features obtains 79% average recognition rate on completely unseen datasets. We also show that a relatively small convolutional neural network is able to produce competitive features even with little training data.

References

[1]

L. Cao, M. Dikmen, Y. Fu, and T. S. Huang. Gender recognition from body. In ACM MM, Canada, 2008.

Digital Library

[2]

K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman. Return of the devil in the details: Delving deep into convolutional nets. CoRR, abs/1405.3531, 2014.

[3]

M. Collins, J. Zhang, P. Miller, and H. Wang. Full body image feature representations for gender profiling. In ICCV, Japan, 2009.

[4]

N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR, USA, 2005.

Digital Library

[5]

Y. Deng, P. Luo, C. C. Loy, and X. Tang. Pedestrian attribute recognition at far distance. In ACM MM, USA, 2014.

Digital Library

[6]

I. J. Goodfellow, Y. Bulatov, J. Ibarz, S. Arnoud, and V. Shet. Multi-digit number recognition from street view imagery using deep convolutional neural networks. CoRR, abs/1312.6082, 2013.

[7]

J. A. Hanley and B. J. McNeil. The meaning and use of the area under a receiver operating characteristic (roc) curve. Radiology, 1982.

[8]

G. E. Hinton, O. Vinyals, and J. Dean. Distilling the knowledge in a neural network. In NIPS Deep Learning Workshop, Canada, 2014.

[9]

Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. CoRR, abs/1408.5093, 2014.

Digital Library

[10]

T. Joachims. Making large scale SVM learning practical. 1999.

[11]

A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, USA, 2012.

[12]

R. Layne, T. M. Hospedales, S. Gong, et al. Person re-identification by attributes. In BMVC, UK, 2012.

[13]

Y. LeCun and Y. Bengio. Convolutional networks for images, speech, and time series. The handbook of brain theory and neural networks, 1995.

Digital Library

[14]

D. G. Lowe. Object recognition from local scale-invariant features. In ICCV, Canada, 1999.

Digital Library

[15]

C.-B. Ng, Y.-H. Tay, and B.-M. Goi. A convolutional neural network for pedestrian gender recognition. In ISNN. Springer, 2013.

Digital Library

[16]

T. Ojala, M. Pietik\"ainen, and D. Harwood. A comparative study of texture measures with classification based on featured distributions. Pattern recognition, 1996.

[17]

A. S. Razavian, H. Azizpour, J. Sullivan, and S. Carlsson. Cnn features off-the-shelf: an astounding baseline for recognition. CoRR, abs/1403.6382, 2014.

[18]

O. Russakovsky, J. Deng, H. Su, J. Krause, and S. S. et al. ImageNet Large Scale Visual Recognition Challenge. CoRR, abs/1409.0575, 2014.

[19]

Y. Taigman, M. Yang, M. Ranzato, and L. Wolf. Deepface: Closing the gap to human-level performance in face verification. In CVPR, USA, 2014.

Digital Library

Cited By

Da Silva APereira L(2024)Evaluating Methods for Violence Classification and Firearm Detection in Indoor CCTV EnvironmentJournal of the Brazilian Computer Society10.5753/jbcs.2024.328230:1(411-420)Online publication date: 5-Oct-2024
https://doi.org/10.5753/jbcs.2024.3282
Lee DJeong MJeong SJung SPark K(2024)Estimation of Fractal Dimension and Segmentation of Body Regions for Deep Learning-Based Gender RecognitionFractal and Fractional10.3390/fractalfract81005518:10(551)Online publication date: 24-Sep-2024
https://doi.org/10.3390/fractalfract8100551
Naralasetti VBodapati J(2024)Enhancing Plant Leaf Disease Prediction Through Advanced Deep Feature Representations: A Transfer Learning ApproachJournal of The Institution of Engineers (India): Series B10.1007/s40031-023-00966-0105:3(469-482)Online publication date: 10-Feb-2024
https://doi.org/10.1007/s40031-023-00966-0
Show More Cited By

Index Terms

Learned vs. Hand-Crafted Features for Pedestrian Gender Recognition
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations

Recommendations

Classification of radiolarian images with hand-crafted and deep features

Radiolarians are planktonic protozoa and are important biostratigraphic and paleoenvironmental indicators for paleogeographic reconstructions. Radiolarian paleontology still remains as a low cost and the one of the most convenient way to obtain dating ...
Hand-Crafted Features or Machine Learnt Features? Together They Improve RGB-D Object Recognition
ISM '14: Proceedings of the 2014 IEEE International Symposium on Multimedia

RGB-D object recognition is an important research topic in computer version, and seeking a robust image representation is the most important sub problem for RGB-D object recognition. On the one hand, the recently emerging deep learning methods, which ...
Crack recognition on concrete structures based on machine crafted and hand crafted features
Abstract
Concrete structures play a critical role in infrastructure development, and their safety is of utmost importance. Cracks in concrete structures can be a sign of deterioration, which may affect the overall safety of the building. Regular ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '15: Proceedings of the 23rd ACM international conference on Multimedia

October 2015

1402 pages

ISBN:9781450334594

DOI:10.1145/2733373

General Chairs:
Xiaofang Zhou
The University of Queensland, Australia
,
Alan F. Smeaton
Dublin City University, Ireland
,
Qi Tian
The University of Texas at San Antonio, USA
,
Program Chairs:
Dick C.A. Bulterman
FXPAL, USA
,
Heng Tao Shen
The University of Queensland, Australia
,
Ketan Mayer-Patel
The University of North Carolina, USA
,
Shuicheng Yan
National University of Singapore, Singapore

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 October 2015

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Conference

MM '15

Sponsor:

SIGMM

MM '15: ACM Multimedia Conference

October 26 - 30, 2015

Brisbane, Australia

Acceptance Rates

MM '15 Paper Acceptance Rate 56 of 252 submissions, 22%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

74
Total Citations
View Citations
709
Total Downloads

Downloads (Last 12 months)22
Downloads (Last 6 weeks)2

Reflects downloads up to 13 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Da Silva APereira L(2024)Evaluating Methods for Violence Classification and Firearm Detection in Indoor CCTV EnvironmentJournal of the Brazilian Computer Society10.5753/jbcs.2024.328230:1(411-420)Online publication date: 5-Oct-2024
https://doi.org/10.5753/jbcs.2024.3282
Lee DJeong MJeong SJung SPark K(2024)Estimation of Fractal Dimension and Segmentation of Body Regions for Deep Learning-Based Gender RecognitionFractal and Fractional10.3390/fractalfract81005518:10(551)Online publication date: 24-Sep-2024
https://doi.org/10.3390/fractalfract8100551
Naralasetti VBodapati J(2024)Enhancing Plant Leaf Disease Prediction Through Advanced Deep Feature Representations: A Transfer Learning ApproachJournal of The Institution of Engineers (India): Series B10.1007/s40031-023-00966-0105:3(469-482)Online publication date: 10-Feb-2024
https://doi.org/10.1007/s40031-023-00966-0
Bhola GVishwakarma D(2024)A review of vision-based indoor HAR: state-of-the-art, challenges, and future prospectsMultimedia Tools and Applications10.1007/s11042-023-15443-583:1(1965-2005)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1007/s11042-023-15443-5
Dogan YOzdemir CKaya Y(2024)Enhancing CNN model classification performance through RGB angle rotation methodNeural Computing and Applications10.1007/s00521-024-10232-z36:32(20259-20276)Online publication date: 1-Nov-2024
https://dl.acm.org/doi/10.1007/s00521-024-10232-z
Liu MZhang YLi H(2023)Survey of Cross-Modal Person Re-Identification from a Mathematical PerspectiveMathematics10.3390/math1103065411:3(654)Online publication date: 28-Jan-2023
https://doi.org/10.3390/math11030654
Srilatha MSrinivasu N(2023)Person Identification from Video Analytic System Using Edge Computing -A Review2023 1st International Conference on Cognitive Computing and Engineering Education (ICCCEE)10.1109/ICCCEE55951.2023.10424635(1-7)Online publication date: 27-Apr-2023
https://doi.org/10.1109/ICCCEE55951.2023.10424635
Inoue MNishiyama MIwai Y(2023)Age group identification using gaze-guided feature extraction2023 IEEE 12th Global Conference on Consumer Electronics (GCCE)10.1109/GCCE59613.2023.10315305(708-711)Online publication date: 10-Oct-2023
https://doi.org/10.1109/GCCE59613.2023.10315305
Vu MBeurton-Aimar M(2023)Learning to focus on region-of-interests for pain intensity estimation2023 IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG)10.1109/FG57933.2023.10042583(1-6)Online publication date: 5-Jan-2023
https://dl.acm.org/doi/10.1109/FG57933.2023.10042583
Abbas FYasmin MFayyaz MAsim U(2023)ViT-PGC: vision transformer for pedestrian gender classification on small-size datasetPattern Analysis and Applications10.1007/s10044-023-01196-226:4(1805-1819)Online publication date: 26-Sep-2023
https://doi.org/10.1007/s10044-023-01196-2
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten