IMDB-Attire: A Novel Dataset for Attire Detection and Localization

Yousuf, Saad Bin; Sajid, Hasan; Poon, Simon; Khushi, Matloob

doi:10.1007/978-3-030-36711-4_46

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 11954)

Included in the following conference series:

International Conference on Neural Information Processing

Abstract

People’s attire, or way of dressing, not only signals their social status and personality but also affects how others meet and greet them. Attire detection has many useful applications; for example, clothing preferences in diverse regions of the world could be monitored and quantified, which is valuable information for fashion designers. Real-time clothing recognition can also be useful for security surveillance, where information about an individual’s clothes can help identify crime suspects. Recently, deep learning algorithms have shown promise in object detection and recognition. These algorithms are data hungry and are only as good as the data they are trained on. In this work, we focus on three tasks to address this problem. We created a unique dataset of ~8000 images from IMDb.com (a movie rating website) to address the challenge of training attire-detection algorithms for real-world application. The dataset contains pictures from movies, making it a good source of images from the wild. We manually labelled 60 different classes of attire. We then focused on multiclass classification and attire object detection using customized deep learning architectures, including YOLO and SSD. We achieved a mean Average Precision (mAP) of 64.14% and an Average Precision (AP) of 91.14% for the top 5 classes on YOLO. Available at https://github.com/saadyousuf45/.
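
To make the two reported metrics concrete, the following is a minimal sketch of how a per-class AP and the resulting mAP are commonly computed for object detectors. It assumes the widely used all-point interpolated AP over a ranked list of detections; the abstract does not state the exact evaluation protocol, so this may differ from what the authors used. The function names and the toy inputs are hypothetical and are not taken from the paper or its repository.

```python
from typing import Dict, Sequence


def average_precision(
    scores: Sequence[float],
    is_true_positive: Sequence[bool],
    num_ground_truth: int,
) -> float:
    """All-point interpolated AP for one class (assumed protocol, see lead-in)."""
    if num_ground_truth == 0 or not scores:
        return 0.0

    # Rank detections by descending confidence score.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)

    tp = fp = 0
    precisions = []
    recalls = []
    for i in order:
        if is_true_positive[i]:
            tp += 1
        else:
            fp += 1
        precisions.append(tp / (tp + fp))
        recalls.append(tp / num_ground_truth)

    # Replace each precision by the maximum precision at any higher recall.
    for k in range(len(precisions) - 2, -1, -1):
        precisions[k] = max(precisions[k], precisions[k + 1])

    # Sum precision weighted by each increase in recall.
    ap = 0.0
    prev_recall = 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


def mean_average_precision(per_class_ap: Dict[str, float]) -> float:
    """mAP is the unweighted mean of the per-class AP values."""
    return sum(per_class_ap.values()) / len(per_class_ap)


# Toy inputs for illustration only (not results from the paper).
toy_ap = average_precision(
    scores=[0.9, 0.8, 0.7],
    is_true_positive=[True, False, True],
    num_ground_truth=2,
)
toy_map = mean_average_precision({"class_a": 0.5, "class_b": 1.0})
```

Under this reading, the reported AP for the top 5 classes would be this average restricted to the five best-scoring classes, while the mAP averages over all classes evaluated. Whether a detection counts as a true positive is usually decided by an overlap threshold against the ground-truth box, which is assumed to have been applied before calling `average_precision`.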

Author information

Authors and Affiliations

School of Computer Science, The University of Sydney, Sydney, NSW, 2006, Australia
Saad Bin Yousuf, Simon Poon & Matloob Khushi
Department of Robotics and Intelligent Machine Engineering, National University of Sciences and Technology, Islamabad, Pakistan
Hasan Sajid

Corresponding author

Correspondence to Saad Bin Yousuf.

Editor information

Editors and Affiliations

Australian National University, Canberra, ACT, Australia
Tom Gedeon
Murdoch University, Murdoch, WA, Australia
Kok Wai Wong
Kyungpook National University, Daegu, Korea (Republic of)
Minho Lee

About this paper

Cite this paper

Yousuf, S.B., Sajid, H., Poon, S., Khushi, M. (2019). IMDB-Attire: A Novel Dataset for Attire Detection and Localization. In: Gedeon, T., Wong, K., Lee, M. (eds) Neural Information Processing. ICONIP 2019. Lecture Notes in Computer Science, vol 11954. Springer, Cham. https://doi.org/10.1007/978-3-030-36711-4_46

DOI: https://doi.org/10.1007/978-3-030-36711-4_46
Published: 09 December 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36710-7
Online ISBN: 978-3-030-36711-4
eBook Packages: Computer Science, Computer Science (R0)
