Skip to main content

IMDB-Attire: A Novel Dataset for Attire Detection and Localization

  • Conference paper
  • First Online:
Neural Information Processing (ICONIP 2019)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11954))

Included in the following conference series:

Abstract

People’s attire, or their way of dressing defines not only their social status and personality but also affects the way people meet and greet them. Attire detection has many useful applications such as clothing preferences in diverse regions of the world could be monitored and quantified. This information is very valuable for fashion designers. Real-time clothing recognition can be useful for security surveillance, where information about an individual’s clothes can be used to identify crime suspects. Recently, deep learning algorithms have shown promise in the field of object detection and recognition. These algorithms are data hungry and are only as good as the data they are trained on. In this work, we have focused on three tasks to address this problem. We created a unique dataset of ~8000 images from IMDBb.com (movie rating website) to address the challenge of real-world application of the algorithm training for attire detection. The dataset contains pictures from movies, making the dataset a good source of images from the wild. We manually labelled 60 different classes of attire. Then we focused on multiclass classification and attire object detection using customized deep learning architectures including YOLO and SSD. We achieved a mean Average Precision (mAP) of 64.14% and an Average Precision (AP) of 91.14% for top 5 classes on YOLO. Available at https://github.com/saadyousuf45/.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Khushi, M., et al.: Automated classification and characterization of the mitotic spindle following knockdown of a mitosis-related protein. BMC Bioinform. 18(16), 566 (2017)

    Article  Google Scholar 

  2. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems (2012)

    Google Scholar 

  3. LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436 (2015)

    Article  Google Scholar 

  4. Chen, K.-T., Luo, J.: When fashion meets big data: discriminative mining of best selling clothing features. In: Proceedings of the 26th International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee (2017)

    Google Scholar 

  5. Lao, B., Jagadeesh, K.: Convolutional neural networks for fashion classification and object detection. In: CCCV 2015: Computer Vision, pp. 120–129 (2016)

    Google Scholar 

  6. Divvala, S.K., Farhadi, A., Guestrin, C.: Learning everything about anything: webly-supervised visual concept learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2014)

    Google Scholar 

  7. Rohrbach, M., et al.: What helps where–and why? Semantic relatedness for knowledge transfer. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE (2010)

    Google Scholar 

  8. Chen, H., et al.: Composite templates for cloth modeling and sketching. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006). IEEE (2006)

    Google Scholar 

  9. Ng, H.N., Grimsdale, R.L.: Computer graphics techniques for modeling cloth. IEEE Comput. Graph. Appl. 16(5), 28–41 (1996)

    Article  Google Scholar 

  10. Yamaguchi, K., Hadi Kiapour, M., Berg, T.L.: Paper doll parsing: Retrieving similar styles to parse clothing items. In: Proceedings of the IEEE International Conference on Computer Vision (2013)

    Google Scholar 

  11. Hasan, B., Hogg, D.C.: Segmentation using deformable spatial priors with application to clothing. In: BMVC (2010)

    Google Scholar 

  12. Wang, N., Ai, H.: Who blocks who: simultaneous clothing segmentation for grouping images. In: 2011 International Conference on Computer Vision. IEEE (2011)

    Google Scholar 

  13. Yang, M., Yu, K.: Real-time clothing recognition in surveillance videos. In: 2011 18th IEEE International Conference on Image Processing. IEEE (2011)

    Google Scholar 

  14. Packer, C., McAuley, J., Ramisa, A.: Visually-aware personalized recommendation using interpretable image representations. arXiv preprint arXiv:1806.09820 (2018)

  15. Zhang, X., et al.: Trip outfits advisor: location-oriented clothing recommendation. IEEE Trans. Multimedia 19(11), 2533–2544 (2017)

    Article  Google Scholar 

  16. Chen, Q., et al.: Deep domain adaptation for describing people based on fine-grained clothing attributes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2015)

    Google Scholar 

  17. Chen, H., Gallagher, A., Girod, B.: Describing clothing by semantic attributes. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7574, pp. 609–623. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33712-3_44

    Chapter  Google Scholar 

  18. Bossard, L., Dantone, M., Leistner, C., Wengert, C., Quack, T., Van Gool, L.: Apparel classification with style. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds.) ACCV 2012. LNCS, vol. 7727, pp. 321–335. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-37447-0_25

    Chapter  Google Scholar 

  19. He, Y., Yang, L., Chen, L.: Real-time fashion-guided clothing semantic parsing: a lightweight multi-scale inception neural network and benchmark. In: Workshops at the Thirty-First AAAI Conference on Artificial Intelligence (2017)

    Google Scholar 

  20. Liu, K.-H., Chen, T.-Y., Chen, C.-S.: MVC: a dataset for view-invariant clothing retrieval and attribute prediction. In: Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval. ACM (2016)

    Google Scholar 

  21. Chao, X., et al.: A framework for robust feature selection for real-time fashion style recommendation. In: Proceedings of the 1st International Workshop on Interactive Multimedia for Consumer Electronics. ACM (2009)

    Google Scholar 

  22. Chang, C.-C., Wang, L.-L.: Color texture segmentation for clothing in a computer-aided fashion design system. Image Vis. Comput. 14(9), 685–702 (1996)

    Article  Google Scholar 

  23. Zhang, W., et al.: An intelligent fitting room using multi-camera perception. In: Proceedings of the 13th International Conference on Intelligent User Interfaces. ACM (2008)

    Google Scholar 

  24. Freixenet, J., Muñoz, X., Raba, D., Martí, J., Cufí, X.: Yet another survey on image segmentation: region and boundary information integration. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2352, pp. 408–422. Springer, Heidelberg (2002). https://doi.org/10.1007/3-540-47977-5_27

    Chapter  Google Scholar 

  25. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection (2005)

    Google Scholar 

  26. Yang, J., et al.: Evaluating bag-of-visual-words representations in scene classification. In: Proceedings of the International Workshop on Workshop on Multimedia Information Retrieval. ACM (2007)

    Google Scholar 

  27. Adams, R., Bischof, L.: Seeded region growing. IEEE Trans. Pattern Anal. Mach. Intell. 16(6), 641–647 (1994)

    Article  Google Scholar 

  28. Liu, Z., et al.: DeepFashion: Powering robust clothes recognition and retrieval with rich annotations. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016)

    Google Scholar 

  29. Hadi Kiapour, M., et al.: Where to buy it: matching street clothing photos in online shops. In: Proceedings of the IEEE International Conference on Computer Vision (2015)

    Google Scholar 

  30. Huang, J., et al.: Cross-domain image retrieval with a dual attribute-aware ranking network. In: Proceedings of the IEEE International Conference on Computer Vision (2015)

    Google Scholar 

  31. Ge, Y., et al., DeepFashion2: a versatile benchmark for detection, pose estimation, segmentation and re-identification of clothing images. arXiv preprint arXiv:1901.07973 (2019)

  32. Liu, S., et al.: Fashion parsing with weak color-category labels. IEEE Trans. Multimedia 16(1), 253–265 (2014)

    Article  Google Scholar 

  33. Everingham, M., et al.: The Pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)

    Article  Google Scholar 

  34. Bray, T., et al.: Extensible markup language (XML) 1.0. W3C recommendation, October 2000

    Google Scholar 

  35. Deng, J., et al.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition. IEEE (2009)

    Google Scholar 

  36. Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)

  37. Redmon, J., et al.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016)

    Google Scholar 

  38. Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017)

    Google Scholar 

  39. Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2

    Chapter  Google Scholar 

  40. Ren, S., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems (2015)

    Google Scholar 

  41. Chollet, F.: Keras (2015)

    Google Scholar 

  42. Abadi, M., et al.: TensorFlow: a system for large-scale machine learning. In: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2016) (2016)

    Google Scholar 

  43. Khushi, M.: Benchmarking database performance for genomic data. J. Cell. Biochem. 116(6), 877–883 (2015)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Saad Bin Yousuf .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Yousuf, S.B., Sajid, H., Poon, S., Khushi, M. (2019). IMDB-Attire: A Novel Dataset for Attire Detection and Localization. In: Gedeon, T., Wong, K., Lee, M. (eds) Neural Information Processing. ICONIP 2019. Lecture Notes in Computer Science(), vol 11954. Springer, Cham. https://doi.org/10.1007/978-3-030-36711-4_46

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-36711-4_46

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-36710-7

  • Online ISBN: 978-3-030-36711-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics