Abstract
People’s attire, or their way of dressing defines not only their social status and personality but also affects the way people meet and greet them. Attire detection has many useful applications such as clothing preferences in diverse regions of the world could be monitored and quantified. This information is very valuable for fashion designers. Real-time clothing recognition can be useful for security surveillance, where information about an individual’s clothes can be used to identify crime suspects. Recently, deep learning algorithms have shown promise in the field of object detection and recognition. These algorithms are data hungry and are only as good as the data they are trained on. In this work, we have focused on three tasks to address this problem. We created a unique dataset of ~8000 images from IMDBb.com (movie rating website) to address the challenge of real-world application of the algorithm training for attire detection. The dataset contains pictures from movies, making the dataset a good source of images from the wild. We manually labelled 60 different classes of attire. Then we focused on multiclass classification and attire object detection using customized deep learning architectures including YOLO and SSD. We achieved a mean Average Precision (mAP) of 64.14% and an Average Precision (AP) of 91.14% for top 5 classes on YOLO. Available at https://github.com/saadyousuf45/.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Khushi, M., et al.: Automated classification and characterization of the mitotic spindle following knockdown of a mitosis-related protein. BMC Bioinform. 18(16), 566 (2017)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems (2012)
LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436 (2015)
Chen, K.-T., Luo, J.: When fashion meets big data: discriminative mining of best selling clothing features. In: Proceedings of the 26th International Conference on World Wide Web Companion. International World Wide Web Conferences Steering Committee (2017)
Lao, B., Jagadeesh, K.: Convolutional neural networks for fashion classification and object detection. In: CCCV 2015: Computer Vision, pp. 120–129 (2016)
Divvala, S.K., Farhadi, A., Guestrin, C.: Learning everything about anything: webly-supervised visual concept learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2014)
Rohrbach, M., et al.: What helps where–and why? Semantic relatedness for knowledge transfer. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE (2010)
Chen, H., et al.: Composite templates for cloth modeling and sketching. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006). IEEE (2006)
Ng, H.N., Grimsdale, R.L.: Computer graphics techniques for modeling cloth. IEEE Comput. Graph. Appl. 16(5), 28–41 (1996)
Yamaguchi, K., Hadi Kiapour, M., Berg, T.L.: Paper doll parsing: Retrieving similar styles to parse clothing items. In: Proceedings of the IEEE International Conference on Computer Vision (2013)
Hasan, B., Hogg, D.C.: Segmentation using deformable spatial priors with application to clothing. In: BMVC (2010)
Wang, N., Ai, H.: Who blocks who: simultaneous clothing segmentation for grouping images. In: 2011 International Conference on Computer Vision. IEEE (2011)
Yang, M., Yu, K.: Real-time clothing recognition in surveillance videos. In: 2011 18th IEEE International Conference on Image Processing. IEEE (2011)
Packer, C., McAuley, J., Ramisa, A.: Visually-aware personalized recommendation using interpretable image representations. arXiv preprint arXiv:1806.09820 (2018)
Zhang, X., et al.: Trip outfits advisor: location-oriented clothing recommendation. IEEE Trans. Multimedia 19(11), 2533–2544 (2017)
Chen, Q., et al.: Deep domain adaptation for describing people based on fine-grained clothing attributes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2015)
Chen, H., Gallagher, A., Girod, B.: Describing clothing by semantic attributes. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7574, pp. 609–623. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33712-3_44
Bossard, L., Dantone, M., Leistner, C., Wengert, C., Quack, T., Van Gool, L.: Apparel classification with style. In: Lee, K.M., Matsushita, Y., Rehg, J.M., Hu, Z. (eds.) ACCV 2012. LNCS, vol. 7727, pp. 321–335. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-37447-0_25
He, Y., Yang, L., Chen, L.: Real-time fashion-guided clothing semantic parsing: a lightweight multi-scale inception neural network and benchmark. In: Workshops at the Thirty-First AAAI Conference on Artificial Intelligence (2017)
Liu, K.-H., Chen, T.-Y., Chen, C.-S.: MVC: a dataset for view-invariant clothing retrieval and attribute prediction. In: Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval. ACM (2016)
Chao, X., et al.: A framework for robust feature selection for real-time fashion style recommendation. In: Proceedings of the 1st International Workshop on Interactive Multimedia for Consumer Electronics. ACM (2009)
Chang, C.-C., Wang, L.-L.: Color texture segmentation for clothing in a computer-aided fashion design system. Image Vis. Comput. 14(9), 685–702 (1996)
Zhang, W., et al.: An intelligent fitting room using multi-camera perception. In: Proceedings of the 13th International Conference on Intelligent User Interfaces. ACM (2008)
Freixenet, J., Muñoz, X., Raba, D., Martí, J., Cufí, X.: Yet another survey on image segmentation: region and boundary information integration. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2352, pp. 408–422. Springer, Heidelberg (2002). https://doi.org/10.1007/3-540-47977-5_27
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection (2005)
Yang, J., et al.: Evaluating bag-of-visual-words representations in scene classification. In: Proceedings of the International Workshop on Workshop on Multimedia Information Retrieval. ACM (2007)
Adams, R., Bischof, L.: Seeded region growing. IEEE Trans. Pattern Anal. Mach. Intell. 16(6), 641–647 (1994)
Liu, Z., et al.: DeepFashion: Powering robust clothes recognition and retrieval with rich annotations. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016)
Hadi Kiapour, M., et al.: Where to buy it: matching street clothing photos in online shops. In: Proceedings of the IEEE International Conference on Computer Vision (2015)
Huang, J., et al.: Cross-domain image retrieval with a dual attribute-aware ranking network. In: Proceedings of the IEEE International Conference on Computer Vision (2015)
Ge, Y., et al., DeepFashion2: a versatile benchmark for detection, pose estimation, segmentation and re-identification of clothing images. arXiv preprint arXiv:1901.07973 (2019)
Liu, S., et al.: Fashion parsing with weak color-category labels. IEEE Trans. Multimedia 16(1), 253–265 (2014)
Everingham, M., et al.: The Pascal visual object classes (VOC) challenge. Int. J. Comput. Vis. 88(2), 303–338 (2010)
Bray, T., et al.: Extensible markup language (XML) 1.0. W3C recommendation, October 2000
Deng, J., et al.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition. IEEE (2009)
Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Redmon, J., et al.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2016)
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017)
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Ren, S., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems (2015)
Chollet, F.: Keras (2015)
Abadi, M., et al.: TensorFlow: a system for large-scale machine learning. In: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2016) (2016)
Khushi, M.: Benchmarking database performance for genomic data. J. Cell. Biochem. 116(6), 877–883 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Yousuf, S.B., Sajid, H., Poon, S., Khushi, M. (2019). IMDB-Attire: A Novel Dataset for Attire Detection and Localization. In: Gedeon, T., Wong, K., Lee, M. (eds) Neural Information Processing. ICONIP 2019. Lecture Notes in Computer Science(), vol 11954. Springer, Cham. https://doi.org/10.1007/978-3-030-36711-4_46
Download citation
DOI: https://doi.org/10.1007/978-3-030-36711-4_46
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-36710-7
Online ISBN: 978-3-030-36711-4
eBook Packages: Computer ScienceComputer Science (R0)