Abstract
Same-style clothes retrieval is a task to search images which contain exactly the same designing style clothes. For such a task, too limited training data makes the problem of how to gain suitable same-style feature representations challenging but significant. In this paper, we adopt a memory-augmented deep neural network, also called as a few-shot learning model, to collect possibly same-style images. Besides, we present an object-aware clothes retrieval framework to further enhance the same-style feature representations, in which object focusing regions through object detection are first obtained, and a multi-task Siamese network is designed for ranking feature learning provided with some same-style or non-same-style image pairs. Experiments results show that our proposed solution is effective to discover more same-style images precisely, and further achieve the satisfied performance on same-style clothes retrieval.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Liang, X., Lin, L., Yang, W., Luo, P.: Clothes co-parsing via joint image segmentation and labeling with application to clothing retrieval. IEEE Trans. Multimedia 18(6), 1 (2016)
Liu, S., Song, Z., Wang, M., Xu, C., Lu, H., Yan, S.: Street-to-shop: cross-scenario clothing retrieval via parts alignment and auxiliary set. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3330–3337 (2012)
Veit, A., Kovacs, B., Bell, S., Mcauley, J., Bala, K., Belongie, S.: Learning visual clothing style with heterogeneous dyadic co-occurrences. In: IEEE International Conference on Computer Vision, pp. 4642–4650 (2015)
Luo, P., Wang, X., Tang, X.: Pedestrian parsing via deep decompositional network. In: IEEE International Conference on Computer Vision, pp. 2648–2655 (2013)
Yamaguchi, K., Kiapour, M.H., Berg, T.L.: Paper doll parsing: retrieving similar styles to parse clothing items. In: IEEE International Conference on Computer Vision, pp. 3519–3526 (2014)
Kiapour, M.H., Han, X., Lazebnik, S., Berg, A.C., Berg, T.L.: Where to buy it: matching street clothing photos in online shops. In: IEEE International Conference on Computer Vision, pp. 3343–3351 (2015)
Kaiser, Ł., Nachum, O., Roy, A., Bengio, S.: Learning to remember rare events. arXiv preprint arXiv:1703.03129 (2017)
Santoro, A., Bartunov, S., Botvinick, M., Wierstra, D., Lillicrap, T.: One-shot learning with memory-augmented neural networks. arXiv preprint arXiv:1605.06065 (2016)
Vinyals, O., Blundell, C., Lillicrap, T., Wierstra, D., et al.: Matching networks for one shot learning. In: Advances in Neural Information Processing Systems, pp. 3630–3638 (2016)
Ravi, S., Larochelle, H.: Optimization as a model for few-shot learning. In: International Conference on Learning Representations, vol. 1, p. 6 (2017)
Fang, Z., Liu, J., Wang, Y., Li, Y., Hang, S., Tang, J., Lu, H.: Object-aware deep network for commodity image retrieval. In: Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, pp. 405–408. ACM (2016)
Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: delving deep into convolutional nets. arXiv preprint arXiv:1405.3531 (2014)
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 248–255. IEEE (2009)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)
Chopra, S., Hadsell, R., LeCun, Y.: Learning a similarity metric discriminatively, with application to face verification. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, pp. 539–546 (2005)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Lin, T.Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature pyramid networks for object detection. arXiv preprint arXiv:1612.03144 (2016)
Dai, J., Qi, H., Xiong, Y., Li, Y., Zhang, G., Hu, H., Wei, Y.: Deformable convolutional networks. arXiv preprint arXiv:1703.06211 (2017)
Wang, X., Sun, Z., Zhang, W., Zhou, Y., Jiang, Y.G.: Matching user photos to online products with robust deep features. In: ACM on International Conference on Multimedia Retrieval, pp. 7–14 (2016)
Acknowledgments
This work was supported by National Natural Science Foundation of China (61332016 and 61472422).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Shen, Z., Fang, Z., Liu, J. (2018). Same-Style Products Mining for Clothes Retrieval. In: Huet, B., Nie, L., Hong, R. (eds) Internet Multimedia Computing and Service. ICIMCS 2017. Communications in Computer and Information Science, vol 819. Springer, Singapore. https://doi.org/10.1007/978-981-10-8530-7_44
Download citation
DOI: https://doi.org/10.1007/978-981-10-8530-7_44
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8529-1
Online ISBN: 978-981-10-8530-7
eBook Packages: Computer ScienceComputer Science (R0)