Abstract
A unique problem is considered: how to automatically determine whether two images (pictures, drawings, or photos) show the same 3D object pictured in different views. Similarity or equality of the objects seen despite the spotting, size or angle of view is one of the biggest challenges. Thus, a unique proposition of solving this task is proposed. In this complex solution image sets are compared and following question is answered: are these sets similar in the sense that they present an object of the same type? The approach uses CNN-like deep neural networks for object region masking, specific feature extraction and proper type classification, and similarity measures for data matching. Experimental validation is performed on a public car image database.
The research work was supported by Warsaw University of Technology.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Chollet, F., et al.: Keras (2015). http://keras.io
Dehghan, A., Masood, S.Z., Shu, G., Ortiz, E.G.: View independent vehicle make, model and color recognition using convolutional neural network (2017). https://arxiv.org/abs/1702.01721
Gaillard, M., Egyed-Zsigmond, E.: Large scale reverse image search: a method comparison for almost identical image retrieval. In: Proceedings - INFORSID 2017, p. 127 (2017). https://hal.archives-ouvertes.fr/hal-01591756
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition, pp. 770–778 (2016). https://doi.org/10.1109/CVPR.2016.90
He, K., Gkioxari, G., Dollár, P., Girshick, R.: Mask R-CNN. In: 2017 IEEE International Conference on Computer Vision (ICCV) (2017)
Krause, J., Stark, M., Deng, J., Fei-Fei, L.: 3D object representations for fine-grained categorization. In: 4th International IEEE Workshop on 3D Representation and Recognition (3dRR-13) (2013)
Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P., Zitnick, C.L.: Microsoft COCO: Common objects in context. In: Computer Vision ECCV 2014, pp. 740–755 (2014)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39(6), 1137–1149 (2017)
Schroff, F., Kalenichenko, D., Philbin, J.: FaceNet: a unified embedding for face recognition and clustering. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015. https://doi.org/10.1109/CVPR.2015.7298682
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2016
Wang, J., Song, Y., Leung, T., Rosenberg, C., Wang, J., Philbin, J., Chen, B., Wu, Y.: Learning fine-grained image similarity with deep ranking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2014
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Piwowarski, P., Kasprzak, W. (2021). Multi-stream Fusion in Image Sets Comparison. In: Szewczyk, R., Zieliński, C., Kaliczyńska, M. (eds) Automation 2021: Recent Achievements in Automation, Robotics and Measurement Techniques. AUTOMATION 2021. Advances in Intelligent Systems and Computing, vol 1390. Springer, Cham. https://doi.org/10.1007/978-3-030-74893-7_22
Download citation
DOI: https://doi.org/10.1007/978-3-030-74893-7_22
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-74892-0
Online ISBN: 978-3-030-74893-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)