Semiautomatic Learning of 3D Objects from Video Streams

Carrara, Fabio; Falchi, Fabrizio; Gennaro, Claudio

doi:10.1007/978-3-319-25087-8_20

Fabio Carrara¹⁷,
Fabrizio Falchi¹⁷ &
Claudio Gennaro¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9371))

Included in the following conference series:

International Conference on Similarity Search and Applications

1057 Accesses

Abstract

Object detection and recognition are classical problems in computer vision, but are still challenging without a priori knowledge of objects and with a limited user interaction. In this work, a semiautomatic system for visual object learning from video stream is presented. The system detects movable foreground objects relying on FAST interest points. Once a view of an object has been segmented, the system relies on ORB features to create its descriptor, store it and compare it with descriptors of previously seen views. To this end, a visual similarity function based on geometry consistency of the local features is used. The system groups together similar views of the same object into clusters relying on the transitivity of similarity among them. Each cluster identifies a 3D object and the system learn to autonomously recognize a particular view assessing its cluster membership. When ambiguities arise, the user is asked to validate the membership assignments. Experiments have demonstrated the ability of the system to group together unlabeled views, reducing the labeling work of the user.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Carrara, F., Amato, G., Falchi, F., Gennaro, C.: Efficient foreground-background segmentation using local features for object detection. In: Proceedings of the International Conference on Distributed Smart Cameras, ICDSC 2015, September 08–11, 2015, Seville, Spain (submitted for publication). http://puma.isti.cnr.it/rmydownload.php?filename=cnr.isti/cnr.isti/2015-TR-012/2015-TR-012.pdf
De Beugher, S., Brône, G., Goedemé, T.: Automatic analysis ofin-the-wild mobile eye-tracking experiments using object, face and persondetection. In: Proceedings of the International Conference on Computer Vision Theory and Applications (VISIGRAPP 2014), vol. 1, pp. 625–633 (2014)
Google Scholar
Dubrofsky, E.: Homography estimation. Ph.D. thesis, University of British Columbia (2009)
Google Scholar
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. Computer Vision and Image Understanding 106(1), 59–70 (2007)
Article Google Scholar
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings, vol. 2, pp. II–264. IEEE (2003)
Google Scholar
Lowe, D.G.: Local feature view clustering for 3d object recognition. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2001, vol. 1, pp. I–682. IEEE (2001)
Google Scholar
Murase, H., Nayar, S.K.: Visual learning and recognition of 3-d objects from appearance. International Journal of Computer Vision 14(1), 5–24 (1995)
Article Google Scholar
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: Orb: an efficient alternative to sift or surf. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 2564–2571. IEEE (2011)
Google Scholar
Savarese, S., Li, F.F.: 3D generic object categorization, localization and pose estimation. In: ICCV, pp. 1–8 (2007)
Google Scholar
Weber, M., Welling, M., Perona, P.: Unsupervised learning of models for recognition. Springer (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

ISTI-CNR, Via G. Moruzzi 1, 56124, Pisa, Italy
Fabio Carrara, Fabrizio Falchi & Claudio Gennaro

Authors

Fabio Carrara
View author publications
You can also search for this author in PubMed Google Scholar
Fabrizio Falchi
View author publications
You can also search for this author in PubMed Google Scholar
Claudio Gennaro
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Claudio Gennaro .

Editor information

Editors and Affiliations

ISTI-CNR, Pisa, Italy
Giuseppe Amato
University of Strathclyde, Glasgow, United Kingdom
Richard Connor
ISTI-CNR, Pisa, Italy
Fabrizio Falchi
ISTI-CNR, Pisa, Italy
Claudio Gennaro

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Carrara, F., Falchi, F., Gennaro, C. (2015). Semiautomatic Learning of 3D Objects from Video Streams. In: Amato, G., Connor, R., Falchi, F., Gennaro, C. (eds) Similarity Search and Applications. SISAP 2015. Lecture Notes in Computer Science(), vol 9371. Springer, Cham. https://doi.org/10.1007/978-3-319-25087-8_20

Download citation

DOI: https://doi.org/10.1007/978-3-319-25087-8_20
Published: 17 October 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25086-1
Online ISBN: 978-3-319-25087-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics