Asymmetric Feature Representation for Object Recognition in Client Server System

Yamauchi, Yuji; Ambai, Mitsuru; Sato, Ikuro; Yoshida, Yuichi; Fujiyoshi, Hironobu; Yamashita, Takayoshi

doi:10.1007/978-3-319-16865-4_39

Yuji Yamauchi⁵,
Mitsuru Ambai⁶,
Ikuro Sato⁶,
Yuichi Yoshida⁶,
Hironobu Fujiyoshi⁵ &
…
Takayoshi Yamashita⁵

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9003))

Included in the following conference series:

Asian Conference on Computer Vision

2002 Accesses

Abstract

This paper proposes asymmetric feature representation and efficient fitting feature spaces for object recognition in client server system. We focus on the fact that the server-side has more sufficient memory and computation power compared to the client-side. Although local descriptors must be compressed on the client-side due to the narrow bandwidth of the Internet, feature vector compression on the server-side is not always necessary. Therefore, we propose asymmetric feature representation for descriptor matching. Our method is characterized by the following three factors. The first is asymmetric feature representation between client- and server-side. Although the binary hashing function causes quantization errors due to the computation of the sgn function \((\cdot )\), which binarizes a real value into \(\{1,-1\}\), such errors only occur on the client-side. As a result, performance degradation is suppressed while the volume of data traffic is reduced. The second is scale optimization to fit two different feature spaces. The third is fast implementation of distance computation based on real-vector decomposition. We can compute efficiently the squared Euclidean distance between the binary code and the real vector. Experimental results revealed that the proposed method helps reduce data traffic while maintaining the object retrieval performance of a client server system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
In this paper, as with [6], a binary code is expressed by \( \{-1, 1\}\) instead of \(\{0, 1\}\) in order to simplify the mathematical expressions.
2.
In many cases [6, 11–13, 15, 19], \(f(\cdot )\) is an identity function.
3.
Before the local features are converted, they are mean-centered by using an average descriptor which is computed from training samples.

References

Bay, H., Ess, A., Tuytelaars, T., Van Gool, L.: Speeded-up robust features (surf). Comput. Vis. Image Underst. 110, 346–359 (2008)
Article Google Scholar
Takacs, G., Chandrasekhar, V., Tsai, S., Chen, D., Grzeszczuk, R., Girod, B.: Unified real-time tracking and recognition with rotation-invariant fast features. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2010)
Google Scholar
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Trans. Pattern Anal. Mach. Intell. 27, 1615–1630 (2005)
Article Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004)
Article Google Scholar
Chandrasekhar, V., Takacs, G., Reznik, Y., Grzeszczuk, R., Girod, B.: Compressed histogram of gradients: A low-bitrate descriptor. Int. J. Comput. Vis. 96(3), 384–399 (2011)
Article Google Scholar
Gong, Y., Lazebnik, S.: Iterative quantization : A procrustean approach to learning binary codes. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2011)
Google Scholar
Calonder, M., Lepetit, V., Ozuysal, M., Trzcinski, T., Strecha, C., Fua, P.: BRIEF: Computing a local binary descriptor very fast. IEEE Trans. Pattern Anal. Mach. Intell. 34, 1281–1298 (2012)
Article Google Scholar
Leutenegger, S., Chli, M., Siegwart, R.: BRISK: Binary robust invariant scalable keypoints. In: IEEE International Conference on Computer Vision (2011)
Google Scholar
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: An efficient alternative to SIFT or SURF. In: IEEE International Conference on Computer Vision (2011)
Google Scholar
Alahi, A., Ortiz, R., Vandergheynst, P.: FREAK: Fast retina keypoint. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2012)
Google Scholar
Achlioptas, D.: Database-friendly random projections: Johnson-Lindenstrauss with binary coins. J. Comput. Syst. Sci. 66, 671–687 (2003)
Article MathSciNet Google Scholar
Li, P., Hastie, T.J., Church, K.W.: Very Sparse Random Projections. In: International Conference on Knowledge Discovery and Data Mining (2006)
Google Scholar
Ambai, M., Yoshida, Y.: CARD: Compact and real-time descriptors. In: IEEE International Conference on Computer Vision, pp. 97–104 (2011)
Google Scholar
Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing. In: Neural Information Processing Systems, pp. 1753–1760 (2008)
Google Scholar
Heo, J.P., Lee, Y., He, J., Chang, S.F., Yoon, S.E.: Spherical hashing. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2012)
Google Scholar
Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, in conjunction European Conference on Computer Vision (2004)
Google Scholar
Jégou, H., Douze, M., Schmid, C., Pérez, P.: Aggregating local descriptors into a compact image representation. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3304–3311 (2010)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2007)
Google Scholar
Weiss, Y., Fergus, R., Torralba, A.: Multidimensional spectral hashing. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 340–353. Springer, Heidelberg (2012)
Chapter Google Scholar
Daume III, H.: Frustratingly easy domain adaptation. In: Annual Meeting of the Association of Computational Linguistics, pp. 256–263 (2007)
Google Scholar
Duan, L., Tsang, I.W., Xu, D., Chua, T.S.: Domain adaptation from multiple sources via auxiliary classifiers. In: Annual International Conference on Machine Learning, pp. 289–296 (2009)
Google Scholar
Yang, J., Yan, R., Hauptmann, A.G.: Cross-domain video concept detection using adaptive svms. In: International Conference on Multimedia, pp. 188–197 (2007)
Google Scholar
Saenko, K., Kulis, B., Fritz, M., Darrell, T.: Adapting visual category models to new domains. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 213–226. Springer, Heidelberg (2010)
Chapter Google Scholar
Geng, B., Tao, D., Xu, C.: Daml : Domain adaptation metric learning. IEEE Trans. Image Process. 20, 2980–2989 (2011)
Article MathSciNet Google Scholar
Kulis, B., Saenko, K., Darrell, T.: What you saw is not what you get: Domain adaptation using asymmetric kernel transforms. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1785–1792 (2011)
Google Scholar
Davis, J.V., Kulis, B., Jain, P., Sra, S., Dhillon, I.S.: Information-theoretic metric learning. In: International Conference on Machine Learning, pp. 209–216 (2007)
Google Scholar
Hare, S., Saffari, A., Torr, P.H.S.: Efficient online structured output learning for keypoint-based object tracking. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1894–1901 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Chubu University, Kasugai, Japan
Yuji Yamauchi, Hironobu Fujiyoshi & Takayoshi Yamashita
Denso IT Laboratory, Inc., Tokyo, Japan
Mitsuru Ambai, Ikuro Sato & Yuichi Yoshida

Authors

Yuji Yamauchi
View author publications
You can also search for this author in PubMed Google Scholar
Mitsuru Ambai
View author publications
You can also search for this author in PubMed Google Scholar
Ikuro Sato
View author publications
You can also search for this author in PubMed Google Scholar
Yuichi Yoshida
View author publications
You can also search for this author in PubMed Google Scholar
Hironobu Fujiyoshi
View author publications
You can also search for this author in PubMed Google Scholar
Takayoshi Yamashita
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuji Yamauchi .

Editor information

Editors and Affiliations

Technische Universität München, Garching, Bayern, Germany
Daniel Cremers
University of Adelaide, Adelaide, South Australia, Australia
Ian Reid
Keio University, Yokohama, Kanagawa, Japan
Hideo Saito
University of California at Merced, Merced, California, USA
Ming-Hsuan Yang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yamauchi, Y., Ambai, M., Sato, I., Yoshida, Y., Fujiyoshi, H., Yamashita, T. (2015). Asymmetric Feature Representation for Object Recognition in Client Server System. In: Cremers, D., Reid, I., Saito, H., Yang, MH. (eds) Computer Vision – ACCV 2014. ACCV 2014. Lecture Notes in Computer Science(), vol 9003. Springer, Cham. https://doi.org/10.1007/978-3-319-16865-4_39

Download citation

DOI: https://doi.org/10.1007/978-3-319-16865-4_39
Published: 16 April 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16864-7
Online ISBN: 978-3-319-16865-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics