
Multi-vector feature space based on pseudo-Euclidean space and oblique basis

Multimedia Tools and Applications

Abstract

The Earth Mover’s Distance (EMD) and the quadratic-form distance (QFD) are representative distances used in similarity searches of images. Although the QFD greatly outperforms the EMD in speed, the EMD outperforms the QFD in retrieval performance. The EMD, however, has almost no theoretical justification and requires high computation costs. We propose a feature space model we call a “multi-vector feature space based on pseudo-Euclidean space and an oblique basis (MVPO).” In MVPO, an object such as an image is represented by a vector set (roughly speaking, a solid); the EMD is reinterpreted as the distance between vector sets, while the QFD is reinterpreted as the distance between the centroids of vector sets. MVPO therefore gives these distances a common geometrical view. We hypothesized that in MVPO the entity of an image is represented by a vector set (a solid) and that geometrical reasoning is applicable to MVPO. Our hypothesis explains well why the EMD outperforms the QFD in retrieval performance: the centroid of a solid is the simplest approximation of it. It also implies that the performance of the QFD should be good when solids are far apart but poor when they are close together. We conjectured that discriminability would decline (that is, dissimilar images would be judged similar) when the centroids of solids are very close, and our experiment supported this conjecture. From our hypothesis we further conjectured that, by simplifying an original solid, we can construct an approximation method that performs better than the QFD and runs faster than the EMD. The results of our experiment with this method supported this conjecture and, consequently, our hypothesis.
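To make the two views above concrete, here is a minimal sketch (not the authors’ implementation, and not the pseudo-Euclidean/oblique-basis machinery of MVPO): it treats each image as an equally weighted vector set, computes the EMD between two such sets as a transportation problem with plain Euclidean ground distance, and compares it with the distance between the sets’ centroids, i.e. the kind of approximation that the QFD corresponds to in MVPO.

```python
import numpy as np
from scipy.optimize import linprog

def emd(X, Y):
    """EMD between two equally weighted vector sets, solved as a
    transportation problem (minimum-cost flow) with linprog."""
    m, n = len(X), len(Y)
    D = np.linalg.norm(X[:, None, :] - Y[None, :, :], axis=2)  # ground distances
    c = D.ravel()                       # cost of flow f_ij, flattened as i*n + j
    A_eq = np.zeros((m + n, m * n))
    for i in range(m):                  # each source vector ships weight 1/m
        A_eq[i, i * n:(i + 1) * n] = 1
    for j in range(n):                  # each target vector receives weight 1/n
        A_eq[m + j, j::n] = 1
    b_eq = np.concatenate([np.full(m, 1 / m), np.full(n, 1 / n)])
    res = linprog(c, A_eq=A_eq, b_eq=b_eq, bounds=(0, None), method="highs")
    return res.fun

def centroid_distance(X, Y):
    """The simplest approximation of a solid: distance between centroids."""
    return float(np.linalg.norm(X.mean(axis=0) - Y.mean(axis=0)))

rng = np.random.default_rng(0)
A = rng.normal(size=(5, 3))                    # one "solid" of 5 feature vectors
B = A + rng.normal(scale=0.1, size=(5, 3))     # a nearby, similar solid
print("EMD              :", emd(A, B))
print("centroid distance:", centroid_distance(A, B))
```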



Notes

1. The proof for an image consisting of four colors in Fig. 11 follows from the property of the quadratic-form distance that for any vectors v and w, if d(v, w) = 0 then d((v + w) / 2, v) = 0.
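For reference, this property can be checked in one line, assuming the standard quadratic form \(d_{A}(x, y) = \sqrt{(x - y)^{\top} A (x - y)}\) with similarity matrix \(A\):

$$d_{A}\!\left(\frac{v + w}{2},\, v\right)^{2} = \left(\frac{w - v}{2}\right)^{\!\top}\! A \left(\frac{w - v}{2}\right) = \frac{1}{4}\,(v - w)^{\top} A\,(v - w) = \frac{1}{4}\, d_{A}(v, w)^{2} = 0.$$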


Acknowledgments

We would like to thank senior researcher D. Masumoto, Mr. Y. Uehara, and Mr. T. Baba in Fujitsu Laboratories Ltd. for their helpful advice and discussions during our research.

Author information


Corresponding author

Correspondence to Yasuo Yamane.

Appendix

1.1 Determining an Oblique Basis

Instead of using the eigenvalues of the similarity matrix, we show a method based on simultaneous equations that takes non-regular matrices into account.

(1) Method for a Regular Similarity Matrix

Let \(T = (e_{1}, e_{2}, \ldots, e_{n}) = \begin{pmatrix} e_{11} & e_{12} & \cdots & e_{1n} \\ 0 & e_{22} & \cdots & e_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & e_{nn} \end{pmatrix}.\)

Then Conditions (C1) and (C3) can be regarded as simultaneous equations in the components of \(T\), namely \(e_{11}, e_{12}, \ldots, e_{nn}\). Given that \(e_{ii} \neq 0\), these equations can be solved by determining \(e_{11}, e_{12}, \ldots, e_{1n}, e_{22}, \ldots, e_{2n}, \ldots, e_{nn}\) in that order. The values of \(e_{ij}\) (\(i < j\)) and \(e_{ii}\) are as follows (a sketch of this recursion in code is given at the end of this appendix):

$$e_{ij} = \left( r_{ij} - \sum_{k=1}^{i-1} e_{ki} e_{kj} \right) \Big/ \, e_{ii} \quad\text{and}\quad e_{ii} = \sqrt{1 - \sum_{k=1}^{i-1} e_{ki}^{2}}\,.$$
(2) Oblique Basis for Any Similarity Matrix

Above we assumed that the similarity matrix is regular, but using a method similar to the one above we can obtain a linearly independent oblique basis for any similarity matrix in \(2n\)-dimensional pseudo-Euclidean space. In fact, the similarity matrix in the circular case is not regular.

Let \(\mathbf{T} = (e_{1}, e_{2}, \ldots, e_{n}) = \begin{pmatrix} 1 & e_{12} & \cdots & e_{1n} \\ 0 & 1 & \cdots & e_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & 1 \\ 0 & 0 & \cdots & 0 \\ 0 & e_{n+2,2}\,i & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & e_{2n,n}\,i \end{pmatrix}.\)

This matrix consists of two square sub-matrices. The upper one is an upper triangular matrix whose diagonal components are all 1; therefore \(e_{1}, e_{2}, \ldots, e_{n}\) are obviously linearly independent. The lower sub-matrix is a diagonal matrix whose diagonal components are imaginary numbers or 0. Using a method similar to the one above, we can determine an oblique basis; all the \(e_{ij}\)'s obtained are real numbers.
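The recursion in (1) is, in effect, a Cholesky-style factorisation of the similarity matrix. The following minimal sketch (an illustration, not the authors’ code) builds the oblique basis for the regular case, assuming a positive definite similarity matrix \(R\) with unit diagonal; the \(2n\)-dimensional construction in (2) is not covered.

```python
import numpy as np

def oblique_basis(R, eps=1e-12):
    """Column-by-column construction of the oblique basis T = (e_1, ..., e_n)
    for a regular similarity matrix R with unit diagonal, following the
    recursion above; the columns then satisfy e_i . e_j = r_ij, i.e. T^T T = R."""
    n = R.shape[0]
    T = np.zeros((n, n))
    for i in range(n):
        # e_ii = sqrt(1 - sum_{k<i} e_ki^2)   (uses r_ii = 1)
        s = R[i, i] - T[:i, i] @ T[:i, i]
        if s <= eps:
            raise ValueError("R is not regular (positive definite); "
                             "the 2n-dimensional construction in (2) is needed")
        T[i, i] = np.sqrt(s)
        for j in range(i + 1, n):
            # e_ij = (r_ij - sum_{k<i} e_ki e_kj) / e_ii
            T[i, j] = (R[i, j] - T[:i, i] @ T[:i, j]) / T[i, i]
    return T

# Hypothetical 3-colour similarity matrix, used only for illustration.
R = np.array([[1.0, 0.5, 0.2],
              [0.5, 1.0, 0.4],
              [0.2, 0.4, 1.0]])
T = oblique_basis(R)
print(np.allclose(T.T @ T, R))   # True: the basis reproduces the similarities
```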


Cite this article

Yamane, Y., Hoshiai, T., Tsuda, H. et al. Multi-vector feature space based on pseudo-Euclidean space and oblique basis. Multimed Tools Appl 31, 287–308 (2006). https://doi.org/10.1007/s11042-006-0045-z
