Skip to main content
Log in

Model-based similarity estimation of multidimensional temporal sequences

  • Published:
annals of telecommunications - annales des télécommunications Aims and scope Submit manuscript

    We’re sorry, something doesn't seem to be working properly.

    Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

Content-based queries in multimedia sequence databases where information is sequential is a tough issue, especially when dealing with large-scale applications. One of the key points is similarity estimation between a query sequence and elements of the database. In this paper, we investigate two ways to compare multimedia sequences, one—that comes from the literature—being computed in the feature space while the other one is computed in a model space, leading to a representation less sensitive to noise. We compare these approaches by testing them on a real audio dataset, which points out the utility of working in the model space.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2

Similar content being viewed by others

Notes

  1. Note that this method might overestimate dissimilarity if the natural path contains a significant amount of nondiagonal parts.

References

  1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215(3):403–410

    Google Scholar 

  2. Andoni A, Indyk P (2006) Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. In: Proceedings of the 47th annual IEEE symposium on foundations of computer science. IEEE, Piscataway, pp 459–468

    Google Scholar 

  3. Bouthemy P, Gelgon M, Ganansia F (1999) A unified approach to shot change detection and camera motion characterization. IEEE Trans Circuits Syst Video Technol 9(7):1030–1044

    Article  Google Scholar 

  4. Bruno E, Marchand-Maillet S (2003) Prédiction temporelle de descripteurs visuels pour la mesure de similarité entre vidéos. In: Proceedings of the GRETSI’03. France

  5. Chen L, Ng R (2004) On the marriage of lp-norms and edit distance. In: Proceedings of the 30th international conference on very large data bases. Toronto, 29 August–3 September 2004, pp 792–803

  6. Ciaccia P, Patella M, Zezula P (1997) M-tree: an efficient access method for similarity search in metric spaces. In: Proceedings of the 23th international conference on very large data bases. Athens, Greece, August 1997. Morgan Kaufmann, San Mateo, pp 426–435

    Google Scholar 

  7. Davis S, Mermelstein P (1980) Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans Audio Speech Lang Process 28(4):357–366

    Google Scholar 

  8. Ding H, Trajcevski G, Scheuermann P, Wang X, Keogh E (2008) Querying and mining of time series data: Experimental comparison of representations and distance measures. In: Proceedings of the 34th international conference on very large data bases. Auckland, 23–28 August 2008

  9. Keogh E (2002) Exact indexing of dynamic time warping. In: Proceedings of the 28th international conference on very large data bases. Hong Kong, 20–23 August 2002, pp 406–417

  10. Lejsek H, Ásmundsson FH, Jónsson BÞ, Amsaleg L (2009) NV-tree: an efficient disk-based index for approximate search in very large high-dimensional collections. IEEE Trans Pattern Anal Mach Intell 31(5):869–883. doi:10.1109/TPAMI.2008.130

    Article  Google Scholar 

  11. Law-To J, Chen L, Joly A, Laptev I, Buisson O, Gouet-Brunet V, Boujemaa N, Stentiford F (2007) Video copy detection: a comparative study. In: Proceedings of the 6th ACM international conference on image and video retrieval. New York, NY, USA, July 2007. ACM, New York, pp 371–378

    Google Scholar 

  12. Mercer J (1909) Functions of positive and negative type, and their connection with the theory of integral equations. Philos Trans R Soc Lond A Contain Pap Math Phys Character 209:415–446

    Google Scholar 

  13. Muscariello A, Gravier G, Bimbot F (2009) Variability tolerant audio motif discovery. In: The 15th international multimedia modeling conference. Sophia Antipolis, 7–9 January 2009

  14. Nistér D, Stewénius H (2006) Scalable recognition with a vocabulary tree. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition. New York, 17–22 June 2006

  15. Sakoe H, Chiba S (1978) Dynamic programming optimization for spoken word recognition. IEEE Trans Acoust Speech Signal Process 26:43–49

    Article  MATH  Google Scholar 

  16. Smola AJ, Schoelkopf B (1998) A tutorial on support vector regression. http://www.eknigu.org/info/Cs_Computer%20science/CsAi_AI,%20knowledge/Smola%20A.J.,%20Schoelkopf%20B.%20Tutorial%20on%20support%20vector%20regression%20(2003)(24s).pdf

  17. Tavenard R, Amsaleg L, Gravier G (2007) Machines à vecteurs supports pour la comparaison de séquences de descripteurs. In: Proceedings of the 12th CORESA, pp 247–251

  18. Vapnik VN (1995) The nature of statistical learning theory. Springer, New York

    MATH  Google Scholar 

  19. Vapnik V, Golowich S, Smola A (1997) Support vector method for function approximation. In: Mozer M, Jordan M, Petsche T (eds.) Neural information processing systems, vol 9. MIT, Cambridge

    Google Scholar 

  20. Wilcoxon F (1945) Individual comparisons by ranking methods. Biom Bull 1:80–83

    Article  Google Scholar 

  21. Yi B, Jagadish HV, Faloutsos C (1998) Efficient retrieval of similar time sequences under time warping. In: Proceedings of the 14th international conference on data engineering, pp 201–208

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Romain Tavenard.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tavenard, R., Amsaleg, L. & Gravier, G. Model-based similarity estimation of multidimensional temporal sequences. Ann. Telecommun. 64, 381–390 (2009). https://doi.org/10.1007/s12243-009-0091-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12243-009-0091-4

Keywords

Navigation