Abstract
In this article, we present Savanta, an information gathering interface for temporal, semantic video annotations. In Savanta, we integrate various methods and paradigms for information gathering, including visualisation, filtering, data mining, navigation and search, in order to explore the possible advantages of doing so. We posit that a seamless integration of multiple access methods, combined with an improved interval visualisation scheme and dynamically generated metadata, will result in greater user satisfaction compared to conventional approaches for searching and querying video databases—despite the increased complexity that may result. We perform a formal usability evaluation comparing Savanta to systems based on traditional search/query paradigms, and conclude that Savanta outperforms them with regard to both power and usability, especially for complex and open-ended tasks.















Similar content being viewed by others
References
Adali S, Candan KS, Chen S, Erol K, Subrahmanian VS (1996) The advanced video information system: data structures and query processing. Multimedia Syst 4–4:172–186
Ahlberg C, Williamson C, Shneiderman B (1992) Dynamic queries for information exploration: an implementation and evaluation. Proceedings of ACM CHI’92 Conference on Human Factors in Computer Systems, pp 619–626
Allen JF (1983) Maintaining knowledge about temporal intervals. Commun ACM 26–11:832–843
Anjulan A, Canagarajah N (2006) A novel framework for robust annotation and retrieval in video sequences, CIVR 2006, LNCS 4071, pp. 183–192
Baeza-Yates RA, Ribeiro-Neto BA (1999) Modern information retrieval. ACM, Addison-Wesley
Carrer M, Ligresti L, Ahanger G, Little TDC (1997) An annotation engine for supporting video database population. Multimedia Tools Appl 5–3:233–258
Chan SSM, Li Q, Wu Y, Zhuang Y (2002) Accommodating hybrid retrieval in a comprehensive video database management system. IEEE Trans Multimedia 4–2
Chin JP, Diehl VA, Norman KL (1988) Development of an instrument measuring user satisfaction of the human-computer interface. In: Proceedings of ACM CHI’88 conference on human factors in computing systems, pp 213–218
Christel MG, Kanade T, Mauldin M, Reddy R, Sirbu M, Stevens SM, Wactlar HD (1995) Informedia digital video library. Commun ACM 38–4:57–58
Chua T, Chen L, Wang J (2002) Stratification approach to modeling video. Multimedia Tools Appl 16–1:79–97
Dönderler ME, Saykol E, Ulusoy Ö, Güdükbay U (2003) BilVideo: a video database management system. IEEE MultiMedia 10–1:66–70
Dorado A, Calic J, Izquierdo E (2004) A rule-based video annotation system. IEEE Trans Circuits Syst Video Technol 14–5:622–633
Ekin A, Tekalp AM, Mehrotra R (2004) Integrated semantic–syntactic video modeling for search and browsing. IEEE Trans Multimedia 6–6:839–851
Gershon N, Eick SG, Card S (1998) Design: information visualization. Interactions 5–2:9–15
Hacid M-S, Decleir C, Kouloumdjian J (2000) A database approach for modeling and querying video data. IEEE Trans Knowl Data Eng 12–5:729–750
Hauglid JO (2005) User interfaces for accessing information in digital repositories, Ph.D. thesis, Department of Information and Computer Science, Norwegian University of Science and Technology, NTNU trykk
Hauglid JO, Midtstraum R (2002) SESAM: searching supported by analysis of metadata. In: Proceedings of the 2002 ACM symposium on applied computing (SAC), pp 418–425
Heggland J (2002) OntoLog: temporal annotation using ad hoc ontologies and application profiles. In research and advanced technology for digital libraries (ECDL), pp 118–128
Heggland J (2005) OntoLog: flexible management of temporal video content annotations, Ph.D. thesis, Department of Information and Computer Science, Norwegian University of Science and Technology, NTNU trykk
Hjelsvold R (1995) VideoSTAR-a database for video information sharing. Department of Computer Science and Telematics, Norwegian Institute of Technology
Hjelsvold R, Midtstraum R (1994) Modelling and querying video data. In VLDB’94, In: Proceedings of 20th international conference on Very Large Data Bases, pp 686–694
Hjelsvold R, Midtstraum R, Sandstå O (1995a) A temporal foundation of video databases. In: Clifford J, Tuzhilin A (eds) Recent advances in temporal databases: proceedings of the international workshop on temporal databases, pp 295–314
Hjelsvold R, Langørgen S, Midtstraum R, Sandstå O (1995b) Integrated video archive tools. ACM Multimedia, pp 283–293
Kazman R, Al-Halimi R, Hunt W, Mantei M (1996) Four paradigms for indexing video conferences. IEEE MultiMedia 3–1:63–73
Kokkoras FA, Jiang H, Vlahavas IP, Elmagarmid AK, Houstis EN, Aref WG (2002) Smart videotext: a video data model based on conceptual graphs. Multimedia Syst 8–4:328–338
Mackay WE, Beaudouin-Lafon M (1998) DIVA: exploratory data analysis with multimedia streams. CHI, pp 416–423
Mitchell M, Jolley J (2001) Research design explained, 4th edn. Harcourt, Fort Worth, TX
Nack F, Hardman L (2002) Towards a syntax for multimedia semantics http://www.cwi.nl/ftp/CWIreports/INS/INS-R0204.pdf
Nielsen J (1993) Usability engineering. Academic, Boston
Noldus Information Technology (2007) The observer, http://www.noldus.com/site/doc200401012
Oomoto E, Tanaka K (1993) OVID: design and implementation of a video-object database system. IEEE Trans Knowl Data Eng 5–4:629–643
Ponceleon D, Srinivasan S, Amir A, Petkovic D, Diklic D (1998) Key to effective video retrieval: effective cataloging and browsing. In: Proceedings of the ACM multimedia ’98 conference. ACM, New York, pp. 99–107
Santini S, Jain R (1999) Interfaces for emergent semantics in multimedia databases. In: Proceedings of the IS&T/SPIE conference on storage and retrieval for image and video databases, pp 167–175
Shneiderman B (1996) The eyes have it: a task by data type taxonomy for information visualizations. In: Proceedings of the IEEE symposium on visual languages, pp 336–343
Shneiderman B (1997) Designing the user interface: strategies for effective human-computer interaction. Addison-Wesley, Reading, MA
Skou CV (2003) Qualitative media analyzer, http://www.cvs.dk/qma.htm, August
Slaughter LA, Oard DW, Warnick VL, Harding JL, Wilkerson GJ (1998) A graphical interface for speech-based retrieval. In: Proceedings of the 3rd ACM international conference on digital libraries, pp 305–306
Volkmer T, Smith JR, Natsev A (2005) A web-based system for collaborative annotation of large image and video collections: an evaluation and user study. In: Proceedings of the 13th annual ACM international conference on multimedia, November 06–11, 2005, Hilton, Singapore. DOI 10.1145/1101149.1101341
Wactlar HD, Kanade T, Smith MA, Stevens SM (1996) Intelligent access to digital video: informedia project. IEEE Comp 29–5:46–52
Weiss R, Duda A, Gifford DK (1995) Composition and search with a video algebra. IEEE MultiMedia 2–1:12–25
Whittaker S, Hirschberg J, Choi J, Hindle D, Pereira FCN, Singhal A (1999) SCAN: designing and evaluating user interfaces to support retrieval from speech archives. In: Proceedings of the 22nd annual international ACM SIGIR conference on research and development in information retrieval pp 26–33
Zhao N, Chen S-C, Shyu M-L (2006) Video database modeling and temporal pattern retrieval using hierarchical Markov model mediator. In: Proceedings of the 22nd International Conference on Data Engineering Workshops (ICDEW’06)
Acknowledgements
We would like to thank Steinar Line for invaluable help in producing the test data, and him and Kjetil Nørvåg for participating in pilot studies of both Savanta and the usability test.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Hauglid, J.O., Heggland, J. Savanta—search, analysis, visualisation and navigation of temporal annotations. Multimed Tools Appl 40, 183–210 (2008). https://doi.org/10.1007/s11042-008-0204-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-008-0204-5