Abstract
In this chapter we examine various techniques for providing content access to information stored in a continuous medium, namely digital audio and digital video. Our coverage of audio is centered around post-processing the output of automatic recognition of speech or phones and we describe the various approaches than have been taken in this area. In order to give reasonable coverage of the possibilities and limitations of content-based access to digital video information we sketch out at a high level, the approaches taken in various video compression algorithms, principally the MPEG family. We then address approaches to shot and scene boundary detection, choosing representative frames for browsing and for search, and various browsing interfaces that have been developed. We finish with an overview of the likely developments in this area in the future.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Browne, P., Smeaton, A.F., Murphy, N., O’Connor N., Marlow, S., Berrut, C. Evaluating and Combining Digital Video Shot Boundary Detection Algorithms. In Proceedings of the Fourth Irish Machine Vision and Information Processing Conference, Queens University Belfast, September 1999.
Downie, J.S. and Nelson, M. Evaluation of a simple and effective music IR system. In: Proceedings of the 22nd ACM-SIGIR Conference, Athens, Greece, July 2000.
Eakins, J.P. Retrieval of Still Images by Content. This volume 2000.
Garofolo, J., Voorhees, E., Auzanne, C., Stanford, C. and Lund, B. TREC-7 Spoken Document Retrieval Track Overview and Results. In NIST Special Publication 500-242: The Seventh Text REtrieval Conference (TREC 7) 79–90, 1999. (also available at http://trec.nist.gov/pubs/trec7/t7_proceedings.html last visited 10 August 2000
Jones, G.J.F., Foote, J.T., Sparck Jones, K. and Young, S.J. Retrieving spoken documents by combining multiple index sources. In Proceedings of SIGIR 96, Research and Development in Information Retrieval, 30–38, Zürich, ACM Press, 1996.
Kazman, R. and Kominek, J. Supporting the Retrieval Process in Multimedia Information Systems. In: Proceedings of HICSS’ 97, Vol. VI, 229–238, 1997.
Koegel Buford, J.F. Multimedia Systems. ACM Press, Addison-Wesley Publishers, New York1994.
Lee, H,. Smeaton AF,. O’Toole, C,. Murphy, N,. Marlow S and O’Connor, N.E. The FÃschlár Digital Video Recording, Analysis, and Browsing System. In Proceedings of RIAO’ 2000: Content-Based Multimedia Information Access, Paris, France, April 12–14, 2000
Lee, H., Smeaton, AF, Berrut, C., Murphy, N, Marlow, S. and O’Connor, N. Implementation and Analysis of Several Keyframe-based Browsing Interfaces to Digital Video. To appear in Proceedings of the Fourth European Conference on Digital Libraries, Lisbon, Portugal, September 2000.
Perry, B., Chang, S-K, Dinsmore, J, Doermann, D, Rosenfeld, A and Stevens, S. Content-Based Access to Multimedia Information. From Technology Trends to State of the Art. Kluwer Academic Publishers, 69–77, 2000.
Quinn, G. and Smeaton, A.F. Optimal Parameters for Segmenting a Stream of Audio into Speech Documents, G. Quinn.: In Proceedings of the ESCA ETRW Workshop on Accessing Information in Spoken Audio: 19–20 April 1999, Cambridge, UK.
Rudnicky, A.I., Hauptmann, A.G. and Lee, K-F. Survey of current speech technology. Communications of the ACM. 37(3):52–57, 1994
Schäuble, P. Multimedia Information Retrieval. Kluwer Academic Publishers 1997.
Sikora, T. MPEG Digital Video-Coding Standards. IEEE Signal Processing Magazine, 82–99, 1997.
Smeaton, A.F., Gilvarry, J., Gormley, G., Tobin, B., Marlow S. and Murphy, N. An Evaluation of Alternative Techniques for Automatic Detection of Shot Boundaries in Digital Video. In: Proceedings of the Third Irish Machine Vision and Information Processing Conference, Dublin, September 1999.
Smeaton, A.F., Morony, M., Quinn G., and Scaife, R. TaiscéalaÃ: Information Retrieval from an Archive of Spoken Radio News. In Proceedings of the Second European Conference on Research and Advanced Technology for Digital Libraries (ECDL), Crete, C. Nikolaou and C. Stephanidis (Eds.) Springer LNCS 1513, 429–442, 1998.
Tannenbaum, R. S. Theoretical Foundations of Multimedia. W. H. Freeman and Company, The Computer Science Press, New York, 1998.
Voorhees, E.M. and Harman, D.H. The Sixth Text REtrieval Conference (TREC-6). Information Processing & Management 36(1):3–35 1999.
Zhang, H., Low, C. and Smoliar, S. Video Parsing and Browsing Using Compressed Data. Multimedia Tools and Applications. 1:89–111, 1995.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Smeaton, A.F. (2000). Indexing, Browsing, and Searching of Digital Video and Digital Audio Information. In: Agosti, M., Crestani, F., Pasi, G. (eds) Lectures on Information Retrieval. ESSIR 2000. Lecture Notes in Computer Science, vol 1980. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45368-7_5
Download citation
DOI: https://doi.org/10.1007/3-540-45368-7_5
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41933-4
Online ISBN: 978-3-540-45368-0
eBook Packages: Springer Book Archive