Indexing, Browsing, and Searching of Digital Video and Digital Audio Information

Lectures on Information Retrieval (ESSIR 2000)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1980)

Abstract

In this chapter we examine various techniques for providing content access to information stored in continuous media, namely digital audio and digital video. Our coverage of audio centres on post-processing the output of automatic recognition of speech or phones, and we describe the various approaches that have been taken in this area. To give reasonable coverage of the possibilities and limitations of content-based access to digital video information, we sketch, at a high level, the approaches taken in various video compression algorithms, principally the MPEG family. We then address approaches to shot and scene boundary detection, the choice of representative frames for browsing and for search, and the various browsing interfaces that have been developed. We finish with an overview of likely future developments in this area.

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Smeaton, A.F. (2000). Indexing, Browsing, and Searching of Digital Video and Digital Audio Information. In: Agosti, M., Crestani, F., Pasi, G. (eds) Lectures on Information Retrieval. ESSIR 2000. Lecture Notes in Computer Science, vol 1980. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45368-7_5

  • DOI: https://doi.org/10.1007/3-540-45368-7_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41933-4

  • Online ISBN: 978-3-540-45368-0

  • eBook Packages: Springer Book Archive
