Media Augmentation and Personalization Through Multimedia Processing and Information Extraction

Dimitrova, Nevenka; Zimmerman, John; Janevski, Angel; Agnihotri, Lalitha; Haas, Norman; Li, Dongge; Bolle, Ruud; Velipasalar, Senem; Mcgeeand, Thomas; Nikolovska, Lira

doi:10.1007/1-4020-2164-X_8

Nevenka Dimitrova²⁹,
John Zimmerman³⁰,
Angel Janevski²⁹,
Lalitha Agnihotri²⁹,
Norman Haas³¹,
Dongge Li³²,
Ruud Bolle³¹,
Senem Velipasalar³¹,
Thomas Mcgeeand²⁹ &
…
Lira Nikolovska³³

Part of the book series: Human-Computer Interaction Series ((HCIS,volume 6))

253 Accesses
3 Altmetric

Abstract

This chapter details the value and methods for content augmentation and personalization among different media such as TV and Web. We illustrate how metadata extraction can aid in combining different media to produce a novel content consumption and interaction experience. We present two pilot content augmentation applications. The first, called MyInfo, combines automatically segmented and summarized TV news with information extracted from Web sources. Our news summarization and metadata extraction process employs text summarization, anchor detection and visual key element selection. Enhanced metadata allows matching against the user profile for personalization. Our second pilot application, called InfoSip, performs person identification and scene annotation based on actor presence. Person identification relies on visual, audio, text analysis and talking face detection. The InfoSip application links person identity information with filmographies and biographies extracted from the Web, improving the TV viewing experience by allowing users to easily query their TVs for information about actors in the current scene.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ahanger, G. and Little, T. D. C.: 1997, A System for Customized News Delivery from Video Archives, In: Proceedings of ICMCS’97, (June 3–6) IEEE Press.
Google Scholar
Ardissono, L., Portis, F. and Torasso, P.: 2001, Architecture of a System for the Generation of Personalized Electronic Program Guides, Eighth International Conference on User Modeling: Workshop on Personalization in Future TV. Sonthofen, Germany.
Google Scholar
Blum, D. W.: 1992, Method and Apparatus for Identifying and Eliminating Specific Material from Video Signals, US patent 5, 151, 788, September.
Google Scholar
Boguraev, B. and Neff, M.: 2000, Lexical Cohesion, Discourse Segmentation, and Document Summarization, Proc. RIAO International Conference. April, Paris.
Google Scholar
Bonner, E. L. and Faerber, N. A.: 1982, Editing System for Video Apparatus, US patent 4,314,285, February.
Google Scholar
Boykin, S. and Merlino, A.: 1999, Improving Broadcast News Segmentation Processing, IEEE International Conference on Multimedia and Computing Systems. Florence, Italy, 7–11 June.
Google Scholar
Boykin, S. and Merlino, A.: 2000, Machine Learning of Event Segmentation for News on Demand, Communications of the ACM 43(2), 35–41.
Article Google Scholar
Brown, M. G., Foote, J. T., Jones, G. J. F., Jones, S. K. and Young, S. J.: 1995, Automatic Content-Based Retrieval of Broadcast News, In: Proceedings of ACM Multimedia 95. San Francisco, CA: ACM Press, pp. 35–43.
Google Scholar
Brown, E. W. and Coden, A. R.: 2002, Capitalization Recovery for Text, In: A. R. Coden, E. W. Brown, and S. Srinivasan, (eds.): Information Retrieval Techniques for Speech Applications. Springer, pp. 11–22.
Google Scholar
Brusilovsky, P.: 2003, Adaptive Navigation Support in Educational Hypermedia: The Role of Student Knowledge Level and the Case for Meta-Adaptation. British Journal of Educational Technology 34(4), 487–497.
Article Google Scholar
Chen, L. and Faudemay, P.: 1997, Multi-Criteria Video Segmentation for TV News, In: Proceedings of IEEE First Workshop on Multimedia Signal Processing. Princeton, NJ.
Google Scholar
Connell, J.: 2002, Face Finding, http://www.research.ibm.com/ecvg/jhc_proj/faces.html..
Cotter, P. and Smyth, B.: 2000, PTV: Intelligent Personalized TV Guides, Seventeenth National Conference on Artificial Intelligence. Austin, TX, USA, pp. 957–964.
Google Scholar
Cardie, C.: 1997, Empirical Methods in Information Extraction, AI Magazine 18(4), 65–79.
Google Scholar
Dakss, J., Agamanolis, S., Chalom, E., Bove, V. M., Brooks, K., Nemirovsky, P. and Westner, A.: Hyper Soap: http://www.media.mit.edu/hypersoap..
Das, D. and ter Horst H.: 1998, Recommender Systems for TV. Technical Report WS-98-08 Recommender Systems, Papers from the 1998 Workshop, Madison, WI. Menlo Park, CA: AAAI Press, pp. 35–36.
Google Scholar
Dimitrova, N., Martino, J., Agnihotri, L. and Elenbaas, H.: 1999, Superhistograms for Video Representation, IEEE ICIP. Kobe, Japan.
Google Scholar
Dimitrova, N., Agnihotri, L. and Jasinschi, R.: 2003, Temporal video boundaries, In: Video A. Rosenfeld, D. Doermann, and D. Dementhon (eds.): Mining Book. Kluwer, pp. 61–90.
Google Scholar
Elenbaas, H., Dimitrova, N. and McGee, T.: 1999, PNRS-Personalized News Retrieval System, SPIE Multimedia Storage and Archiving Systems.
Google Scholar
Haas, N., Bolle, R., Dimitrova, N., Janevski, A. and Zimmerman, J.: 2002, Personalized News Through Content Augmentation and Profiling, In: Proceedings of International Conference on Image Processing 2002. Rochester, NY: IEEE Press, September 22–25.
Google Scholar
Hampapur, A., Jain, R. and Weymouth, T.: 1994, Digital Video Segmentation, In: Proceedings of the ACM International Conference on Multimedia. San Francisco, pp. 357–364.
Google Scholar
Hanjalic, A., Lagendijk, R. L. and Biemond, J.: 1999, Semiautomatic News Analysis, Indexing and Classification System Based on Topic Preselection, SPIE Storage and Retrieval for Image and Video Databases VII 3656, January pp. 86–97.
Google Scholar
IBM Intelligent Miner for Text™
Google Scholar
Janevski, A. and Dimitrova, N.: Web Information Extraction for Content Augmentation, In: Proceedings of ICME’ 02. Lausanne, Switzerland: IEEE Press, August 26–29.
Google Scholar
Janevski, A.: UniveristyIE: Extracting Information from University Web Pages, MS Thesis, University of Kentucky, Lexington.
Google Scholar
Jasinschi, R., Dimitrova, N., McGee, T., Agnihotri, L. and Zimmerman, J.: 2001, Video Scouting: An Architecture and System for the Integration of Multimedia Information in Personal TV Applications, IEEE Conference on Acoustics, Speech, and Signal Processing (ICASSP). Salt Lake City, UT, USA, May 7–11, pp. 1405–1408.
Google Scholar
Jiang, H. and Elmagarmid, A. K.: 1998, Spatial and Temporal Content-Based Access to Hypervideo Databases, VLDB Journal, 7(4), 226–238.
Google Scholar
Li, D., Wei, G., Sethi, I. K. and Dimitrova, N.: 2001, Person Identification in TV Shows, Journal on Electronic Imaging, Special Issue on Storage, Processing and Retrieval of Digital Media, October.
Google Scholar
Li, D., Dimitrova, N., Li, M. and Sethi, I. K.: 2003, Multimedia Content Processing Through Cross-modality Association, ACM Multimedia. November 2–5, Berkeley.
Google Scholar
Kubey, R. and Csikszentmihaly, M.: 1990, Television and the Quality of Life: How Viewing Shapes Everyday Experiences, Lawrence Erlbaum Associates. Hillsdale NJ, USA.
Google Scholar
Maybury, M. (ed.): February 2000, News On Demand, CACM 43(2): 33–34, 35–79.
Google Scholar
Mani, I., House, D. et al.: 1998, Tipster SUMMAC Text Summarization Evaluation, Final Report, October 1998. Mitre Technical Report MTR W980000138 and Technical report, DARPA.
Google Scholar
McGee, T. and Dimitrova, N.: 1999, Parsing TV Program Structures for Identification and Removal of Non-story Segments, SPIE Conference on Storage and Retrieval for Image and Video Databases VII (ei24).
Google Scholar
Merlino, A., Morey, D. and Maybury, M.: 1997, Broadcast Navigation Using Story segmentation, In: Proceedings of ACM MM’ 97. Seattle, WA: ACM Press, November, pp. 381–388.
Google Scholar
ABC Enhanced TV: http://heavy.etv.go.com/etvHome/..
Microsoft NAB demo of enhanced TV: http://www.microsoft.com/presspass/exec/craig/nab97.asp
Microsoft/CBS interactive TV: http://www.microsoft.com/presspass/press/2000/Sept00/CBSpr.asp
Naphade, M. R., Kozintsev, I. and Huang, T. S.: 2002, A Factor Graph Framework for Semantic Video Indexing, IEEE Transactions on Circuits and Systems for Video Technology 12(1), 40–52.
Google Scholar
Zimmerman, J., Marmaropoulos, G. and van Heerden, C.: 2001, Interface Design of Video Scout: A Selection, Recording, and Segmentation System for TVs, In: Proceedings of Human Computer Interaction International (HCII) 1, New Orleans, LA, USA, August 5–10, pp. 277–281.
Google Scholar

Download references

Author information

Authors and Affiliations

Philips Research, 345 Scarborough Rd., Briarelff Manor, NY, 10510, USA
Nevenka Dimitrova, Angel Janevski, Lalitha Agnihotri & Thomas Mcgeeand
Human-Computer Interaction Institute, Carnegie Mellon, Pittsburgh, PA, USA
John Zimmerman
IBM T.J. Watson, 30 Saw Mill River Road, Hawthorne, NY, 10532, USA
Norman Haas, Ruud Bolle & Senem Velipasalar
Motorola Labs, 1301 East Algonquin Road, Schaumburg, Illinois, 60196
Dongge Li
Department of Architecture, MIT, 265 Massachusetts Avenue N51-340, Cambridge, MA, 02139, USA
Lira Nikolovska

Authors

Nevenka Dimitrova
View author publications
You can also search for this author in PubMed Google Scholar
John Zimmerman
View author publications
You can also search for this author in PubMed Google Scholar
Angel Janevski
View author publications
You can also search for this author in PubMed Google Scholar
Lalitha Agnihotri
View author publications
You can also search for this author in PubMed Google Scholar
Norman Haas
View author publications
You can also search for this author in PubMed Google Scholar
Dongge Li
View author publications
You can also search for this author in PubMed Google Scholar
Ruud Bolle
View author publications
You can also search for this author in PubMed Google Scholar
Senem Velipasalar
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Mcgeeand
View author publications
You can also search for this author in PubMed Google Scholar
Lira Nikolovska
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Dimitrova, N. et al. (2004). Media Augmentation and Personalization Through Multimedia Processing and Information Extraction. In: Personalized Digital Television. Human-Computer Interaction Series, vol 6. Springer, Dordrecht. https://doi.org/10.1007/1-4020-2164-X_8

Download citation

DOI: https://doi.org/10.1007/1-4020-2164-X_8
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-2163-3
Online ISBN: 978-1-4020-2164-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics