skip to main content
10.1145/2523429.2532329acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmindtrekConference Proceedingsconference-collections
tutorial

Extraction of novelty concepts from TV broadcasts with longitudinal user experiments

Published: 01 October 2013 Publication History

Abstract

Recent popularity of online catch-up TV services has facilitated time-shifted TV viewing. However, contemporary services do not utilize the rich information available in broadcast TV content streams. Richer program descriptions and summaries help on-demand viewers to employ new information seeking behaviour to find interesting content. Techniques for content-based analysis of broadcast TV streams aim to improve access to relevant archived TV content and assist in efficient on-demand viewing. In this paper we introduce a methodology that extracts novelty concept words from Finnish broadcast TV stream. The methodology is employed in an online content analysis system, which executes near real-time analysis and indexing of seven free-to-air DVB TV channels. The methodology uses machine learning and statistical data mining techniques to extract descriptive novelty concepts automatically from TV program subtitles. Extracted concepts are further used to summarize and access program content in end-user services that facilitate search and browsing of archived TV content. We show results from user logs of nearly 3 000 sessions to demonstrate how novelty words have been used in our prototype services.

References

[1]
Google. Tesseract. https://code.google.com/p/tesseract-ocr/. {Accessed 13 September 2013}.
[2]
Joerg Schulenburg. Gocr. http://jocr.sourceforge.net/. {Accessed 13 September 2013}.
[3]
Kuukkelitv.fi. KuukkeliTV - Mediaseinä. http://www.kuukkelitv.fi/mediaseina. {Accessed 13 September 2013}.
[4]
Kuukkelitv.fi. KuukkeliTV - Uutispilvi. http://www.kuukkelitv.fi/uutispilvi. {Accessed 13 September 2013}.
[5]
M. Markou and S. Singh. Novelty detection: a review---part 1: statistical approaches. Signal Processing, 83(12):2481--2497, 2003.
[6]
Y. Matsuo and M. Ishizuka. Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools, 13(01):157--169, 2004.
[7]
P. Rayson and R. Garside. Comparing corpora using frequency profiling. In Proceedings of the Workshop on Comparing Corpora, CompareCorpora '00, pages 1--6, Stroudsburg, PA, USA, 2000. Association for Computational Linguistics.
[8]
Snowball. Snowball. http://snowball.tartarus.org/. {Accessed 13 September 2013}.
[9]
I. Soboroff and D. Harman. Novelty detection: the trec experience. In Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, HLT '05, pages 105--112, Stroudsburg, PA, USA, 2005. Association for Computational Linguistics.
[10]
Sukija. Sukija. http://sourceforge.net/projects/sukija/. {Accessed 13 September 2013}.
[11]
Voikko. Voikko. http://voikko.sourceforge.net/. {Accessed 13 September 2013}.
[12]
K. Zhang, H. Xu, J. Tang, and J. Li. Keyword extraction using support vector machine. In J. Yu, M. Kitsuregawa, and H. Leong, editors, Advances in Web-Age Information Management, volume 4016 of Lecture Notes in Computer Science, pages 85--96. Springer Berlin Heidelberg, 2006.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
AcademicMindTrek '13: Proceedings of International Conference on Making Sense of Converging Media
October 2013
360 pages
ISBN:9781450319928
DOI:10.1145/2523429
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. novelty concept extraction
  2. online TV
  3. video analysis

Qualifiers

  • Tutorial
  • Research
  • Refereed limited

Conference

AcademicMindTrek '13

Acceptance Rates

Overall Acceptance Rate 110 of 207 submissions, 53%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 67
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media