Abstract
This paper describes Hyper Media News (HMNews), a system for the automated aggregation and consumption of information streams from digital television and the Internet. TV newscasts are automatically segmented, annotated and indexed. This information is then integrated with that available from Internet blogs, newspapers and press agencies. The end result is a set of innovative information services that support retrieval, recommendation and browsing of multi-modal news items across different production paradigms, ranging from traditional professional media, e.g. television and press, to new user-centric media platforms such as social networking sites, Internet forums and blogs.
Notes
Text processing is performed using the OpenNLP toolkit (http://opennlp.sourceforge.net/) trained on both English and Italian data. Further languages can be integrated at any time, provided that either automatic translation services or the corresponding training models are available.
Boilerplate stripping is done using PotaModule (http://sslmitdev-online.sslmit.unibo.it/wac/post_processing.php).
OpenLogos Machine Translation (http://logos-os.dfki.de/).
Additional information
Part of the content of this paper was presented at the 3rd International Workshop on Automated Information Extraction in Media Production, AIEMPro’10, Florence, 25-29 October 2010.
Cite this article
Messina, A., Montagnuolo, M., Di Massa, R. et al. Hyper Media News: a fully automated platform for large scale analysis, production and distribution of multimodal news content. Multimed Tools Appl 63, 427–460 (2013). https://doi.org/10.1007/s11042-011-0859-1
DOI: https://doi.org/10.1007/s11042-011-0859-1