
Hyper Media News: a fully automated platform for large scale analysis, production and distribution of multimodal news content

Multimedia Tools and Applications

Abstract

This paper describes Hyper Media News (HMNews), a system for the automated aggregation and consumption of information streams from digital television and the Internet. TV newscasts are automatically segmented, annotated and indexed. This information is then integrated with that available from Internet blogs, newspapers and press agencies. The end result is a set of innovative information services that support retrieval, recommendation and browsing of multimodal news items across different production paradigms, ranging from traditional professional media, e.g. television and press, to new user-centric media platforms such as social networking sites, internet forums and blogs.




Notes

  1. http://editor.googlemashups.com/

  2. http://pipes.yahoo.com/

  3. http://www.outbrain.com/

  4. Text processing is performed using the OpenNLP toolkit (http://opennlp.sourceforge.net/) trained on both English and Italian data. Further languages can be integrated at any time, provided that either automatic translation services or the corresponding training models are available.

  5. http://search.cpan.org/dist/AI-Categorizer/

  6. http://lucene.apache.org/

  7. Boilerplate stripping is done using PotaModule (http://sslmitdev-online.sslmit.unibo.it/wac/post_processing.php).

  8. http://annalupini.blog.kataweb.it/

  9. http://mariotedeschini.blog.kataweb.it/

  10. www.lastampa.it/_web/CMSTP/tmplrubriche/giornalisti/hrubrica.asp?ID_blog=196

  11. http://mediablog.corriere.it/

  12. http://laderiva.corriere.it/

  13. http://leviedellasia.corriere.it/

  14. www.lastampa.it/_web/cmstp/tmplrubriche/giornalisti/hrubrica.asp?ID_blog=124

  15. www.lastampa.it/_web/cmstp/tmplrubriche/giornalisti/hrubrica.asp?ID_blog=242

  16. www.lastampa.it/_web/cmstp/tmplrubriche/giornalisti/hrubrica.asp?ID_blog=197

  17. http://search.cpan.org/~kwilliams/AI-Categorizer/

  18. http://iorio.blog.kataweb.it/

  19. OpenLogos Machine Translation—http://logos-os.dfki.de/


Author information


Corresponding author

Correspondence to Maurizio Montagnuolo.

Additional information

Part of the content of this paper was presented at the 3rd International Workshop on Automated Information Extraction in Media Production (AIEMPro’10), Florence, 25-29 October 2010.


About this article

Cite this article

Messina, A., Montagnuolo, M., Di Massa, R. et al. Hyper Media News: a fully automated platform for large scale analysis, production and distribution of multimodal news content. Multimed Tools Appl 63, 427–460 (2013). https://doi.org/10.1007/s11042-011-0859-1


  • DOI: https://doi.org/10.1007/s11042-011-0859-1
