research-article

Enhancing news organization for convenient retrieval and browsing

Authors:

Hanqing LuAuthors Info & Claims

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 10, Issue 1

Article No.: 1, Pages 1 - 20

https://doi.org/10.1145/2488732

Published: 27 December 2013 Publication History

Abstract

To facilitate users to access news quickly and comprehensively, we design a news search and browsing system named GeoVisNews, in which the news elements of “Where”, “Who”, “What” and “When” are enhanced via news geo-localization, image enrichment and joint ranking, respectively. For news geo-localization, an Ordinal Correlation Consistent Matrix Factorization (OCCMF) model is proposed to maintain the relevance rankings of locations to a specific news document and simultaneously capture intra-relations among locations and documents. To visualize news, we develop a novel method to enrich news documents with appropriate web images. Specifically, multiple queries are first generated from news documents for image search, and then the appropriate images are selected from the collected web images by an intelligent fusion approach based on multiple features. Obtaining the geo-localized and image enriched news resources, we further employ a joint ranking strategy to provide relevant, timely and popular news items as the answer of user searching queries. Extensive experiments on a large-scale news dataset collected from the web demonstrate the superior performance of the proposed approaches over related methods.

References

[1]

Amitay, E., Sivan, R., and Soffer, A. 2004. Web-a-Where: Geotagging web content. In Proceedings of SIGIR. 273--280.

Digital Library

[2]

Andogah, G. 2010. Geographically constrained information retrieva. Ph.D. thesis, University of Groningen.

[3]

Brin, S. and Page, L. 1998. The anatomy of a large-scale hypertextual web search engine. In Proceedings of WWW. 107--117.

Digital Library

[4]

Candeias, R. and Martins, B. 2011. Associating relevant photos to georeferenced textual documents through rank aggregation. In Proceedings of the Terra Cognita Workshop.

[5]

Christel, M. G., Hauptmann, A. G., Wactlar, H. D., and Ng, T. D. 2002. Collages as dynamic summaries for news video. In Proceedings of MM. 561--569.

Digital Library

[6]

Cilibrasi, R. L. and Vitanyi, P. M. B. 2007. The google similarity distance. IEEE Trans. Knowl. Data Eng. 19, 3, 370--383.

Digital Library

[7]

Coyne, B. and Sproat, R. 2001. WordsEye: An automatic text-to-scene conversion system. In Proceedings of Computer Graphics and Interactive Techniques. 487--496.

Digital Library

[8]

Delgado, D., Magalhaes, J., and Correia, N. 2010. Assisted news reading with automated illustrations. In Proceedings of MM. 1647--1650.

Digital Library

[9]

Deschacht, K. and Moens, M. 2008. Finding the best picture: Cross-media retrieval of content. In Proceedings of ECIR.

Digital Library

[10]

Ding, J., Gravano, L., and Shivakumar, N. 2000. Computing geographical scopes of web sources. In Proceedings of the International Conference on Very Large Data Bases. 545--556.

Digital Library

[11]

Gey, F., Larson, R., Sanderson, M., Joho, H., Clough, P., and Petras, V. 2005. GeoCLEF: The CLEF 2005 cross-language geographic information retrieval track overview. In Proceedings of CLEF'05. 908--919.

Digital Library

[12]

Gravier, G., Guinaudeau, C., Lecorvé, G., and Sébillot, P. 2011. Exploiting speech for automatic TV delinearization: From streams to cross-media semantic navigation. J. Image Video Proc.

[13]

Huston, S. and Croft, W. B. 2010. Evaluating verbose query processing techniques. In Proceedings of SIGIR. 291--298.

Digital Library

[14]

Järvelin, K. and Kekäläinen, J. 2002. Cumulated gain-based evaluation of IR techniques. ACM Trans. Inf. Syst. 20, 4, 422--446.

Digital Library

[15]

Jiao, B., Yang, L., Xu, J., and Wu, F. 2010. Visual summarization of web pages. In Proceedings of SIGIR. 499--506.

Digital Library

[16]

Joachims, T. 2002. Optimizing search engines using clickthrough data. In Proceedings of KDD. 31--43.

Digital Library

[17]

Jones, K. S., Walker, S., and Robertson, S. E. 2000. A probabilistic model of information retrieval: Development and comparative experiments. Inf. Proc. Manage. 36, 6, 779--808.

Digital Library

[18]

Joshi, D., Wang, J. Z., and Li, J. 2004. The story picturing engine: Finding elite images to illustrate a story using mutual reinforcement. In Proceedings of the ACM Workshop on Multimedia Information Retrieval. 119--126.

Digital Library

[19]

King, B. M. and Minium, E. M. 1999. Statistical Reasoning in Psychology and Education. Wiley, New York.

[20]

Kumaran, G. and Carvalho, V. R. 2009. Reducing long queries using query quality predictors. In Proceedings of SIGIR. 564--571.

Digital Library

[21]

Law-To, J., Grefenstete, G., Gauvain, J.-L., Gravier, G., Lamel, L., and Despres, J. 2009. VoxaleadNews: Robust automatic segmentation of video content into browsable and searchable subjects. In Proceedings of MM.

Digital Library

[22]

Leidner, J. L. 2008. Toponym resolution in text: Annotation, evaluation and applications of spatialgrounding of place names.dissertation.com.

[23]

Li, Z., Liu, J., Zhu, X., Liu, T., and Lu, H. 2010a. Image annotation using multi-correlation probabilistic matrix factorization. In Proceedings of MM. 1187--1190.

Digital Library

[24]

Li, Z., Liu, J., Zhu, X., and Lu, H. 2010b. Multi-modal multi-correlation person-centric news retrieval. In Proceedings of CIKM.

Digital Library

[25]

Li, Z., Wang, M., Liu, J., Xu, C., and Lu, H. 2011. News contextualization with geographic and visual information. In Proceedings of MM.

Digital Library

[26]

Liu, S., Zhou, M. X., Pan, S., Song, Y., Qian, W., Cai, W., and Lian, X. 2012. Tiara: Interactive, topic-based visual text summarization and analysis. ACM Trans. Intell. Syst. Tech. 3, 2, 25--25.

Digital Library

[27]

Lu, X., Pang, Y., Hao, Q., and Zhang, L. 2009. Visualizing textual travelogue with location relevant images. In Proceedings of International Workshop on Location Based Social Networks.

Digital Library

[28]

Martins, B. 2009. Geographically aware web textmining. Ph.D. thesis, University of Lisbon.

[29]

McGurk, H. and MacDonald, J. 1976. Hearing lips and seeing voices. Nature 264, 5588, 746--748.

[30]

Ohtsuki, K., Bessho, K., Matsuo, Y., Matsunaga, S., and Hayashi, Y. 2006. Automatic multimedia indexing: Combining audio, speech, and visual information to index broadcast news. IEEE Signal Process. Mag. 23, 2, 69--78.

[31]

Okuoka, T., Takahashi, T., Deguchi, D., Ide, I., and Murase, H. 2009. Labeling news topic threads with Wikipedia entries. In Proceedings of the IEEE International Symposium on Multimedia. 501--504.

Digital Library

[32]

Olivares, X., Ciaramita, M., and van Zwol, R. 2008. Boosting image retrieval through aggregating search results based on visual annotations. In Proceedings of MM. 189--198.

Digital Library

[33]

Page, L., Brin, S., Motwani, R., and Winograd, T. 1999. The PageRank citation ranking: Bringing order to the web. Tech. rep. Stanford Digital Library Technologies Project.

[34]

Rother, C., Bordeaux, L., Hamadi, Y., and Autocollage, A. B. 2006. AutoCollage. In Proceedings of SIGGRAPH.

Digital Library

[35]

Salakhutdinov, R. and Mnih, A. 2007. Probabilistic matrix factorization. In Proceedings of NIPS. 1257--1264.

[36]

Strobelt, H., Oelke, D., Rohrdantz, C., Stoffel, A., Keim, D. A., and Deussen, O. 2009. Document cards: A top trumps visualization for documents. IEEE Trans. Visual. Comput. Graph. 15, 6, 1145--1152.

Digital Library

[37]

Sturm, J. F. 2009. Site matters: The value of local newspaper web sites. Tech. rep., NAA.

[38]

Teevan, J., Cutrell, E., Fisher, D., Drucker, S. M., Ramos, G., Andre, P., and Hu, C. 2009. Visual snippets: Summarizing web pages for search and revisitation. In Proceedings of International Conference on Human Factors in Computing Systems. 2023--2032.

Digital Library

[39]

Wang, B., Li, Z., Li, M., and Ma, W.-Y. 2006a. Large-scale duplicate detection for web image search. In Proceedings of ICME. 353--356.

[40]

Wang, J., Quan, L., Sun, J., Tang, X., and Shum, H.-Y. 2006b. Picture collage. In Proceedings of CVPR.

Digital Library

[41]

Yan, R. and Hauptmann, A. G. 2003. The combination limit in multimedia retrieval. In Proceedings of MM. 339--342.

Digital Library

[42]

Zhang, L., Chen, L., Jing, F., Deng, K., and Ma, W.-Y. 2006. Enjoyphoto—A verticcal image search engine for enjoying high-quality photos. In Proceedings of MM. 367--376.

Digital Library

[43]

Zhao, R. and Grosky, W. I. 2002. Narrowing the semantic gap—Improved text-based web document retrieval using visual features. ACM Trans. on Multimedia 4, 2, 189--200.

Digital Library

[44]

Zong, W., Wu, D., Sun, A., Lim, E.-P., and Goh, D. H.-L. 2005. On assigning place names to geography related web pages. In Proceedings of ACM/IEEE-CS Joint Conference on Digital Libraries. ACM, New York, USA, 354--362.

Digital Library

Cited By

Zhai ZZhang XFang FYao L(2023)Text classification of Chinese news based on multi-scale CNN and LSTM hybrid modelMultimedia Tools and Applications10.1007/s11042-023-14450-w82:14(20975-20988)Online publication date: 6-Feb-2023
https://doi.org/10.1007/s11042-023-14450-w
More AChaudhuri S(2019)A Pseudo-likelihood Approach for Geo-localization of Events from Crowd-sourced Sensor-MetadataACM Transactions on Multimedia Computing, Communications, and Applications10.1145/332170115:3(1-26)Online publication date: 20-Aug-2019
https://doi.org/10.1145/3321701
de Lucas EMarcuello PParcerisa JGonzalez A(2019)Visibility Rendering OrderIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2018.286624630:2(473-485)Online publication date: 1-Feb-2019
https://dl.acm.org/doi/10.1109/TPDS.2018.2866246
Show More Cited By

Index Terms

Enhancing news organization for convenient retrieval and browsing
1. Information systems
  1. Information retrieval
  2. Information systems applications
    1. Multimedia information systems

Recommendations

News contextualization with geographic and visual information
MM '11: Proceedings of the 19th ACM international conference on Multimedia

In this paper, we investigate the contextualization of news documents with geographic and visual information. We propose a matrix factorization approach to analyze the location relevance for each news document. We also propose a method to enrich the ...
Multi-view Latent Hashing for Efficient Multimedia Search
MM '15: Proceedings of the 23rd ACM international conference on Multimedia

Hashing techniques have attracted broad research interests in recent multimedia studies. However, most of existing hashing methods focus on learning binary codes from data with only one single view, and thus cannot fully utilize the rich information ...
Probabilistic Matrix Factorization With Semantic And Visual Neighborhoods For Image Tag Completion
ICMR '15: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval

We present an image tag completion method, namely PMF-SVN, where the key idea is to exploit images' Semantically and Visually similar Neighborhoods (SVNs) in the learning process of a Probabilistic Matrix Factorization (PMF) framework. We propose a two-...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Multimedia Computing, Communications, and Applications

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 10, Issue 1

December 2013

166 pages

ISSN:1551-6857

EISSN:1551-6865

DOI:10.1145/2559928

Issue’s Table of Contents

Copyright © 2013 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 December 2013

Accepted: 01 March 2013

Revised: 01 September 2012

Received: 01 June 2012

Published in TOMM Volume 10, Issue 1

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed

Funding Sources

Ministry of Science and Technology of the People's Republic of China
Open Project Program of the National Laboratory of Pattern Recognition (NLPR)
National Natural Science Foundation of China
Program for New Century Excellent Talents in University

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

18
Total Citations
View Citations
292
Total Downloads

Downloads (Last 12 months)4
Downloads (Last 6 weeks)3

Reflects downloads up to 17 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Zhai ZZhang XFang FYao L(2023)Text classification of Chinese news based on multi-scale CNN and LSTM hybrid modelMultimedia Tools and Applications10.1007/s11042-023-14450-w82:14(20975-20988)Online publication date: 6-Feb-2023
https://doi.org/10.1007/s11042-023-14450-w
More AChaudhuri S(2019)A Pseudo-likelihood Approach for Geo-localization of Events from Crowd-sourced Sensor-MetadataACM Transactions on Multimedia Computing, Communications, and Applications10.1145/332170115:3(1-26)Online publication date: 20-Aug-2019
https://doi.org/10.1145/3321701
de Lucas EMarcuello PParcerisa JGonzalez A(2019)Visibility Rendering OrderIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2018.286624630:2(473-485)Online publication date: 1-Feb-2019
https://dl.acm.org/doi/10.1109/TPDS.2018.2866246
Jelodar HWang YYuan CFeng XJiang XLi YZhao L(2019)Latent Dirichlet allocation (LDA) and topic modelingMultimedia Tools and Applications10.1007/s11042-018-6894-478:11(15169-15211)Online publication date: 1-Jun-2019
https://dl.acm.org/doi/10.1007/s11042-018-6894-4
Li HTian ZMaeda RChen XFeng JXu JBahar I(2018)Co-manage power delivery and consumption for manycore systems using reinforcement learningProceedings of the International Conference on Computer-Aided Design10.1145/3240765.3240787(1-8)Online publication date: 5-Nov-2018
https://dl.acm.org/doi/10.1145/3240765.3240787
Li HXu JWang ZMaeda RYang PTian Z(2018)Workload-Aware Adaptive Power Delivery System Management for Many-Core ProcessorsIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2017.277808037:10(2076-2086)Online publication date: 1-Oct-2018
https://dl.acm.org/doi/10.1109/TCAD.2017.2778080
Yudi JHumberto Llanos CHuebner M(2017)System-level design space identification for Many-Core Vision ProcessorsMicroprocessors & Microsystems10.1016/j.micpro.2017.05.01352:C(2-22)Online publication date: 1-Jul-2017
https://dl.acm.org/doi/10.1016/j.micpro.2017.05.013
Wang XLi ZTang J(2017)Multimedia news QAImage and Vision Computing10.1016/j.imavis.2017.01.00460:C(162-170)Online publication date: 1-Apr-2017
https://dl.acm.org/doi/10.1016/j.imavis.2017.01.004
Li ZLi Z(2017)Understanding-Oriented Multimedia News RetrievalUnderstanding-Oriented Multimedia Content Analysis10.1007/978-981-10-3689-7_5(101-129)Online publication date: 27-May-2017
https://doi.org/10.1007/978-981-10-3689-7_5
Li ZTang JWang XLiu JLu H(2016)Multimedia News Summarization in SearchACM Transactions on Intelligent Systems and Technology10.1145/28229077:3(1-20)Online publication date: 1-Feb-2016
https://dl.acm.org/doi/10.1145/2822907
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Issue’s Table of Contents