skip to main content
10.1145/1183614.1183670acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Ranking web objects from multiple communities

Published: 06 November 2006 Publication History

Abstract

Vertical search is a promising direction as it leverages domain-specific knowledge and can provide more precise information for users. In this paper, we study the Web object-ranking problem, one of the key issues in building a vertical search engine. More specifically, we focus on this problem in cases when objects lack relationships between different Web communities, and take high-quality photo search as the test bed for this investigation. We proposed two score fusion methods that can automatically integrate as many Web communities (Web forums) with rating information as possible. The proposed fusion methods leverage the hidden links discovered by a duplicate photo detection algorithm, and aims at minimizing score differences of duplicate photos in different forums. Both intermediate results and user studies show the proposed fusion methods are practical and efficient solutions to Web object ranking in cases we have described. Though the experiments were conducted on high-quality photo ranking, the proposed algorithms are also applicable to other ranking problems, such as movie ranking and music ranking.

References

[1]
Google image search. http://images.google.com.
[2]
Google local search. http://local.google.com/.
[3]
Google news search. http://news.google.com.
[4]
Google paper search. http://Scholar.google.com.
[5]
Google product search. http://froogle.google.com.
[6]
Google video search. http://video.google.com.
[7]
Scientific literature digital library. http://citeseer.ist.psu.edu.
[8]
Yahoo image search. http://images.yahoo.com.
[9]
R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. New York: ACM Press; Harlow, England: Addison-Wesley, 1999.
[10]
W. Bin, L. Zhiwei, L. Ming Jing, and M. Wei-Ying. Large-scale duplicate detection for web image search. In Proceedings of the International Conference on Multimedia and Expo, page 353, 2006.
[11]
S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. In Computer Networks, volume 30, pages 107--117, 1998.
[12]
C. Burges, T. Shaked, E. Renshaw, A. Lazier, M. Deeds, N. Hamilton, and G. Hullender. Learning to rank using gradient descent. In Proceedings of the 22nd international conference on Machine learning, pages 89--96, 2005.
[13]
C. Dwork, R. Kumar, M. Naor, and D. Sivakumar. Rank aggregation methods for the web. In Proceedings 10th International Conference on World Wide Web, pages 613--622, Hong-Kong, 2001.
[14]
R. Fagin, R. Kumar, and D. Sivakumar. Comparing top k lists. SIAM Journal on Discrete Mathematics, 17(1):134--160, 2003.
[15]
Y. Freund, R. Iyer, R. E. Schapire, and Y. Singer. An efficient boosting algorithm for combining preferences. Journal of Machine Learning Research, 4(1):933--969(37), 2004.
[16]
IMDB. Formula for calculating the top rated 250 titles in imdb. http://www.imdb.com/chart/top.
[17]
T. Joachims. Optimizing search engines using clickthrough data. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 133--142, 2002.
[18]
J. M. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604--632, 1999.
[19]
R. Nallapati. Discriminative models for information retrieval. In Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pages 64--71, 2004.
[20]
Z. Nie, Y. Ma, J.-R. Wen, and W.-Y. Ma. Object-level web information retrieval. In Technical Report of Microsoft Research, volume MSR-TR-2005-11, 2005.
[21]
Z. Nie, Y. Zhang, J.-R. Wen, and W.-Y. Ma. Object-level ranking: Bringing order to web objects. In Proceedings of the 14th international conference on World Wide Web, pages 567--574, Chiba, Japan, 2005.
[22]
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. In Technical report, Stanford Digital Libraries, 1998.
[23]
A. Savakis, S. Etz, and A. Loui. Evaluation of image appeal in consumer photography. In SPIE Human Vision and Electronic Imaging, pages 111--120, 2000.
[24]
D. Sullivan. Hitwise search engine ratings. Search Engine Watch Articles, http://searchenginewatch.com/reports/article.php/3099931, August 23, 2005.
[25]
S. Susstrunk and S. Winkler. Color image quality on the internet. In IS&T/SPIE Electronic Imaging 2004: Internet Imaging V, volume 5304, pages 118--131, 2004.
[26]
H. Tong, M. Li, Z. H.J., J. He, and Z. C.S. Classification of digital photos taken by photographers or home users. In Pacific-Rim Conference on Multimedia (PCM), pages 198--205, 2004.
[27]
W. Xi, B. Zhang, Z. Chen, Y. Lu, S. Yan, W.-Y. Ma, and E. A. Fox. Link fusion: a unified link analysis framework for multi-type interrelated data objects. In Proceedings of the 13th international conference on World Wide Web, pages 319--327, 2004.

Cited By

View all
  • (2021)Web Object Ranking for Location-Based Web Object SearchAdvances in Smart Communication and Imaging Systems10.1007/978-981-15-9938-5_16(151-165)Online publication date: 14-Apr-2021
  • (2021)Geographical Labeling of Web Objects Through Maximum Marginal ClassificationAdvances in Data Science and Information Engineering10.1007/978-3-030-71704-9_52(713-724)Online publication date: 30-Oct-2021
  • (2018)Community Adaptive Search EnginesInternational Journal of Advanced Intelligence Paradigms10.1504/IJAIP.2009.0267631:4(432-443)Online publication date: 19-Dec-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge management
November 2006
916 pages
ISBN:1595934332
DOI:10.1145/1183614
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 November 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. image search
  2. ranking
  3. web objects

Qualifiers

  • Article

Conference

CIKM06
CIKM06: Conference on Information and Knowledge Management
November 6 - 11, 2006
Virginia, Arlington, USA

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2021)Web Object Ranking for Location-Based Web Object SearchAdvances in Smart Communication and Imaging Systems10.1007/978-981-15-9938-5_16(151-165)Online publication date: 14-Apr-2021
  • (2021)Geographical Labeling of Web Objects Through Maximum Marginal ClassificationAdvances in Data Science and Information Engineering10.1007/978-3-030-71704-9_52(713-724)Online publication date: 30-Oct-2021
  • (2018)Community Adaptive Search EnginesInternational Journal of Advanced Intelligence Paradigms10.1504/IJAIP.2009.0267631:4(432-443)Online publication date: 19-Dec-2018
  • (2018)Probabilistic classification techniques to perform geographical labeling of web objectsCluster Computing10.1007/s10586-018-1822-yOnline publication date: 14-Feb-2018
  • (2017)Geographical labeling of web objects through density estimator model2017 International Conference on Computing Methodologies and Communication (ICCMC)10.1109/ICCMC.2017.8282649(1130-1135)Online publication date: Jul-2017
  • (2015)A survey analysis on duplicate detection in Hierarchical Data2015 International Conference on Pervasive Computing (ICPC)10.1109/PERVASIVE.2015.7087099(1-6)Online publication date: Jan-2015
  • (2015)Levenshtein distance algorithm for efficient and effective XML duplicate detection2015 International Conference on Computer, Communication and Control (IC4)10.1109/IC4.2015.7375698(1-5)Online publication date: Sep-2015
  • (2013)Efficient and Effective Duplicate Detection in Hierarchical DataIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2012.6025:5(1028-1041)Online publication date: 1-May-2013
  • (2013)A Stochastic Model for Measuring Popularity and Reliability in Social Network SystemsProceedings of the 2013 IEEE International Conference on Systems, Man, and Cybernetics10.1109/SMC.2013.84(462-467)Online publication date: 13-Oct-2013
  • (2008)Adaptive search engines as discovery gamesProceedings of the 6th International Conference on Advances in Mobile Computing and Multimedia10.1145/1497185.1497280(444-449)Online publication date: 24-Nov-2008
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media