skip to main content
10.1145/2063576.2063850acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
poster

Search result diversification for enterprise data

Published:24 October 2011Publication History

ABSTRACT

Search result diversification aims to return a list of diversified relevant documents in order to satisfy different user information needs. Most of the efforts focused on Web Search, and few studies have considered another important search domain, i.e., enterprise search. Unlike Web search, enterprise search deals with both unstructured and structured data. In this paper, we propose to integrate the structured and unstructured data to discover meaningful query subtopics in search result diversification. Experimental results show that integrating structured and unstructured information allows us to discover high quality query, which are effective in diversifying the retrieval results.

References

  1. R. Agrawal, S. Gollapudi, A. Halverson, and S. Ieong. Diversifying Search Results. In WSDM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. B. Carterette and P. Chandar. Probabilistic Models of Novel Document Rankings for Faceted Topic Retrieval. In CIKM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Z. Chen and T. Li. Addressing diverse user preferences in sql-query-result navigation. In Proceedings of SIGMOD'07, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. C. L. A. Clarke, N. Craswell, and I. Soboroff. Overview of the TREC 2009 Web Track. In Proceedings of TREC'09, 2009.Google ScholarGoogle Scholar
  5. E. Demidova, P. Fankhauser, X. Zhou, and W. Nejdl. Divq: Diversification for keyword search over structured databases. In Proceedings of SIGIR'10, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Z. Dou, S. Hu, K. Chen, R. Song, and J. R. Wen. Multi-dimensional search result diversification. In Proceedings of WSDM'11, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. H. Fang and C. Zhai. Semantic Term Matching in Axiomatic Approaches to Information Retrieval. In SIGIR, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Feldman and C. Sherman. The High Cost of Not Finding Information. In Technical Report No. 29127, IDC, 2003.Google ScholarGoogle Scholar
  9. S. Geva. Gpx - gardens point xml ir at inex 2006. In Proceedings of INEX'06, 2006.Google ScholarGoogle Scholar
  10. C. Hauff and D. Hiemstra. University of Twente @ TREC 2009: Indexing half a billion web pages. In Proceedings of TREC'09, 2009.Google ScholarGoogle Scholar
  11. T. Hofmann. Probabilistic latent semantic analysis. In Proceedings of UAI'99, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. P. Lubell-Doughtie and K. Hofmann. Improving result diversity using probabilistic latent semantic analysis. In Proceedings of DIR'11, 2011.Google ScholarGoogle Scholar
  13. R. L. T. Santos, C. Macdonald, and I. Ounis. Exploiting Query Reformulations for Web Search Result Diversification. In WWW, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. L. T. Santos, C. Macdonald, and I. Ounis. Selectively Diversifying Web Search Results. In CIKM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval. In Proceedings of SIGIR'01, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. W. Zheng, X. Wang, H. Fang, and H. Cheng. An exploration of pattern-based subtopic modeling for search result diversification. In Proceedings of JCDL'11, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Search result diversification for enterprise data

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge management
      October 2011
      2712 pages
      ISBN:9781450307178
      DOI:10.1145/2063576

      Copyright © 2011 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 October 2011

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • poster

      Acceptance Rates

      Overall Acceptance Rate1,861of8,427submissions,22%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader