skip to main content
10.1145/2811222.2811234acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Efficient Visualisation of the Relative Distribution of Keyword Search Results in a Corpus Data Cube

Published: 22 October 2015 Publication History

Abstract

Most keyword searches target precision for finding the most relevant document. However some target recall, finding all relevant documents. Our system supports high recall searches that return hundreds or thousands of relevant results. In particular, it provides a visualization that shows the distribution of search results relative to the distribution of items for the entire corpus. Such relative distributional features include over and under representation, clusters and outliers. The contribution of this paper is efficient visualisation, that is, how to provide the best relative distribution view for a given data cube size. This requirement is translated to: for which limited size meta-data summary cube are search results disambiguated the most in our relative distribution view. We identify metrics and several algorithms for such a summary cube selection.

References

[1]
S. Chaudhuri and U. Dayal. An overview of data warehousing and OLAP technology. ACM SIGMOD Record, 26 (1): 65--74, 1997.
[2]
Flamenco faceted search system. http://flamenco.berkeley.edu/.
[3]
M. Handcock and M. Morris. Relative Distribution Methods. Social Methodology, 28 (1): 53--97, 1998.
[4]
J. Hartigan and B. Kleiner. A mosaic of television rating. In The American Statistician, 38 (1): 32--35, 1984.
[5]
M. Hearst. Clustering versus faceted categories for information exploration. In CACM, 49 (4): 59--61, 2006.
[6]
Lucene. http://lucene.apache.org/.
[7]
A. Inselberg. The plane with parallel coordinates. The visual computer, 1 (4): 69--91, 1985.
[8]
C. Ordonez, Z. Chen, and J. García-García. Interactive exploration and visualization of OLAP cubes. In ACM DOLAP, 83--87, 2011.
[9]
C. Plaisant, B. Shneiderman, K. Doan and T. Burns. Interface and data architecture for query preview in networked information systems. ACM Transactions on Information Systems 17 (3): 320- 341, 1999.
[10]
M. Sifer, J. Lin, Y. Watanobe and S. Bhalla. Integrating Keyword Search with Multiple Dimension Tree Views over a Summary Corpus Data Cube. In Proc. ACM SIGMOD: 1167--1170, 2010.
[11]
M. Sifer and J. Lin. Refining search results with facet landscapes. In Proc. ACM SGIR: 181, 2008.

Cited By

View all
  • (2015)DOLAP 2015 Workshop SummaryProceedings of the 24th ACM International on Conference on Information and Knowledge Management10.1145/2806416.2806876(1939-1940)Online publication date: 17-Oct-2015

Index Terms

  1. Efficient Visualisation of the Relative Distribution of Keyword Search Results in a Corpus Data Cube

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          DOLAP '15: Proceedings of the ACM Eighteenth International Workshop on Data Warehousing and OLAP
          October 2015
          108 pages
          ISBN:9781450337854
          DOI:10.1145/2811222
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 22 October 2015

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. faceted search
          2. olap
          3. user interface
          4. visualisation

          Qualifiers

          • Short-paper

          Conference

          CIKM'15
          Sponsor:

          Acceptance Rates

          DOLAP '15 Paper Acceptance Rate 8 of 31 submissions, 26%;
          Overall Acceptance Rate 29 of 79 submissions, 37%

          Upcoming Conference

          CIKM '25

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 10 Feb 2025

          Other Metrics

          Citations

          Cited By

          View all
          • (2015)DOLAP 2015 Workshop SummaryProceedings of the 24th ACM International on Conference on Information and Knowledge Management10.1145/2806416.2806876(1939-1940)Online publication date: 17-Oct-2015

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media