Skip to main content

Visually Exploring Concept-Based Fuzzy Clusters in Web Search Results

  • Chapter
Book cover Advances in Web Intelligence and Data Mining

Part of the book series: Studies in Computational Intelligence ((SCI,volume 23))

Abstract

Users of web search systems often have difficulty determining the relevance of search results to their information needs. Clustering has been suggested as a method for making this task easier. However, this introduces new challenges such as naming the clusters, selecting multiple clusters, and re-sorting the search results based on the cluster information. To address these challenges, we have developed Concept Highlighter, a tool for visually exploring concept-based fuzzy clusters in web search results. This tool automatically generates a set of concepts related to the users’ queries, and performs single-pass fuzzy c-means clustering on the search results using these concepts as the cluster centroids. A visual interface is provided for interactively exploring the search results. In this paper, we describe the features of Concept Highlighter and its use in finding relevant documents within the search results through concept selection and document surrogate highlighting.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. ACM. ACM computing classification system. http://www.acm.org/class/.

    Google Scholar 

  2. James C. Bezdek. Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York, 1981.

    MATH  Google Scholar 

  3. Cynthia A. Brewer. www.colorbrewer.org, 2005.

    Google Scholar 

  4. Douglass Cutting, David Karger, Jan Pedersend, and John Tukey. Scatter/gather: A cluster-based approach to browsing large document collections. In Proceedings of the A CM SIGIR. Conference on Research and Development in Information Retrieval, 1992.

    Google Scholar 

  5. G. W. Furnas, T. K. Landauer, L. M. Gomez, and S. T. Dumais. The vocabulary problem in human-system communication. Communications of the ACM, 30(11), 1987.

    Google Scholar 

  6. Google. Google web API. www.google.com/apis/, 2005.

    Google Scholar 

  7. Grokker. http://www.grokker.com/.

    Google Scholar 

  8. Marti Hearst and Jan Pedersen. Reexamining the cluster hypothesis: Scatter/gather on retrieval results. In Proceedings of the ACM SIGIR. Conference on Research and Development in Information Retrieval, 1996.

    Google Scholar 

  9. Orland Hoeber, Xue-Dong Yang, and Yiyu Yao. Conceptual query expansion. In Proceedings of the Atlantic Web Intelligence Conference, 2005.

    Google Scholar 

  10. Orland Hoeber, Xue-Dong Yang, and Yiyu Yao. Visualization support for interactive query refinement. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, 2005.

    Google Scholar 

  11. A.K. Jain, M.N. Murty, and P.J. Flynn. Data clustering: A review. ACM Computing Surveys, 31(3), September 1999.

    Google Scholar 

  12. Martin Porter. An algorithm for suffix stripping. Program, 14(3), 1980.

    Google Scholar 

  13. S. E. Robertson and K. Sparck Jones. Simple proven approaches to text retrieval. Technical Report TR356, Cambridge University Computer Laboratory, 1997.

    Google Scholar 

  14. Craig Silverstein, Monika Henzinger, Hannes Marais, and Michael Moricz. Analysis of a very large web search engine query log. SIGIR Forum, 33(1), 1999.

    Google Scholar 

  15. Amanda Spink, Dietmar Wolfram, B. J. Jansen, and Tefko Saracevic. Searching the web: the public and their queries. Journal of the American Society for Information Science and Technology, 52(3), 2001.

    Google Scholar 

  16. Edward Tufte. Envisioning Information. Graphics Press, 1990.

    Google Scholar 

  17. Vivisimo. http://www.vivisimo.com/.

    Google Scholar 

  18. Colin Ware. Information Visualization: Perception for Design. Morgan Kaufmann, 2004.

    Google Scholar 

  19. Yiyu Yao. Information retrieval support systems. In Proceedings of the 2002 IEEE World Congress on Computational Intelligence, 2002.

    Google Scholar 

  20. Oren Zamir and Oren Etzioni. Web document clustering: A feasibility demonstration. In Proceedings of the ACM SIGIR. Conference on Research and Development in Information Retrieval, 1998.

    Google Scholar 

  21. Oren Zamir and Oren Etzioni. Grouper: A dynamic clustering interface to web search results. In Proceedings of the Eighth International World Wide Web Conference, 1999.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Hoeber, O., Yang, XD. (2006). Visually Exploring Concept-Based Fuzzy Clusters in Web Search Results. In: Last, M., Szczepaniak, P.S., Volkovich, Z., Kandel, A. (eds) Advances in Web Intelligence and Data Mining. Studies in Computational Intelligence, vol 23. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-33880-2_9

Download citation

  • DOI: https://doi.org/10.1007/3-540-33880-2_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-33879-6

  • Online ISBN: 978-3-540-33880-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics