Abstract
This paper presents an algorithm to improve a web search query based on the feedback on the viewed documents. A user who is searching for information on the Web marks the retrieved (viewed) documents as relevant or irrelevant to further expose the information needs expressed in the original query. A new web search query matching this improved understanding of the user’s information needs is synthesized from these text documents. The methodology provides a way for creating web search query that matches the user’s information need even when the user may have difficulty in doing so directly due to lack of experience in the query design or lack of familiarity of the search domain. A user survey has shown that the algorithmically formed query has recall coverage and precision characteristics better than those achieved by the experienced human web searchers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aho, A.V., Ullman, J.D.: Foundations of Computer Science. Computer Science Press, NY (1992)
Aula, A., Jhaveri, N., Kaki, M.: Information Seach and Re-Access Strategies of Experienced Web Users. In: The intl. World Wide Web (WWW2005) conf. ACM Press, New York (2005)
Baeza-Yates, R., Riberio-Neto, B.: Modern Information Retrieval. Addison-Wesley, Reading, Ma (1991)
Cohen, W.W.: Fast Effective Rule Induction. In: 12th Intl. Conf. on Machine Learning (1995)
Cohen, W.W., Singer, Y.: Learning to Query the Web. In: AAAI-96 Workshop on Internet-Based Information Systems, AAAI Press, Menlo Park, CA (1996)
Hölscher, C., Strube, G.: Web Search Behavior of Internet Experts and Newbies. In: Proc. of the 9th intl. World Wide Web conf. on Computer networks: the intl. journal of computer and telecommunications networking, North-Holland, Amsterdam (2000)
Jansen, B.J., Spink, A., Bateman, J., Saracevic, T.: Real Life Information Retrieval: A Study of User Queries on the Web. SIGIR Forum 32(1), 5–17 (1998)
Kopec, D., Marsland, T.A.: Search. The CRC Press, Inc (1997)
Malhotra, V., Patro, S., Johnson, D.: Synthesise Web Queries: Search the Web by Examples. In: 7th Intl Conf. on Enterprise Information Systems (ICEIS2005), vol. 2, INSTICC, Portugal (2005)
Oyama, S., Kokubo, T., Ishida, T.: Domain-Specific Web Search with Keyword Spices. IEEE Transaction on Knowledge and Data Engineering 16(1), 17–27 (2004)
Patro, S., Malhotra, V.: Characteristics of the Boolean Web Search Query: Estimating Success from Characteristics. In: 1st intl. conf. on web info. systems and technologies (WEBIST2005). INSTICC, Portugal (2005)
Patro, S.: Synthesising Web Search Queries from Example Text Documents. Master of Science Thesis, School of Computing, University of Tasmania, Launceston. (2006) http://www.eprints.comp.utas.edu.au
Ruthven, I., Lalmas, M.: A Survey on the Use of Relevance Feedback for Information Systems. Knowledge engineering Review 18(2), 95–145 (2003)
Sanchez, S.N., Triantaphyllou, E., Chen, J., Liao, T.W.: An Incremental Learning Algorithm for Constructing Boolean Functions from Positive and Negative Examples. Computers and Operations Research 29(12), 1677–1700 (2002)
Sebastiani, F.: Machine Learning in Automated Text Categorization. ACM Comp. Surveys 34(1), 1–47 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Patro, S., Malhotra, V., Johnson, D. (2007). An Algorithm to Use Feedback on Viewed Documents to Improve Web Query. In: Filipe, J., Cordeiro, J., Pedrosa, V. (eds) Web Information Systems and Technologies. Lecture Notes in Business Information Processing, vol 1. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74063-6_14
Download citation
DOI: https://doi.org/10.1007/978-3-540-74063-6_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74062-9
Online ISBN: 978-3-540-74063-6
eBook Packages: Computer ScienceComputer Science (R0)