Abstract
We hypothesize that the variance in volume of high-velocity queries over time can be explained by observing that these queries are formulated in response to events in the world that users are interested in. Based on it, this paper describes a system, Zed, which automatically finds explanations for high velocity queries, by extracting descriptions of relevant and temporally-proximate events from the news stream. Zed can thus provide a meaningful explanation of what the general public is interested in at any time. We evaluated performance of several variant methods on top velocity “celebrity name” queries from Yahoo, using news stories from several sources for event extraction. Results bear out the event-causation hypothesis, in that Zed currently finds acceptable event-based explanations for about 90% of the queries examined.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chien, S., Immorlica, N.: Semantic similarity between search engine queries using temporal correlation. In: Proc. WWW 2005, Chiba, Japan, pp. 2–11 (2005)
Radev, D., Blair-Goldensohn, S., Zhang, Z.: Experiments in single and multidocument summarization using MEAD. In: Proc. Document Understanding Conference (2001)
Radev, D.R., Blair-Goldensohn, S., Zhang, Z., Raghavan, R.S.: Newsinessence: a system for domain-independent, real-time news clustering and multi-document summarization. In: Proceedings of HLT 2001, pp. 1–4 (2001)
Filatova, E., Hatzivassiloglou, V.: Event-based extractive summarization. In: ACL Workshop on Summarization, Barcelona, Spain (2004)
Vanderwende, L., Banko, M., Menezes, A.: Event-centric summary generation. In: Proc. Document Understanding Conference at HLT-NAACL, Boston, MA (2004)
Saggion, H., Bontcheva, K., Cunningham, H.: Robust generic and query-based summarization. In: Proceedings of EACL 2003, pp. 235–238 (2003)
Amini, M.R.: Interactive learning for text summarization. In: Zighed, A.D.A., Komorowski, J., Żytkow, J.M. (eds.) PKDD 2000. LNCS (LNAI), vol. 1910, pp. 44–52. Springer, Heidelberg (2000)
Chowdhury, A., Beitzel, S., Jensen, E., Sai-lee, M., Grossman, D., Frieder, O., et al.: IIT TREC-9 - Entity Based Feedback with Fusion. TREC-9 (2000)
Robertson, S.E., Walker, S., Hancock-Beaulieu, M.: Experimentation as a way of life: Okapi at TREC. Information Processing and Management 36, 95–108 (2000)
Briscoe, E.J., Carroll, J.: Robust accurate statistical annotation of general text. In: Proceedings of LREC, pp. 1499–1504 (2002)
Cohen, J.: A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20, 37–46 (1960)
Maclure, M., Willett, W.: Misinterpretation and misuse of the kappa statistic. American Journal of Epidemiology 126, 161–169 (1987)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jiang, M., Argamon, S., Chowdhury, A., Sidhu, K. (2006). ZED: Explaining Temporal Variations in Query Volume. In: Li, X., Zaïane, O.R., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2006. Lecture Notes in Computer Science(), vol 4093. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811305_53
Download citation
DOI: https://doi.org/10.1007/11811305_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37025-3
Online ISBN: 978-3-540-37026-0
eBook Packages: Computer ScienceComputer Science (R0)