Abstract
Exploiting the cumulative behavior of users is a common technique for improving many popular online services. We build a tag spell checker using a graph-based model. In particular, we present a novel technique based on the graph of tags associated with objects made available by online sites such as Flickr and YouTube. Experiments on real-world data show the effectiveness of our approach, with a precision of up to 93% and a recall (i.e., the fraction of errors detected) of up to 100%.
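The abstract does not spell out how the tag graph is built or queried, so the following is only a minimal sketch of one plausible reading, not the authors' method. It assumes that two tags are linked when they annotate the same object, that a misspelling is rare relative to its correction, and that the two are within a small edit distance and share at least one neighboring tag. The function names, the `max_distance` and `min_ratio` thresholds, and the toy data are all hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a rolling one-row table."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def build_tag_graph(objects):
    """Return (tag frequency, adjacency); tags sharing an object are linked."""
    freq = Counter()
    neighbors = defaultdict(set)
    for tags in objects:
        unique = set(tags)
        freq.update(unique)
        for u, v in combinations(unique, 2):
            neighbors[u].add(v)
            neighbors[v].add(u)
    return freq, neighbors


def suggest_corrections(freq, neighbors, max_distance=1, min_ratio=2.0):
    """Map each suspected misspelling to a more frequent, similar tag.

    Assumed heuristic (hypothetical, not taken from the paper): a candidate
    must share a neighbor with the tag, be at least `min_ratio` times as
    frequent, and lie within `max_distance` edits.
    """
    corrections = {}
    for tag in freq:
        # Candidates are tags two hops away in the co-occurrence graph.
        candidates = set()
        for n in neighbors[tag]:
            candidates |= neighbors[n]
        candidates.discard(tag)
        best = None
        for cand in candidates:
            if freq[cand] < min_ratio * freq[tag]:
                continue
            if edit_distance(tag, cand) > max_distance:
                continue
            if best is None or freq[cand] > freq[best]:
                best = cand
        if best is not None:
            corrections[tag] = best
    return corrections


# Toy data: each inner list holds the tags attached to one object.
objects = [
    ["paris", "eiffel", "france"],
    ["paris", "france", "louvre"],
    ["paris", "eiffel"],
    ["pariss", "eiffel"],
]
freq, neighbors = build_tag_graph(objects)
corrections = suggest_corrections(freq, neighbors)
print(corrections)
```

On this toy input the rare tag `pariss` shares the neighbor `eiffel` with the frequent tag `paris` and differs from it by one edit, so it is the only tag flagged. Restricting candidates to graph neighbors, rather than comparing against a dictionary, is what lets such a scheme handle proper nouns and slang that a conventional spell checker would not know.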
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
Cite this paper
Nardini, F.M., Silvestri, F., Vahabi, H., Vahabi, P., Frieder, O. (2010). On Tag Spell Checking. In: Chavez, E., Lonardi, S. (eds) String Processing and Information Retrieval. SPIRE 2010. Lecture Notes in Computer Science, vol 6393. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16321-0_4
DOI: https://doi.org/10.1007/978-3-642-16321-0_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16320-3
Online ISBN: 978-3-642-16321-0
eBook Packages: Computer Science (R0)